Aaron Parecki
|
921d5262ea
|
also parse instagram profile URLs
|
6 years ago |
Aaron Parecki
|
6f39655c8a
|
parse instagram user info from HTML instead of secret JSON API
adds script to refresh the downloaded instagram data for the tests as well
|
6 years ago |
Aaron Parecki
|
c70b29479a
|
updates for instagram parsing
instagram seems to have rolled out the `graphql` key everywhere now
|
6 years ago |
Aaron Parecki
|
85c3a17934
|
whitespace cleanup
|
6 years ago |
Aaron Parecki
|
4959ec15f2
|
remove duplicate url values
|
6 years ago |
Aaron Parecki
|
8026279cba
|
fix tests for new mf2 parser
main difference is the deprecated rel handling
|
6 years ago |
Aaron Parecki
|
a50cd6284b
|
fix whitespace handling for br tags in html
|
6 years ago |
Aaron Parecki
|
c27f228314
|
include in-reply-to URL for tweets
|
6 years ago |
Aaron Parecki
|
c68c7661c8
|
inspect content to determine if a page is atom or rss
closes #62
|
6 years ago |
Aaron Parecki
|
cb1e32278d
|
convert newlines to <br> for html in tweets
|
6 years ago |
Aaron Parecki
|
bf4bc3a668
|
extract photos and videos from streaming tweets when truncated
|
6 years ago |
Aaron Parecki
|
fb2fcec9c6
|
include HTML for tweets with links or user mentions
also expands parsing to be able to handle twitter JSON from the streaming API which is subtly different from the HTTP API.
closes #61
|
6 years ago |
Aaron Parecki
|
b995a1d3ee
|
whitespace
|
6 years ago |
Aaron Parecki
|
452accf6bf
|
include `quotation-of` property for quoted tweets
|
6 years ago |
Aaron Parecki
|
0decb9dcb4
|
return error info after finding feeds if available
|
6 years ago |
Aaron Parecki
|
c67dd9088d
|
bugfix
|
6 years ago |
Aaron Parecki
|
aba067234c
|
add h-x-app vocabulary
closes #13
|
6 years ago |
Aaron Parecki
|
171ca175f2
|
adds an option to process a parsed mf2 page
|
6 years ago |
Aaron Parecki
|
2515f618c7
|
include featured image for h-entry
closes #51
|
6 years ago |
Aaron Parecki
|
c376833f4c
|
fix for recipe parsing
|
6 years ago |
Aaron Parecki
|
4d65b1ca1e
|
if removing the img results in empty content, put the name value back
closes #57
|
6 years ago |
Aaron Parecki
|
150683e1a7
|
fix error when mailto links are encountered
|
6 years ago |
Aaron Parecki
|
66adfbe2f8
|
run name/content dedupe before munging HTML
fix for #53
|
6 years ago |
Aaron Parecki
|
bdedef6e1e
|
adds a bunch of broken tests for #52
|
6 years ago |
Aaron Parecki
|
b686349ded
|
remove duplicate code
use parseHTMLValue function for event description
|
6 years ago |
Aaron Parecki
|
d67e76f4b5
|
more strict match for XKCD comics
allows the XKCD XML feeds to be parsed with the feed parser
|
7 years ago |
Aaron Parecki
|
66f4a8b007
|
check for "type" property on alternates
|
7 years ago |
Aaron Parecki
|
70e9f60c42
|
update jsonfeed detection
|
7 years ago |
Aaron Parecki
|
a9b1001e62
|
switch to fork of picofeed with authorUrl support
* adds test of instagram-atom feed with individual authors per item
* dedupes atom/rss title if it's a prefix of the content
|
7 years ago |
Aaron Parecki
|
7872429f0c
|
prioritize url on the same domain
if an item has multiple URL values, return the one that is on the same domain
|
7 years ago |
Aaron Parecki
|
65d36a74de
|
always return arrays for photo and audio from XML feeds
|
7 years ago |
Aaron Parecki
|
12f27517f4
|
assume text/xml is an RSS feed
|
7 years ago |
Aaron Parecki
|
206e27ea25
|
add feed discovery API
|
7 years ago |
Aaron Parecki
|
85b8a35212
|
normalize URLs when comparing
Treats `https://example.com` and `https://example.com/` as equivalent when comparing URLs. Closes #33
|
7 years ago |
Aaron Parecki
|
15743d411d
|
Find author when author is a property of the h-feed
closes #32
|
7 years ago |
Aaron Parecki
|
05f7d9c86c
|
implement h-feed and other microformats feed parsing
|
7 years ago |
Aaron Parecki
|
7b16371418
|
add basic support for JSONFeed
|
7 years ago |
Aaron Parecki
|
e8e63caba6
|
implements parsing Atom and RSS feeds
|
7 years ago |
Aaron Parecki
|
a37ed3bbae
|
update to support multiple photos
uses the video's poster frame as the photo if any of the multi-post images are videos
|
7 years ago |
Aaron Parecki
|
07d1df2cef
|
convert rsvp value to lowercase
according to http://microformats.org/wiki/h-entry the rsvp values should be interpreted case-insensitive
|
7 years ago |
sebsel
|
6b286157e3
|
based tests on TwitterTest.php
|
7 years ago |
sebsel
|
8c1ffe3f05
|
fix proper mf2 names, cast lon/lat as string, optional end, array notation
|
7 years ago |
sebsel
|
b3c0813610
|
store location as h-card in refs
|
7 years ago |
Aaron Parecki
|
a16b845af3
|
adds in-reply-to property for HN comments
|
7 years ago |
Aaron Parecki
|
d50231142a
|
adds support for parsing checkins
checkin data is returned embedded like author data rather than in the `refs` object
closes #35
|
7 years ago |
Aaron Parecki
|
0ccf34b656
|
use consistent pattern for fetching from silos
|
7 years ago |
Aaron Parecki
|
4fab3e9e0a
|
add test for HN comment
|
7 years ago |
Aaron Parecki
|
d0de523746
|
add hackernews support
closes #40
|
7 years ago |
sebsel
|
12f0bebc8a
|
add Facebook events
|
7 years ago |
Aaron Parecki
|
236c3f5f9c
|
include version info in response header
|
7 years ago |