Aaron Parecki
491f3796f8
also check for opening rss tag
closes #89
5 years ago
Aaron Parecki
f381cf30cf
drop extra request for profiles when fetching instagram photos
5 years ago
Aaron Parecki
b0b2fc1ee8
support featured images from events
closes #92
5 years ago
Aaron Parecki
7c48a0c1fb
return jsonfeed photos as arrays too
5 years ago
Aaron Parecki
5b2b4f3142
return original input URL for feed discovery when 302 is found
closes #86
5 years ago
Aaron Parecki
19126b5836
pass thru HTTP code and parse deleted pages
more consistently returns HTTP 410 now
5 years ago
Aaron Parecki
c5d417e87e
leave out placeholder alt text from instagram
6 years ago
Aaron Parecki
8fe9b70526
update instagram test data
6 years ago
Aaron Parecki
156fd62678
Include alt text from Instagram posts
This adds a new property, "meta", which includes alt text and has room to include additional properties later.
closes #85
6 years ago
sebsel
fdbeb4fdc1
add test with two h-cards before the h-entry
6 years ago
sebsel
db780a9789
create failing test: h-entry-with-h-card-before-it
6 years ago
Aaron Parecki
bed6efba29
jsonfeed: use feed info as author as fallback
6 years ago
Aaron Parecki
87b2cf10d8
add follow-of posts
replaces #78
6 years ago
Aaron Parecki
8043ba575f
improve authorship discovery
closes #79
6 years ago
Aaron Parecki
18dc92966b
recognize pattern of h-entry + h-card
* a single h-entry and h-card, where the h-entry has no URL, will result in a permalink page with that h-entry
* multiple h-entries followed by an h-card are treated as a feed
6 years ago
Aaron Parecki
9163341af2
normalize relative URLs in JSONFeed items
closes #77
6 years ago
Aaron Parecki
b618f9afc6
fix test for emoji
6 years ago
Aaron Parecki
470639f486
recognize h-event "content" in addition to "description"
6 years ago
Aaron Parecki
5418072704
don't use twitter bio URL as author URL
6 years ago
Aaron Parecki
43db6098fc
handle the case where the server returns multiple content-type headers
6 years ago
Aaron Parecki
707e750429
fix likes and reposts
6 years ago
Aaron Parecki
7252d5a3f4
also parse the object inside Create activities
6 years ago
Aaron Parecki
ca9c8c02ef
AS: parse likes and reposts
6 years ago
Aaron Parecki
85d973916f
support articles and summary
6 years ago
Aaron Parecki
7ee8240ebd
fix test for unknown parsed result
6 years ago
Aaron Parecki
d3e36038b2
parse basic ActivityStreams objects
including from rel=alternate
6 years ago
Aaron Parecki
154b7e874a
check for a rel=alternate to existing parsed mf2 JSON and use that instead
6 years ago
Aaron Parecki
7621bca4a6
adds new "source-format" property to indicate how XRay found the data
* mf2+html
* mf2+json
* feed+json
* xml
* instagram/facebook/github/xkcd
6 years ago
Aaron Parecki
38d307de1c
implements post type discovery
returns a new property `post-type` next to `type`
closes #25
6 years ago
Aaron Parecki
70f1576926
support twitter animated gifs
6 years ago
Aaron Parecki
6db2614afd
fix test for quotation-of
6 years ago
Aaron Parecki
112b75b623
parse quotation-of from HTML as well
closes #73
6 years ago
Aaron Parecki
01cce9b823
sends an Accept header when fetching posts
6 years ago
Aaron Parecki
e2780ba0a0
when interpreting JSON, don't require `value` for html values
6 years ago
Aaron Parecki
417cc1b3cc
parse redirect uri for h-app
parse from both link tags and the u-redirect-uri property
6 years ago
Aaron Parecki
fc74da5be9
add parser for instagram user feeds
6 years ago
Aaron Parecki
bb7821b94f
fix instagram tests
6 years ago
Aaron Parecki
2d19db0308
include instagram bio in h-card
6 years ago
Aaron Parecki
921d5262ea
also parse instagram profile URLs
6 years ago
Aaron Parecki
6f39655c8a
parse instagram user info from HTML instead of secret JSON API
adds script to refresh the downloaded instagram data for the tests as well
6 years ago
Aaron Parecki
c70b29479a
updates for instagram parsing
instagram seems to have rolled out the `graphql` key everywhere now
6 years ago
Aaron Parecki
25b6f85c14
use html5 parser and update php-mf2
6 years ago
Aaron Parecki
dc0557ddb9
update tests for fixed mf2 parsing
6 years ago
Aaron Parecki
4959ec15f2
remove duplicate url values
6 years ago
Aaron Parecki
8026279cba
fix tests for new mf2 parser
main difference is the deprecated rel handling
6 years ago
Aaron Parecki
a50cd6284b
fix whitespace handling for br tags in html
6 years ago
Aaron Parecki
c27f228314
include in-reply-to URL for tweets
6 years ago
Aaron Parecki
c68c7661c8
inspect content to determine if a page is atom or rss
closes #62
6 years ago
Aaron Parecki
cb1e32278d
convert newlines to <br> for html in tweets
6 years ago
Aaron Parecki
bf4bc3a668
extract photos and videos from streaming tweets when truncated
6 years ago