Aaron Parecki
aacac198a8
fix checking for empty post content
XRay now looks for images inside the HTML and does not consider those empty posts
4 years ago
Jamie Tanna
7f49c0381c
Add support for parsing MF2 JSON
To make it possible to consume feeds of pre-rendered Microformats2 JSON
data, we need to be able to discover that it is a Microformats2 JSON
file using the known MIME types documented in
http://microformats.org/wiki/microformats2-mime-type
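
A minimal sketch of the MIME-type check described above, assuming `application/mf2+json` is the type listed on that wiki page. The function name `is_mf2_json` and the `MF2_JSON_TYPES` set are hypothetical and are not taken from XRay's code:

```python
# Assumption: "application/mf2+json" is the MIME type documented on the wiki
# page; extend this set if the page lists further types.
MF2_JSON_TYPES = {"application/mf2+json"}


def is_mf2_json(content_type_header):
    """Return True if a Content-Type header value names an MF2 JSON type."""
    if not content_type_header:
        return False
    # Drop parameters such as "; charset=utf-8" and normalise case.
    media_type = content_type_header.split(";", 1)[0].strip().lower()
    return media_type in MF2_JSON_TYPES
```

For example, `is_mf2_json("application/mf2+json; charset=utf-8")` is true, while `is_mf2_json("text/html")` is false.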
4 years ago
Aaron Parecki
fb3af995a6
return rel-canonical if present
4 years ago
Aaron Parecki
989d42a85f
fix for target check for all formats
move target check outside of all format classes and operating on the final parsed JSON so that it works regardless of the input content type
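
The idea of checking the final parsed JSON rather than each input format could be sketched as below. This is an illustration only: the function name `mentions_target` is hypothetical, and the plain substring match is a simplifying assumption, not XRay's actual comparison logic.

```python
def mentions_target(parsed, target):
    """Return True if `target` occurs in any string value of `parsed`.

    `parsed` is the final parsed result (nested dicts, lists and strings),
    so the same check applies whatever the input content type was.
    """
    if isinstance(parsed, str):
        # Simplification: substring match, so a longer URL that merely
        # starts with `target` also counts.
        return target in parsed
    if isinstance(parsed, dict):
        return any(mentions_target(value, target) for value in parsed.values())
    if isinstance(parsed, list):
        return any(mentions_target(value, target) for value in parsed)
    # Numbers, booleans and None never match.
    return False
```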
5 years ago
Aaron Parecki
1213ee080d
include author for events
for https://github.com/aaronpk/Aperture/issues/56
5 years ago
Aaron Parecki
491f3796f8
also check for opening rss tag
closes #89
5 years ago
Aaron Parecki
f381cf30cf
drop extra request for profiles when fetching instagram photos
5 years ago
Aaron Parecki
b0b2fc1ee8
support featured images from events
closes #92
5 years ago
Aaron Parecki
7c48a0c1fb
return jsonfeed photos as arrays too
5 years ago
Aaron Parecki
5b2b4f3142
return original input URL for feed discovery when 302 is found
closes #86
5 years ago
Aaron Parecki
19126b5836
pass thru HTTP code and parse deleted pages
more consistently returns HTTP 410 now
5 years ago
Aaron Parecki
c5d417e87e
leave out placeholder alt text from instagram
5 years ago
Aaron Parecki
8fe9b70526
update instagram test data
5 years ago
Aaron Parecki
156fd62678
Include alt text from Instagram posts
This adds a new property, "meta", which includes alt text and has room to include additional properties later.
closes #85
5 years ago
sebsel
fdbeb4fdc1
add test with two h-cards before the h-entry
6 years ago
sebsel
db780a9789
create failing test: h-entry-with-h-card-before-it
6 years ago
Aaron Parecki
bed6efba29
jsonfeed: use feed info as author as fallback
6 years ago
Aaron Parecki
87b2cf10d8
add follow-of posts
replaces #78
6 years ago
Aaron Parecki
8043ba575f
improve authorship discovery
closes #79
6 years ago
Aaron Parecki
18dc92966b
recognize pattern of h-entry + h-card
* a single h-entry and h-card, where the h-entry has no URL, will result in a permalink page with that h-entry
* multiple h-entries followed by an h-card are treated as a feed
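
The two rules above can be sketched as follows. This is not XRay's implementation: `classify_page` is a hypothetical name, the input is assumed to be the list of top-level items from a Microformats2 parse, and returning "unknown" for every other layout is an assumption made for the example.

```python
def classify_page(items):
    """Classify top-level mf2 items as 'permalink', 'feed' or 'unknown'."""
    # Each mf2 item carries a list of types; only the first is considered.
    types = [(item.get("type") or [None])[0] for item in items]
    entries = [item for item, t in zip(items, types) if t == "h-entry"]
    cards = [item for item, t in zip(items, types) if t == "h-card"]

    # Both rules need exactly one h-card and nothing but h-entries otherwise.
    if len(cards) != 1 or len(entries) + len(cards) != len(items):
        return "unknown"

    # Rule 1: a single h-entry without a URL, next to an h-card.
    if len(entries) == 1 and not entries[0].get("properties", {}).get("url"):
        return "permalink"

    # Rule 2: several h-entries followed by the h-card.
    if len(entries) > 1 and types[-1] == "h-card":
        return "feed"

    return "unknown"
```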
6 years ago
Aaron Parecki
9163341af2
normalize relative URLs in JSONFeed items
closes #77
6 years ago
Aaron Parecki
b618f9afc6
fix test for emoji
6 years ago
Aaron Parecki
470639f486
recognize h-event "content" in addition to "description"
6 years ago
Aaron Parecki
5418072704
don't use twitter bio URL as author URL
6 years ago
Aaron Parecki
43db6098fc
handle the case where the server returns multiple content-type headers
6 years ago
Aaron Parecki
707e750429
fix likes and reposts
6 years ago
Aaron Parecki
7252d5a3f4
also parse the object inside Create activities
6 years ago
Aaron Parecki
ca9c8c02ef
AS: parse likes and reposts
6 years ago
Aaron Parecki
85d973916f
support articles and summary
6 years ago
Aaron Parecki
7ee8240ebd
fix test for unknown parsed result
6 years ago
Aaron Parecki
d3e36038b2
parse basic ActivityStreams objects
including from rel=alternate
6 years ago
Aaron Parecki
154b7e874a
check for a rel=alternate to existing parsed mf2 JSON and use that instead
6 years ago
Aaron Parecki
7621bca4a6
adds new "source-format" property to indicate how XRay found the data
* mf2+html
* mf2+json
* feed+json
* xml
* instagram/facebook/github/xkcd
6 years ago
Aaron Parecki
38d307de1c
implements post type discovery
returns a new property `post-type` next to `type`
closes #25
6 years ago
Aaron Parecki
70f1576926
support twitter animated gifs
6 years ago
Aaron Parecki
6db2614afd
fix test for quotation-of
6 years ago
Aaron Parecki
112b75b623
parse quotation-of from HTML as well
closes #73
6 years ago
Aaron Parecki
01cce9b823
sends an Accept header when fetching posts
6 years ago
Aaron Parecki
e2780ba0a0
when interpreting JSON, don't require `value` for html values
6 years ago
Aaron Parecki
417cc1b3cc
parse redirect uri for h-app
parse from both link tags and the u-redirect-uri property
6 years ago
Aaron Parecki
fc74da5be9
add parser for instagram user feeds
6 years ago
Aaron Parecki
bb7821b94f
fix instagram tests
6 years ago
Aaron Parecki
2d19db0308
include instagram bio in h-card
6 years ago
Aaron Parecki
921d5262ea
also parse instagram profile URLs
6 years ago
Aaron Parecki
6f39655c8a
parse instagram user info from HTML instead of secret JSON API
adds script to refresh the downloaded instagram data for the tests as well
6 years ago
Aaron Parecki
c70b29479a
updates for instagram parsing
instagram seems to have rolled out the `graphql` key everywhere now
6 years ago
Aaron Parecki
25b6f85c14
use html5 parser and update php-mf2
6 years ago
Aaron Parecki
dc0557ddb9
update tests for fixed mf2 parsing
6 years ago
Aaron Parecki
4959ec15f2
remove duplicate url values
6 years ago
Aaron Parecki
8026279cba
fix tests for new mf2 parser
main difference is the deprecated rel handling
6 years ago