Aaron Parecki
|
154b7e874a
|
check for a rel=alternate to existing parsed mf2 JSON and use that instead
|
6 years ago |
Aaron Parecki
|
7621bca4a6
|
adds new "source-format" property to indicate how XRay found the data
* mf2+html
* mf2+json
* feed+json
* xml
* instagram/facebook/github/xkcd
|
6 years ago |
Aaron Parecki
|
b074d652e0
|
also accept application/xml as RSS feeds
|
6 years ago |
Aaron Parecki
|
38d307de1c
|
implements post type discovery
returns a new property `post-type` next to `type`
closes #25
|
6 years ago |
Aaron Parecki
|
70f1576926
|
support twitter animated gifs
|
6 years ago |
Aaron Parecki
|
e1600cc5bc
|
real fix for quotation-of
|
6 years ago |
Aaron Parecki
|
c4da480866
|
quotation-of should always be a single value
|
6 years ago |
Aaron Parecki
|
112b75b623
|
parse quotation-of from HTML as well
closes #73
|
6 years ago |
Aaron Parecki
|
01cce9b823
|
sends an Accept header when fetching posts
|
6 years ago |
Aaron Parecki
|
e2780ba0a0
|
when interpreting JSON, don't require `value` for html values
|
6 years ago |
Aaron Parecki
|
417cc1b3cc
|
parse redirect uri for h-app
parse from both link tags and the u-redirect-uri property
|
6 years ago |
Aaron Parecki
|
fc74da5be9
|
add parser for instagram user feeds
|
6 years ago |
Aaron Parecki
|
2d19db0308
|
include instagram bio in h-card
|
6 years ago |
Aaron Parecki
|
921d5262ea
|
also parse instagram profile URLs
|
6 years ago |
Aaron Parecki
|
6f39655c8a
|
parse instagram user info from HTML instead of secret JSON API
adds script to refresh the downloaded instagram data for the tests as well
|
6 years ago |
Aaron Parecki
|
c70b29479a
|
updates for instagram parsing
instagram seems to have rolled out the `graphql` key everywhere now
|
6 years ago |
Aaron Parecki
|
85c3a17934
|
whitespace cleanup
|
6 years ago |
Aaron Parecki
|
4959ec15f2
|
remove duplicate url values
|
6 years ago |
Aaron Parecki
|
8026279cba
|
fix tests for new mf2 parser
main difference is the deprecated rel handling
|
6 years ago |
Aaron Parecki
|
a50cd6284b
|
fix whitespace handling for br tags in html
|
6 years ago |
Aaron Parecki
|
c27f228314
|
include in-reply-to URL for tweets
|
6 years ago |
Aaron Parecki
|
c68c7661c8
|
inspect content to determine if a page is atom or rss
closes #62
|
6 years ago |
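Note on c68c7661c8: the commit message does not say how the content is inspected. One plausible approach is to look at the root element of the document, sketched below in Python. The function name `detect_feed_type` and the choice to check only the root tag are assumptions made for illustration, not XRay's actual code.

```python
import xml.etree.ElementTree as ET

# Namespace that Atom 1.0 documents declare on their root <feed> element.
ATOM_NS = "{http://www.w3.org/2005/Atom}"


def detect_feed_type(body: str):
    """Return "atom", "rss", or None based on the XML root element.

    Hypothetical helper: it ignores the Content-Type header entirely and
    decides from the document content alone.
    """
    try:
        root = ET.fromstring(body)
    except ET.ParseError:
        # Not well-formed XML, so it cannot be classified this way.
        return None
    if root.tag == ATOM_NS + "feed":
        return "atom"
    if root.tag == "rss":
        return "rss"
    return None
```

A check like this lets a generic type such as `text/xml` or `application/xml` be resolved to a specific feed format.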
Aaron Parecki
|
cb1e32278d
|
convert newlines to <br> for html in tweets
|
6 years ago |
Aaron Parecki
|
bf4bc3a668
|
extract photos and videos from streaming tweets when truncated
|
6 years ago |
Aaron Parecki
|
fb2fcec9c6
|
include HTML for tweets with links or user mentions
also expands parsing to be able to handle twitter JSON from the streaming API which is subtly different from the HTTP API.
closes #61
|
6 years ago |
Aaron Parecki
|
b995a1d3ee
|
whitespace
|
6 years ago |
Aaron Parecki
|
452accf6bf
|
include `quotation-of` property for quoted tweets
|
6 years ago |
Aaron Parecki
|
0decb9dcb4
|
return error info after finding feeds if available
|
6 years ago |
Aaron Parecki
|
c67dd9088d
|
bugfix
|
6 years ago |
Aaron Parecki
|
aba067234c
|
add h-x-app vocabulary
closes #13
|
6 years ago |
Aaron Parecki
|
171ca175f2
|
adds an option to process a parsed mf2 page
|
6 years ago |
Aaron Parecki
|
2515f618c7
|
include featured image for h-entry
closes #51
|
7 years ago |
Aaron Parecki
|
c376833f4c
|
fix for recipe parsing
|
7 years ago |
Aaron Parecki
|
4d65b1ca1e
|
if removing the img results in empty content, put the name value back
closes #57
|
7 years ago |
Aaron Parecki
|
150683e1a7
|
fix error when mailto links are encountered
|
7 years ago |
Aaron Parecki
|
66adfbe2f8
|
run name/content dedupe before munging HTML
fix for #53
|
7 years ago |
Aaron Parecki
|
bdedef6e1e
|
adds a bunch of broken tests for #52
|
7 years ago |
Aaron Parecki
|
b686349ded
|
remove duplicate code
use parseHTMLValue function for event description
|
7 years ago |
Aaron Parecki
|
d67e76f4b5
|
more strict match for XKCD comics
allows the XKCD XML feeds to be parsed with the feed parser
|
7 years ago |
Aaron Parecki
|
66f4a8b007
|
check for "type" property on alternates
|
7 years ago |
Aaron Parecki
|
70e9f60c42
|
update jsonfeed detection
|
7 years ago |
Aaron Parecki
|
a9b1001e62
|
switch to fork of picofeed with authorUrl support
* adds test of instagram-atom feed with individual authors per item
* dedupes atom/rss title if it's a prefix of the content
|
7 years ago |
Aaron Parecki
|
7872429f0c
|
prioritize url on the same domain
if an item has multiple URL values, return the one that is on the same domain
|
7 years ago |
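Note on 7872429f0c: the rule in this commit message can be sketched in Python as follows. The function name `pick_url` and the fallback to the first candidate when nothing matches are assumptions for illustration; the commit message only states the same-domain preference.

```python
from urllib.parse import urlsplit


def pick_url(candidates, page_url):
    """Prefer the candidate URL whose host matches the page being parsed.

    Hypothetical helper. Falling back to the first candidate is an
    assumption, not something the commit message specifies.
    """
    page_host = urlsplit(page_url).hostname
    for candidate in candidates:
        if urlsplit(candidate).hostname == page_host:
            return candidate
    return candidates[0] if candidates else None
```

For example, a post syndicated elsewhere may list both its original URL and the copy; this picks the one hosted on the domain that was fetched.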
Aaron Parecki
|
65d36a74de
|
always return arrays for photo and audio from XML feeds
|
7 years ago |
Aaron Parecki
|
12f27517f4
|
assume text/xml is an RSS feed
|
7 years ago |
Aaron Parecki
|
206e27ea25
|
add feed discovery API
|
7 years ago |
Aaron Parecki
|
85b8a35212
|
normalize URLs when comparing
Treats `https://example.com` and `https://example.com/` as equivalent when comparing URLs. Closes #33
|
7 years ago |
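Note on 85b8a35212: a minimal Python sketch of the comparison described in this commit message. The names `normalize_url` and `urls_equivalent` are hypothetical, and the sketch normalizes only the empty-path case the message mentions; any further normalization in the real code is not shown here.

```python
from urllib.parse import urlsplit, urlunsplit


def normalize_url(url: str) -> str:
    """Give a URL with an empty path the root path "/"."""
    parts = urlsplit(url)
    if parts.path == "":
        parts = parts._replace(path="/")
    return urlunsplit(parts)


def urls_equivalent(a: str, b: str) -> bool:
    """Compare two URLs after normalization."""
    return normalize_url(a) == normalize_url(b)
```

Under this sketch, `https://example.com` and `https://example.com/` compare equal, while `https://example.com/a` and `https://example.com/a/` are still treated as different URLs.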
Aaron Parecki
|
15743d411d
|
Find author when author is a property of the h-feed
closes #32
|
7 years ago |
Aaron Parecki
|
05f7d9c86c
|
implement h-feed and other microformats feed parsing
|
7 years ago |
Aaron Parecki
|
7b16371418
|
add basic support for JSONFeed
|
7 years ago |