Aaron Parecki
|
ca9c8c02ef
|
AS: parse likes and reposts
|
7 years ago |
Aaron Parecki
|
85d973916f
|
support articles and summary
|
7 years ago |
Aaron Parecki
|
d3e36038b2
|
parse basic ActivityStreams objects
including from rel=alternate
|
7 years ago |
Aaron Parecki
|
154b7e874a
|
check for a rel=alternate to existing parsed mf2 JSON and use that instead
|
7 years ago |
Aaron Parecki
|
70f1576926
|
support twitter animated gifs
|
7 years ago |
Aaron Parecki
|
112b75b623
|
parse quotation-of from HTML as well
closes #73
|
7 years ago |
Aaron Parecki
|
417cc1b3cc
|
parse redirect uri for h-app
parse from both link tags and the u-redirect-uri property
|
8 years ago |
Aaron Parecki
|
6f39655c8a
|
parse instagram user info from HTML instead of secret JSON API
adds script to refresh the downloaded instagram data for the tests as well
|
8 years ago |
Aaron Parecki
|
c70b29479a
|
updates for instagram parsing
instagram seems to have rolled out the `graphql` key everywhere now
|
8 years ago |
Aaron Parecki
|
25b6f85c14
|
use html5 parser and update php-mf2
|
8 years ago |
Aaron Parecki
|
4959ec15f2
|
remove duplicate url values
|
8 years ago |
Aaron Parecki
|
8026279cba
|
fix tests for new mf2 parser
main difference is the deprecated rel handling
|
8 years ago |
Aaron Parecki
|
a50cd6284b
|
fix whitespace handling for br tags in html
|
8 years ago |
Aaron Parecki
|
c27f228314
|
include in-reply-to URL for tweets
|
8 years ago |
Aaron Parecki
|
c68c7661c8
|
inspect content to determine if a page is atom or rss
closes #62
|
8 years ago |
Aaron Parecki
|
cb1e32278d
|
convert newlines to <br> for html in tweets
|
8 years ago |
Aaron Parecki
|
bf4bc3a668
|
extract photos and videos from streaming tweets when truncated
|
8 years ago |
Aaron Parecki
|
fb2fcec9c6
|
include HTML for tweets with links or user mentions
also expands parsing to be able to handle twitter JSON from the streaming API which is subtly different from the HTTP API.
closes #61
|
8 years ago |
Aaron Parecki
|
584f34e1ed
|
add test from ascraeus.org which was causing an INTL error
|
8 years ago |
Aaron Parecki
|
2cc215d370
|
add .editorconfig to data folder
tells the editor to save data files with crlf needed for parsing the test http responses
|
8 years ago |
Aaron Parecki
|
aba067234c
|
add h-x-app vocabulary
closes #13
|
8 years ago |
Aaron Parecki
|
fe65def90f
|
comment out two tests until open mf2 parser issues are resolved
|
8 years ago |
Aaron Parecki
|
2515f618c7
|
include featured image for h-entry
closes #51
|
8 years ago |
Aaron Parecki
|
4d65b1ca1e
|
if removing the img results in empty content, put the name value back
closes #57
|
8 years ago |
Aaron Parecki
|
3ac38f9dbf
|
add simple case of Known markup
for #57
|
8 years ago |
Aaron Parecki
|
85c2b9b15f
|
add failing test for `p-content` containing a `u-photo`
|
8 years ago |
Aaron Parecki
|
44770396f9
|
add test to ensure a content property is not returned unless it is defined
|
8 years ago |
Aaron Parecki
|
bdedef6e1e
|
adds a bunch of broken tests for #52
|
8 years ago |
Aaron Parecki
|
a9b1001e62
|
switch to fork of picofeed with authorUrl support
* adds test of instagram-atom feed with individual authors per item
* dedupes atom/rss title if it's a prefix of the content
|
8 years ago |
Aaron Parecki
|
7872429f0c
|
prioritize url on the same domain
if an item has multiple URL values, return the one that is on the same domain
|
8 years ago |
Aaron Parecki
|
206e27ea25
|
add feed discovery API
|
8 years ago |
Aaron Parecki
|
85b8a35212
|
normalize URLs when comparing
Treats `https://example.com` and `https://example.com/` as equivalent when comparing URLs. Closes #33
|
8 years ago |
Aaron Parecki
|
15743d411d
|
Find author when author is a property of the h-feed
closes #32
|
8 years ago |
Aaron Parecki
|
05f7d9c86c
|
implement h-feed and other microformats feed parsing
|
8 years ago |
Aaron Parecki
|
7b16371418
|
add basic support for JSONFeed
|
8 years ago |
Aaron Parecki
|
e8e63caba6
|
implements parsing Atom and RSS feeds
|
8 years ago |
Aaron Parecki
|
a37ed3bbae
|
update to support multiple photos
uses the video's poster frame as the photo if any of the multi-post images are videos
|
8 years ago |
sebsel
|
6b286157e3
|
base tests on TwitterTest.php
|
8 years ago |
sebsel
|
67c159ec29
|
added tests
|
8 years ago |
Aaron Parecki
|
d50231142a
|
adds support for parsing checkins
checkin data is returned embedded like author data rather than in the `refs` object
closes #35
|
8 years ago |
Aaron Parecki
|
4fab3e9e0a
|
add test for HN comment
|
8 years ago |
Aaron Parecki
|
d0de523746
|
add hackernews support
closes #40
|
8 years ago |
Aaron Parecki
|
330bc9024d
|
fix parsing for hReview
thanks to the new backcompat in php-mf2 0.3.2
|
8 years ago |
Aaron Parecki
|
b76d72a77b
|
return issue labels as category
|
9 years ago |
Aaron Parecki
|
f8e9a87667
|
parse github issues and comments
closes #20
|
9 years ago |
Aaron Parecki
|
5f63ed7944
|
updates for instagram scraping
|
9 years ago |
Aaron Parecki
|
63ab3031a3
|
parse XKCD comics
skip image alt text for now
closes #34
|
9 years ago |
Aaron Parecki
|
5f5392a7b8
|
deduplicate categories, and strip leading hashtags
|
9 years ago |
Aaron Parecki
|
a1234f61e3
|
recognize h-card if it's the only object
closes #36
|
9 years ago |
Aaron Parecki
|
4a4bc73f5e
|
don't include the RT'd photo or video in the main entry
they are part of the reposted object instead
closes #27
|
9 years ago |