Aaron Parecki
85b8a35212
normalize URLs when comparing
Treats `https://example.com ` and `https://example.com/ ` as equivalent when comparing URLs. Closes #33
7年前
Aaron Parecki
15743d411d
Find author when author is a property of the h-feed
closes #32
7年前
Aaron Parecki
05f7d9c86c
implement h-feed and other microformats feed parsing
7年前
Aaron Parecki
7b16371418
add basic support for JSONFeed
7年前
Aaron Parecki
e8e63caba6
implements parsing Atom and RSS feeds
7年前
Aaron Parecki
a37ed3bbae
update to support multiple photos
uses the video's poster frame as the photo if any of the multi-post images are videos
7年前
sebsel
6b286157e3
based tests on TwitterTest.php
8年前
sebsel
67c159ec29
added tests
8年前
Aaron Parecki
d50231142a
adds support for parsing checkins
checkin data is returned embedded like author data rather than in the `refs` object
closes #35
8年前
Aaron Parecki
4fab3e9e0a
add test for HN comment
8年前
Aaron Parecki
d0de523746
add hackernews support
closes #40
8年前
Aaron Parecki
330bc9024d
fix parsing for hReview
thanks to the new backcompat in php-mf2 0.3.2
8年前
Aaron Parecki
b76d72a77b
return issue labels as category
8年前
Aaron Parecki
f8e9a87667
parse github issues and comments
closes #20
8年前
Aaron Parecki
5f63ed7944
updates for instagram scraping
8年前
Aaron Parecki
63ab3031a3
parse XKCD comics
skip image alt text for now
closes #34
8年前
Aaron Parecki
5f5392a7b8
deduplicate categories, and strip leading hashtags
8年前
Aaron Parecki
a1234f61e3
recognize h-card if it's the only object
closes #36
8年前
Aaron Parecki
4a4bc73f5e
don't include the RT'd photo or video in the main entry
they are part of the reposted object instead.
closes #27
8年前
Aaron Parecki
5e60e13b5a
add h-recipe
closes #24
8年前
Aaron Parecki
5d8fb4e13c
support h-review and h-product vocab
* closes #23
* major refactor of the methods for extracting properties to consolidate the logic
* hReview parsing is incomplete due to issues with the php-mf2 backcompat parsing. see https://github.com/indieweb/php-mf2/issues/107
8年前
Aaron Parecki
88a2c7f5bf
add test for syndication property
8年前
Aaron Parecki
7d781c3129
match `http-equiv=Status` in addition to `status`
8年前
Aaron Parecki
227311faa9
check for meta-equiv HTTP deleted
closes #16
8年前
Aaron Parecki
bc74919ade
return status code and final URL in response
* closes #14
* updated readme with details of the response
* includes `url` and `code` in the response with the final URL after following redirects and the HTTP status code returned
8年前
Aaron Parecki
876d4696fb
catch non-expanded profile URLs
apparently some people's profile URLs don't get t.co'd
8年前
Aaron Parecki
041cc92a8b
add test file
:headdesk:
8年前
Aaron Parecki
0beac036b9
add twitter support
closes #18
8年前
Aaron Parecki
773252559d
parse instagram photos and videos
8年前
Aaron Parecki
3bdafad98e
parse URLs with fragment IDs
If the input URL contains a fragment, finds the DOM tree at that ID and runs the subtree through the mf2 parser.
closes #15
8年前
Aaron Parecki
c59ab9a2d6
also check img/video/audio for target URL
8年前
Aaron Parecki
565d50b862
add token fetching and authentication for posts
8年前
Aaron Parecki
62697ee46b
strict type checking on properties
9年前
Aaron Parecki
1f6de10aba
add tests for validating URL fields
* fields that should be URLs will now be omitted if the value was not a URL, such as when the value is `javascript:alert()`
* makes Mf2 class slightly more self-contained by duplicating the URL helper functions into it
* fixes tests to not cache responses in memcache
9年前
Aaron Parecki
3edc01d1b7
add test for invitee
9年前
Aaron Parecki
6de9be2567
parse h-event
closes #9
9年前
Aaron Parecki
ee5e48e1ef
if there is exactly one item and it's an h-entry, use that
9年前
Aaron Parecki
2d52b982cb
fix test data
9年前
Aaron Parecki
8dc0caa4d0
use effective URL after following redirects when comparing URLs
9年前
Aaron Parecki
162d2f5ef8
add tests for feeds, catch case when a permalink has other h-entrys
9年前
Aaron Parecki
d1c6dc9268
add a test for rsvp
9年前
Aaron Parecki
4cb548fdfc
add test files illustrating a few different ways feeds (lists of posts) can appear
9年前
Aaron Parecki
075f78a6c1
parse h-entry even if it's not the first objet
9年前
Aaron Parecki
a202aa9c9a
tests for sanitizing and escaping HTML
use fork of php-mf2 until https://github.com/indieweb/php-mf2/pull/83 is merged
9年前
Aaron Parecki
d7672df96c
allow ul/li/ol
9年前
Aaron Parecki
e3ff109b37
restrict matching mf2 classes to only lowercase names
see http://microformats.org/wiki/microformats2-parsing-issues#ignore_u-camelCase_properties for background
9年前
Aaron Parecki
66a9b1cc9e
sanitize HTML in the entry
allow only a basic set of tags, and remove any non-mf2 classes
closes #2
9年前
Aaron Parecki
241594dcf5
sanitize HTML
sanitize the HTML returned in the content property. allows a common set of HTML tags.
for #2
9年前
Aaron Parecki
ac6d86c0db
includes nested h-cite and other objects
if a property such as `in-reply-to` is an h-cite, the URL is still returned as the `in-reply-to` value, and the h-cite object is available in a different part of the response.
closes #6
9年前
Aaron Parecki
097e999768
return type=unknown instead of error=no_content
9年前