Aaron Parecki
66adfbe2f8
run name/content dedupe before munging HTML
fix for #53
6 years ago
Aaron Parecki
bdedef6e1e
adds a bunch of broken tests for #52
6 years ago
Aaron Parecki
b686349ded
remove duplicate code
use parseHTMLValue function for event description
6 years ago
Aaron Parecki
d67e76f4b5
more strict match for XKCD comics
allows the XKCD XML feeds to be parsed with the feed parser
6 years ago
Aaron Parecki
66f4a8b007
check for "type" property on alternates
7 years ago
Aaron Parecki
70e9f60c42
update jsonfeed detection
7 years ago
Aaron Parecki
a9b1001e62
switch to fork of picofeed with authorUrl support
* adds test of instagram-atom feed with individual authors per item
* dedupes atom/rss title if it's a prefix of the content
7 years ago
Aaron Parecki
7872429f0c
prioritize url on the same domain
if an item has multiple URL values, return the one that is on the same domain
7 years ago
Aaron Parecki
65d36a74de
always return arrays for photo and audio from XML feeds
7 years ago
Aaron Parecki
12f27517f4
assume text/xml is an RSS feed
7 years ago
Aaron Parecki
206e27ea25
add feed discovery API
7 years ago
Aaron Parecki
85b8a35212
normalize URLs when comparing
Treats `https://example.com ` and `https://example.com/ ` as equivalent when comparing URLs. Closes #33
7 years ago
Aaron Parecki
15743d411d
Find author when author is a property of the h-feed
closes #32
7 years ago
Aaron Parecki
05f7d9c86c
implement h-feed and other microformats feed parsing
7 years ago
Aaron Parecki
7b16371418
add basic support for JSONFeed
7 years ago
Aaron Parecki
e8e63caba6
implements parsing Atom and RSS feeds
7 years ago
Aaron Parecki
a37ed3bbae
update to support multiple photos
uses the video's poster frame as the photo if any of the multi-post images are videos
7 years ago
Aaron Parecki
07d1df2cef
convert rsvp value to lowercase
according to http://microformats.org/wiki/h-entry the rsvp values should be interpreted case-insensitive
7 years ago
sebsel
6b286157e3
based tests on TwitterTest.php
7 years ago
sebsel
8c1ffe3f05
fix proper mf2 names, cast lon/lat as string, optional end, array notation
7 years ago
sebsel
b3c0813610
store location as h-card in refs
7 years ago
Aaron Parecki
a16b845af3
adds in-reply-to property for HN comments
7 years ago
Aaron Parecki
d50231142a
adds support for parsing checkins
checkin data is returned embedded like author data rather than in the `refs` object
closes #35
7 years ago
Aaron Parecki
0ccf34b656
use consistent pattern for fetching from silos
7 years ago
Aaron Parecki
4fab3e9e0a
add test for HN comment
7 years ago
Aaron Parecki
d0de523746
add hackernews support
closes #40
7 years ago
sebsel
12f0bebc8a
add Facebook events
7 years ago
Aaron Parecki
236c3f5f9c
include version info in response header
7 years ago
Aaron Parecki
330bc9024d
fix parsing for hReview
thanks to the new backcompat in php-mf2 0.3.2
7 years ago
Aaron Parecki
78e3e16592
finishes the refactor!
7 years ago
Aaron Parecki
01b53edc95
refactor Twitter parser
7 years ago
Aaron Parecki
f19b8fd7dd
refactor XKCD parsing
7 years ago
Aaron Parecki
6b65ae1b94
refactor for parsing Instagram and GitHub complete
7 years ago
Aaron Parecki
2f52eba556
rename Fetcher class, add stub Parser class
7 years ago
Aaron Parecki
4014da6dc7
moves fetching logic into a library class
7 years ago
Aaron Parecki
932cbedf45
refactor Rels class into library and controller
7 years ago
Aaron Parecki
00dbc3dae1
relicense under MIT
7 years ago
Aaron Parecki
5221cf79e9
get rid of global functions
moves XRay classes to `p3k\XRay` namespace
7 years ago
Aaron Parecki
35669270e3
reorganize XRay classes, use p3k-http lib
* removes the HTTP classes from this project and uses p3k-http library instead
* reorganizes the XRay classes into a psr-4 compatible folder
* moves controller autoload into -dev in preparation for turning this into a library (#17 )
7 years ago
Aaron Parecki
2a3d7b4923
add in-reply-to for issue comments
7 years ago
Aaron Parecki
b76d72a77b
return issue labels as category
7 years ago
Aaron Parecki
fc41098477
fix
7 years ago
Aaron Parecki
6733145b47
pass through http response code
7 years ago
Aaron Parecki
19908117eb
set user-agent header for github requests
7 years ago
Aaron Parecki
f8e9a87667
parse github issues and comments
closes #20
7 years ago
Aaron Parecki
5f63ed7944
updates for instagram scraping
7 years ago
Aaron Parecki
ee7fa97654
skip parsing xkcd home page
7 years ago
Aaron Parecki
63ab3031a3
parse XKCD comics
skip image alt text for now
closes #34
7 years ago
Aaron Parecki
5f5392a7b8
deduplicate categories, and strip leading hashtags
7 years ago
Aaron Parecki
a1234f61e3
recognize h-card if it's the only object
closes #36
7 years ago