Aaron Parecki
2a3d7b4923
add in-reply-to for issue comments
8 years ago
Aaron Parecki
b76d72a77b
return issue labels as category
8 years ago
Aaron Parecki
fc41098477
fix
8 years ago
Aaron Parecki
6733145b47
pass through http response code
8 years ago
Aaron Parecki
19908117eb
set user-agent header for github requests
8 years ago
Aaron Parecki
f8e9a87667
parse github issues and comments
closes #20
8 years ago
Aaron Parecki
5f63ed7944
updates for instagram scraping
8 years ago
Aaron Parecki
ee7fa97654
skip parsing xkcd home page
8 years ago
Aaron Parecki
63ab3031a3
parse XKCD comics
skip image alt text for now
closes #34
8 years ago
Aaron Parecki
5f5392a7b8
deduplicate categories, and strip leading hashtags
8 years ago
Aaron Parecki
a1234f61e3
recognize h-card if it's the only object
closes #36
8 years ago
Aaron Parecki
c255df7421
add swarm-coins to h-entry
8 years ago
Aaron Parecki
4a4bc73f5e
don't include the RT'd photo or video in the main entry
they are part of the reposted object instead.
closes #27
9 years ago
Aaron Parecki
345bed6075
fix for #26
9 years ago
Aaron Parecki
5e60e13b5a
add h-recipe
closes #24
9 years ago
Aaron Parecki
5d8fb4e13c
support h-review and h-product vocab
* closes #23
* major refactor of the methods for extracting properties to consolidate the logic
* hReview parsing is incomplete due to issues with the php-mf2 backcompat parsing. see https://github.com/indieweb/php-mf2/issues/107
9 years ago
Aaron Parecki
bc74919ade
return status code and final URL in response
* closes #14
* updated readme with details of the response
* includes `url` and `code` in the response with the final URL after following redirects and the HTTP status code returned
9 years ago
Aaron Parecki
693cb9d636
use p3k\timezone library
9 years ago
Aaron Parecki
4a08c1fd2f
package for releasing to shared servers
* adds .htaccess files for apache
* adds installation instructions and nginx example
* prevent source folders from being viewed with .htaccess
* adds `index.php` in root folder in case you deploy this whole thing to a subfolder
* updates to work when installed in a subfolder
9 years ago
Aaron Parecki
876d4696fb
catch non-expanded profile URLs
apparently some people's profile URLs don't get t.co'd
9 years ago
Aaron Parecki
755fe8c222
fix positive timezones and case-insensitive username check
9 years ago
Aaron Parecki
ebea6869e1
set UTF-8 for mb_substr
attempting to fix in php 5.5
9 years ago
Aaron Parecki
0beac036b9
add twitter support
closes #18
9 years ago
Aaron Parecki
db8dba9f23
include published date for Instagram photos
if the photo has a location, the timezone is set on the published date
9 years ago
Aaron Parecki
773252559d
parse instagram photos and videos
9 years ago
Aaron Parecki
2f9f80c4e6
remove unused function
9 years ago
Aaron Parecki
3bdafad98e
parse URLs with fragment IDs
If the input URL contains a fragment, finds the DOM tree at that ID and runs the subtree through the mf2 parser.
closes #15
9 years ago
Aaron Parecki
1a1215c0be
attempt to catch fatal errors and print a nice message
9 years ago
Aaron Parecki
a7780fb671
set connect timeout
9 years ago
Aaron Parecki
565d50b862
add token fetching and authentication for posts
9 years ago
Aaron Parecki
62697ee46b
strict type checking on properties
9 years ago
Aaron Parecki
1f6de10aba
add tests for validating URL fields
* fields that should be URLs will now be omitted if the value was not a URL, such as when the value is `javascript:alert()`
* makes Mf2 class slightly more self-contained by duplicating the URL helper functions into it
* fixes tests to not cache responses in memcache
9 years ago
Aaron Parecki
5672004535
remove url param since it was not used
9 years ago
Aaron Parecki
4a82561536
fix for h-event parsing
9 years ago
Aaron Parecki
1aa2f01d94
convert hostnames to lowercase
9 years ago
Aaron Parecki
138cddd158
also return audio property
9 years ago
Aaron Parecki
6de9be2567
parse h-event
closes #9
9 years ago
Aaron Parecki
ee5e48e1ef
if there is exactly one item and it's an h-entry, use that
9 years ago
Aaron Parecki
9054b0947c
specific error when there is no content at the URL
9 years ago
Aaron Parecki
1924d1000e
add log messages to debug which case a URL is hitting
9 years ago
Aaron Parecki
b7f49a7958
fix should follow redirects check
10 years ago
Aaron Parecki
8dc0caa4d0
use effective URL after following redirects when comparing URLs
10 years ago
Aaron Parecki
162d2f5ef8
add tests for feeds, catch case when a permalink has other h-entrys
10 years ago
Aaron Parecki
e3000f8c06
better blacklist for google URLs
10 years ago
Aaron Parecki
c4b80506da
support parsing posted HTML
10 years ago
Aaron Parecki
8d1489bb72
fix for target param. include bookmark-of property
10 years ago
Aaron Parecki
075f78a6c1
parse h-entry even if it's not the first objet
10 years ago
Aaron Parecki
d7672df96c
allow ul/li/ol
10 years ago
Aaron Parecki
e3ff109b37
restrict matching mf2 classes to only lowercase names
see http://microformats.org/wiki/microformats2-parsing-issues#ignore_u-camelCase_properties for background
10 years ago
Aaron Parecki
66a9b1cc9e
sanitize HTML in the entry
allow only a basic set of tags, and remove any non-mf2 classes
closes #2
10 years ago