158 Commits (73f7b5c755060c7c4f43b8770fb767b618e7cd7b)

Author SHA1 Message Date
  Aaron Parecki 19908117eb
set user-agent header for github requests 7 years ago
  Aaron Parecki f8e9a87667
parse github issues and comments 7 years ago
  Aaron Parecki 5f63ed7944
updates for instagram scraping 7 years ago
  Aaron Parecki ee7fa97654
skip parsing xkcd home page 7 years ago
  Aaron Parecki 63ab3031a3
parse XKCD comics 7 years ago
  Aaron Parecki 5f5392a7b8
deduplicate categories, and strip leading hashtags 7 years ago
  Aaron Parecki a1234f61e3
recognize h-card if it's the only object 7 years ago
  Aaron Parecki c255df7421
add swarm-coins to h-entry 7 years ago
  Aaron Parecki 4a4bc73f5e
don't include the RT'd photo or video in the main entry 8 years ago
  Aaron Parecki 345bed6075
fix for #26 8 years ago
  Aaron Parecki 5e60e13b5a
add h-recipe 8 years ago
  Aaron Parecki 5d8fb4e13c
support h-review and h-product vocab 8 years ago
  Aaron Parecki bc74919ade
return status code and final URL in response 8 years ago
  Aaron Parecki 693cb9d636
use p3k\timezone library 8 years ago
  Aaron Parecki 4a08c1fd2f
package for releasing to shared servers 8 years ago
  Aaron Parecki 876d4696fb catch non-expanded profile URLs 8 years ago
  Aaron Parecki 755fe8c222 fix positive timezones and case-insensitive username check 8 years ago
  Aaron Parecki ebea6869e1 set UTF-8 for mb_substr 8 years ago
  Aaron Parecki 0beac036b9 add twitter support 8 years ago
  Aaron Parecki db8dba9f23 include published date for Instagram photos 8 years ago
  Aaron Parecki 773252559d parse instagram photos and videos 8 years ago
  Aaron Parecki 2f9f80c4e6 remove unused function 8 years ago
  Aaron Parecki 3bdafad98e
parse URLs with fragment IDs 8 years ago
  Aaron Parecki 1a1215c0be
attempt to catch fatal errors and print a nice message 8 years ago
  Aaron Parecki a7780fb671
set connect timeout 8 years ago
  Aaron Parecki 565d50b862
add token fetching and authentication for posts 8 years ago
  Aaron Parecki 62697ee46b strict type checking on properties 8 years ago
  Aaron Parecki 1f6de10aba add tests for validating URL fields 8 years ago
  Aaron Parecki 5672004535 remove url param since it was not used 8 years ago
  Aaron Parecki 4a82561536 fix for h-event parsing 8 years ago
  Aaron Parecki 1aa2f01d94 convert hostnames to lowercase 8 years ago
  Aaron Parecki 138cddd158 also return audio property 8 years ago
  Aaron Parecki 6de9be2567 parse h-event 8 years ago
  Aaron Parecki ee5e48e1ef if there is exactly one item and it's an h-entry, use that 8 years ago
  Aaron Parecki 9054b0947c specific error when there is no content at the URL 8 years ago
  Aaron Parecki 1924d1000e add log messages to debug which case a URL is hitting 8 years ago
  Aaron Parecki b7f49a7958 fix should follow redirects check 8 years ago
  Aaron Parecki 8dc0caa4d0 use effective URL after following redirects when comparing URLs 8 years ago
  Aaron Parecki 162d2f5ef8 add tests for feeds, catch case when a permalink has other h-entrys 8 years ago
  Aaron Parecki e3000f8c06 better blacklist for google URLs 8 years ago
  Aaron Parecki c4b80506da support parsing posted HTML 8 years ago
  Aaron Parecki 8d1489bb72 fix for target param. include bookmark-of property 8 years ago
  Aaron Parecki 075f78a6c1 parse h-entry even if it's not the first objet 8 years ago
  Aaron Parecki d7672df96c allow ul/li/ol 8 years ago
  Aaron Parecki e3ff109b37 restrict matching mf2 classes to only lowercase names 8 years ago
  Aaron Parecki 66a9b1cc9e sanitize HTML in the entry 8 years ago
  Aaron Parecki 241594dcf5 sanitize HTML 8 years ago
  Aaron Parecki b9c9a6bddd fix for author parsing 8 years ago
  Aaron Parecki ac6d86c0db includes nested h-cite and other objects 8 years ago
  Aaron Parecki ed88b4881b use file_get_contents only for appengine URLs 8 years ago