65 Commits (19908117eb310147261f09f4eb71185b435570e3)

Author SHA1 Message Date
  Aaron Parecki 19908117eb
set user-agent header for github requests 4 years ago
  Aaron Parecki f8e9a87667
parse github issues and comments 4 years ago
  Aaron Parecki 5f63ed7944
updates for instagram scraping 4 years ago
  Aaron Parecki ee7fa97654
skip parsing xkcd home page 4 years ago
  Aaron Parecki 63ab3031a3
parse XKCD comics 4 years ago
  Aaron Parecki 5f5392a7b8
deduplicate categories, and strip leading hashtags 4 years ago
  Aaron Parecki a1234f61e3
recognize h-card if it's the only object 4 years ago
  Aaron Parecki c255df7421
add swarm-coins to h-entry 4 years ago
  Aaron Parecki 4a4bc73f5e
don't include the RT'd photo or video in the main entry 5 years ago
  Aaron Parecki 345bed6075
fix for #26 5 years ago
  Aaron Parecki 5e60e13b5a
add h-recipe 5 years ago
  Aaron Parecki 5d8fb4e13c
support h-review and h-product vocab 5 years ago
  Aaron Parecki bc74919ade
return status code and final URL in response 5 years ago
  Aaron Parecki 693cb9d636
use p3k\timezone library 5 years ago
  Aaron Parecki 4a08c1fd2f
package for releasing to shared servers 5 years ago
  Aaron Parecki 876d4696fb catch non-expanded profile URLs 5 years ago
  Aaron Parecki 755fe8c222 fix positive timezones and case-insensitive username check 5 years ago
  Aaron Parecki ebea6869e1 set UTF-8 for mb_substr 5 years ago
  Aaron Parecki 0beac036b9 add twitter support 5 years ago
  Aaron Parecki db8dba9f23 include published date for Instagram photos 5 years ago
  Aaron Parecki 773252559d parse instagram photos and videos 5 years ago
  Aaron Parecki 2f9f80c4e6 remove unused function 5 years ago
  Aaron Parecki 3bdafad98e
parse URLs with fragment IDs 5 years ago
  Aaron Parecki 1a1215c0be
attempt to catch fatal errors and print a nice message 5 years ago
  Aaron Parecki a7780fb671
set connect timeout 5 years ago
  Aaron Parecki 565d50b862
add token fetching and authentication for posts 5 years ago
  Aaron Parecki 62697ee46b strict type checking on properties 5 years ago
  Aaron Parecki 1f6de10aba add tests for validating URL fields 5 years ago
  Aaron Parecki 5672004535 remove url param since it was not used 5 years ago
  Aaron Parecki 4a82561536 fix for h-event parsing 5 years ago
  Aaron Parecki 1aa2f01d94 convert hostnames to lowercase 5 years ago
  Aaron Parecki 138cddd158 also return audio property 5 years ago
  Aaron Parecki 6de9be2567 parse h-event 5 years ago
  Aaron Parecki ee5e48e1ef if there is exactly one item and it's an h-entry, use that 5 years ago
  Aaron Parecki 9054b0947c specific error when there is no content at the URL 5 years ago
  Aaron Parecki 1924d1000e add log messages to debug which case a URL is hitting 5 years ago
  Aaron Parecki b7f49a7958 fix should follow redirects check 5 years ago
  Aaron Parecki 8dc0caa4d0 use effective URL after following redirects when comparing URLs 5 years ago
  Aaron Parecki 162d2f5ef8 add tests for feeds, catch case when a permalink has other h-entrys 5 years ago
  Aaron Parecki e3000f8c06 better blacklist for google URLs 5 years ago
  Aaron Parecki c4b80506da support parsing posted HTML 5 years ago
  Aaron Parecki 8d1489bb72 fix for target param. include bookmark-of property 5 years ago
  Aaron Parecki 075f78a6c1 parse h-entry even if it's not the first objet 5 years ago
  Aaron Parecki d7672df96c allow ul/li/ol 5 years ago
  Aaron Parecki e3ff109b37 restrict matching mf2 classes to only lowercase names 5 years ago
  Aaron Parecki 66a9b1cc9e sanitize HTML in the entry 5 years ago
  Aaron Parecki 241594dcf5 sanitize HTML 5 years ago
  Aaron Parecki b9c9a6bddd fix for author parsing 5 years ago
  Aaron Parecki ac6d86c0db includes nested h-cite and other objects 5 years ago
  Aaron Parecki ed88b4881b use file_get_contents only for appengine URLs 5 years ago