50 Commits (19908117eb310147261f09f4eb71185b435570e3)

Author SHA1 Message Date
  Aaron Parecki 19908117eb
set user-agent header for github requests 5 years ago
  Aaron Parecki f8e9a87667
parse github issues and comments 5 years ago
  Aaron Parecki 5f63ed7944
updates for instagram scraping 5 years ago
  Aaron Parecki ee7fa97654
skip parsing xkcd home page 5 years ago
  Aaron Parecki 63ab3031a3
parse XKCD comics 5 years ago
  Aaron Parecki 5f5392a7b8
deduplicate categories, and strip leading hashtags 5 years ago
  Aaron Parecki a1234f61e3
recognize h-card if it's the only object 5 years ago
  Aaron Parecki c255df7421
add swarm-coins to h-entry 5 years ago
  Aaron Parecki 4a4bc73f5e
don't include the RT'd photo or video in the main entry 5 years ago
  Aaron Parecki 345bed6075
fix for #26 5 years ago
  Aaron Parecki 5e60e13b5a
add h-recipe 5 years ago
  Aaron Parecki 5d8fb4e13c
support h-review and h-product vocab 5 years ago
  Aaron Parecki bc74919ade
return status code and final URL in response 5 years ago
  Aaron Parecki 876d4696fb catch non-expanded profile URLs 5 years ago
  Aaron Parecki 755fe8c222 fix positive timezones and case-insensitive username check 5 years ago
  Aaron Parecki ebea6869e1 set UTF-8 for mb_substr 5 years ago
  Aaron Parecki 0beac036b9 add twitter support 5 years ago
  Aaron Parecki db8dba9f23 include published date for Instagram photos 5 years ago
  Aaron Parecki 773252559d parse instagram photos and videos 5 years ago
  Aaron Parecki 2f9f80c4e6 remove unused function 5 years ago
  Aaron Parecki 62697ee46b strict type checking on properties 6 years ago
  Aaron Parecki 1f6de10aba add tests for validating URL fields 6 years ago
  Aaron Parecki 5672004535 remove url param since it was not used 6 years ago
  Aaron Parecki 4a82561536 fix for h-event parsing 6 years ago
  Aaron Parecki 138cddd158 also return audio property 6 years ago
  Aaron Parecki 6de9be2567 parse h-event 6 years ago
  Aaron Parecki ee5e48e1ef if there is exactly one item and it's an h-entry, use that 6 years ago
  Aaron Parecki 9054b0947c specific error when there is no content at the URL 6 years ago
  Aaron Parecki 1924d1000e add log messages to debug which case a URL is hitting 6 years ago
  Aaron Parecki 162d2f5ef8 add tests for feeds, catch case when a permalink has other h-entrys 6 years ago
  Aaron Parecki c4b80506da support parsing posted HTML 6 years ago
  Aaron Parecki 8d1489bb72 fix for target param. include bookmark-of property 6 years ago
  Aaron Parecki 075f78a6c1 parse h-entry even if it's not the first objet 6 years ago
  Aaron Parecki d7672df96c allow ul/li/ol 6 years ago
  Aaron Parecki e3ff109b37 restrict matching mf2 classes to only lowercase names 6 years ago
  Aaron Parecki 66a9b1cc9e sanitize HTML in the entry 6 years ago
  Aaron Parecki 241594dcf5 sanitize HTML 6 years ago
  Aaron Parecki b9c9a6bddd fix for author parsing 6 years ago
  Aaron Parecki ac6d86c0db includes nested h-cite and other objects 6 years ago
  Aaron Parecki 2924f35e0d fix tests for new HTTPStream 6 years ago
  Aaron Parecki 82931e46bc switch to using file_get_contents for appengine 6 years ago
  Aaron Parecki 7fafb51e92 add todo note for feeds 6 years ago
  Aaron Parecki 7075254d56 add / to URL if it doesn't have a path 6 years ago
  Aaron Parecki 0d96cb2832 also return matching url for h-cards 6 years ago
  Aaron Parecki fff43444f5 also return categories 6 years ago
  Aaron Parecki 69223cad1d return matching author url 6 years ago
  Aaron Parecki e9bc4bf450 rename to X-Ray 6 years ago
  Aaron Parecki 0b35b74636 implement authorship discovery 6 years ago
  Aaron Parecki 9eecc31571 parse content and name from the entry 6 years ago
  Aaron Parecki 13bb06d2c9 stub mf2 parsing 6 years ago