Aaron Parecki
6733145b47
pass through http response code
8 years ago
Aaron Parecki
f8e9a87667
parse github issues and comments
closes #20
8 years ago
Aaron Parecki
94bd1acd75
set user agent header to get around some user agent sniffing
8 years ago
Aaron Parecki
ee7fa97654
skip parsing xkcd home page
8 years ago
Aaron Parecki
63ab3031a3
parse XKCD comics
skip image alt text for now
closes #34
8 years ago
Aaron Parecki
7d781c3129
match `http-equiv=Status` in addition to `status`
9 years ago
Aaron Parecki
7ef9d2c936
check for http-equiv for deleted posts when target URL provided
9 years ago
Aaron Parecki
de060f110f
return url and code with no link when target URL provided
9 years ago
Aaron Parecki
227311faa9
check for meta-equiv HTTP deleted
closes #16
9 years ago
Aaron Parecki
2d90d5fddd
move twitter logic to its own function
9 years ago
Aaron Parecki
bc74919ade
return status code and final URL in response
* closes #14
* updated readme with details of the response
* includes `url` and `code` in the response with the final URL after following redirects and the HTTP status code returned
9 years ago
Aaron Parecki
755fe8c222
fix positive timezones and case-insensitive username check
9 years ago
Aaron Parecki
0beac036b9
add twitter support
closes #18
9 years ago
Aaron Parecki
773252559d
parse instagram photos and videos
9 years ago
Aaron Parecki
3bdafad98e
parse URLs with fragment IDs
If the input URL contains a fragment, finds the DOM tree at that ID and runs the subtree through the mf2 parser.
closes #15
9 years ago
Aaron Parecki
c59ab9a2d6
also check img/video/audio for target URL
9 years ago
Aaron Parecki
565d50b862
add token fetching and authentication for posts
9 years ago
Aaron Parecki
9006718ca5
add config option to enable caching
9 years ago
Aaron Parecki
bf6de4de06
only return HTTP 400 on client errors
errors fetching the URL should not result in a 400 response
9 years ago
Aaron Parecki
6de9be2567
parse h-event
closes #9
9 years ago
Aaron Parecki
76c45a9e19
only use memcache if class exists
9 years ago
Aaron Parecki
262d9a66fa
cache the HTTP response
9 years ago
Aaron Parecki
9054b0947c
specific error when there is no content at the URL
9 years ago
Aaron Parecki
8dc0caa4d0
use effective URL after following redirects when comparing URLs
9 years ago
Aaron Parecki
c4b80506da
support parsing posted HTML
9 years ago
Aaron Parecki
8d1489bb72
fix for target param. include bookmark-of property
9 years ago
Aaron Parecki
814df605b6
add query string param to pretty-print JSON output
9 years ago
Aaron Parecki
ac6d86c0db
includes nested h-cite and other objects
if a property such as `in-reply-to` is an h-cite, the URL is still returned as the `in-reply-to` value, and the h-cite object is available in a different part of the response.
closes #6
9 years ago
Aaron Parecki
097e999768
return type=unknown instead of error=no_content
9 years ago
Aaron Parecki
82931e46bc
switch to using file_get_contents for appengine
9 years ago
Aaron Parecki
7075254d56
add / to URL if it doesn't have a path
9 years ago
Aaron Parecki
7b54172b87
add slash if no path is given
9 years ago
Aaron Parecki
e9bc4bf450
rename to X-Ray
9 years ago
Aaron Parecki
0b35b74636
implement authorship discovery
* extracts mf2 post contents from pages
* implements authorship discovery to find author info for the URL
9 years ago
Aaron Parecki
9eecc31571
parse content and name from the entry
10 years ago
Aaron Parecki
13bb06d2c9
stub mf2 parsing
10 years ago
Aaron Parecki
85c3ce7b33
starting the parse function, with tests
10 years ago