Aaron Parecki
f19b8fd7dd
refactor XKCD parsing
7 years ago
Aaron Parecki
6b65ae1b94
refactor for parsing Instagram and GitHub complete
7 years ago
Aaron Parecki
2f52eba556
rename Fetcher class, add stub Parser class
7 years ago
Aaron Parecki
4014da6dc7
moves fetching logic into a library class
7 years ago
Aaron Parecki
932cbedf45
refactor Rels class into library and controller
7 years ago
Aaron Parecki
5221cf79e9
get rid of global functions
moves XRay classes to `p3k\XRay` namespace
7 years ago
Aaron Parecki
35669270e3
reorganize XRay classes, use p3k-http lib
* removes the HTTP classes from this project and uses p3k-http library instead
* reorganizes the XRay classes into a psr-4 compatible folder
* moves controller autoload into -dev in preparation for turning this into a library (#17 )
7 years ago
Aaron Parecki
6733145b47
pass through http response code
7 years ago
Aaron Parecki
f8e9a87667
parse github issues and comments
closes #20
7 years ago
Aaron Parecki
94bd1acd75
set user agent header to get around some user agent sniffing
7 years ago
Aaron Parecki
ee7fa97654
skip parsing xkcd home page
7 years ago
Aaron Parecki
63ab3031a3
parse XKCD comics
skip image alt text for now
closes #34
7 years ago
Aaron Parecki
2796897a61
fix arrays
7 years ago
Aaron Parecki
09981cc8d2
add new endpoint to parse all rel values from a URL
Grabs rel values from both HTTP headers and link tags and merges them together, and resolves any relative values. Also returns the final URL requested and the HTTP response.
7 years ago
Aaron Parecki
8c488b41ea
add certbot configuration tool
7 years ago
Aaron Parecki
7d781c3129
match `http-equiv=Status` in addition to `status`
8 years ago
Aaron Parecki
7ef9d2c936
check for http-equiv for deleted posts when target URL provided
8 years ago
Aaron Parecki
de060f110f
return url and code with no link when target URL provided
8 years ago
Aaron Parecki
227311faa9
check for meta-equiv HTTP deleted
closes #16
8 years ago
Aaron Parecki
2d90d5fddd
move twitter logic to its own function
8 years ago
Aaron Parecki
bc74919ade
return status code and final URL in response
* closes #14
* updated readme with details of the response
* includes `url` and `code` in the response with the final URL after following redirects and the HTTP status code returned
8 years ago
Aaron Parecki
4a08c1fd2f
package for releasing to shared servers
* adds .htaccess files for apache
* adds installation instructions and nginx example
* prevent source folders from being viewed with .htaccess
* adds `index.php` in root folder in case you deploy this whole thing to a subfolder
* updates to work when installed in a subfolder
8 years ago
Aaron Parecki
755fe8c222
fix positive timezones and case-insensitive username check
8 years ago
Aaron Parecki
0beac036b9
add twitter support
closes #18
8 years ago
Aaron Parecki
773252559d
parse instagram photos and videos
8 years ago
Aaron Parecki
3bdafad98e
parse URLs with fragment IDs
If the input URL contains a fragment, finds the DOM tree at that ID and runs the subtree through the mf2 parser.
closes #15
8 years ago
Aaron Parecki
c59ab9a2d6
also check img/video/audio for target URL
8 years ago
Aaron Parecki
ac32522c25
should always return 200
8 years ago
Aaron Parecki
565d50b862
add token fetching and authentication for posts
8 years ago
Aaron Parecki
9006718ca5
add config option to enable caching
8 years ago
Aaron Parecki
bf6de4de06
only return HTTP 400 on client errors
errors fetching the URL should not result in a 400 response
8 years ago
Aaron Parecki
6de9be2567
parse h-event
closes #9
8 years ago
Aaron Parecki
76c45a9e19
only use memcache if class exists
8 years ago
Aaron Parecki
262d9a66fa
cache the HTTP response
8 years ago
Aaron Parecki
9054b0947c
specific error when there is no content at the URL
8 years ago
Aaron Parecki
8dc0caa4d0
use effective URL after following redirects when comparing URLs
8 years ago
Aaron Parecki
c4b80506da
support parsing posted HTML
8 years ago
Aaron Parecki
8d1489bb72
fix for target param. include bookmark-of property
8 years ago
Aaron Parecki
814df605b6
add query string param to pretty-print JSON output
8 years ago
Aaron Parecki
ac6d86c0db
includes nested h-cite and other objects
if a property such as `in-reply-to` is an h-cite, the URL is still returned as the `in-reply-to` value, and the h-cite object is available in a different part of the response.
closes #6
8 years ago
Aaron Parecki
097e999768
return type=unknown instead of error=no_content
8 years ago
Aaron Parecki
82931e46bc
switch to using file_get_contents for appengine
8 years ago
Aaron Parecki
7075254d56
add / to URL if it doesn't have a path
8 years ago
Aaron Parecki
7b54172b87
add slash if no path is given
8 years ago
Aaron Parecki
e9bc4bf450
rename to X-Ray
8 years ago
Aaron Parecki
0b35b74636
implement authorship discovery
* extracts mf2 post contents from pages
* implements authorship discovery to find author info for the URL
8 years ago
Aaron Parecki
9eecc31571
parse content and name from the entry
8 years ago
Aaron Parecki
13bb06d2c9
stub mf2 parsing
8 years ago
Aaron Parecki
85c3ce7b33
starting the parse function, with tests
8 years ago
Aaron Parecki
22a71fd7e9
empty project template
8 years ago