Aaron Parecki
|
bf4bc3a668
|
extract photos and videos from streaming tweets when truncated
|
6 years ago |
Aaron Parecki
|
fb2fcec9c6
|
include HTML for tweets with links or user mentions
also expands parsing to be able to handle twitter JSON from the streaming API which is subtly different from the HTTP API.
closes #61
|
6 years ago |
Aaron Parecki
|
b995a1d3ee
|
whitespace
|
6 years ago |
Aaron Parecki
|
452accf6bf
|
include `quotation-of` property for quoted tweets
|
6 years ago |
Aaron Parecki
|
01ba3c0785
|
add note about supporting dev-master of htmlpurifier
|
6 years ago |
Aaron Parecki
|
73ad0e6e21
|
Merge pull request #60 from aaronpk/htmlpurifier-update
switch to htmlpurifier master
|
6 years ago |
Aaron Parecki
|
584f34e1ed
|
add test from ascraeus.org which was causing an INTL error
|
6 years ago |
Aaron Parecki
|
ba309e9cfe
|
add test for parsing a parsed mf2 object with html
|
6 years ago |
Aaron Parecki
|
6ce7f7c64b
|
fix autoloader for htmlpurifier
|
6 years ago |
Aaron Parecki
|
06bcbb3806
|
attempt to update htmlpurifier to latest commit
|
6 years ago |
Aaron Parecki
|
7e3d901995
|
test on more php versions
|
6 years ago |
Aaron Parecki
|
2cc215d370
|
add .editorconfig to data folder
tells the editor to save data files with crlf needed for parsing the test http responses
|
6 years ago |
Aaron Parecki
|
0decb9dcb4
|
return error info after finding feeds if available
|
6 years ago |
Aaron Parecki
|
c67dd9088d
|
bugfix
|
6 years ago |
Aaron Parecki
|
bf7f93f379
|
switch to p3k fork of picofeed
|
6 years ago |
Aaron Parecki
|
aba067234c
|
add h-x-app vocabulary
closes #13
|
6 years ago |
Aaron Parecki
|
171ca175f2
|
adds an option to process a parsed mf2 page
|
6 years ago |
Aaron Parecki
|
fe65def90f
|
comment out two tests until open mf2 parser issues are resolved
|
6 years ago |
Aaron Parecki
|
71bf274917
|
Merge branch 'remove-img-from-photo-posts'
|
6 years ago |
Aaron Parecki
|
2515f618c7
|
include featured image for h-entry
closes #51
|
6 years ago |
Aaron Parecki
|
c376833f4c
|
fix for recipe parsing
|
6 years ago |
Aaron Parecki
|
2fd563db0c
|
put the comment in the right spot
|
6 years ago |
Aaron Parecki
|
4d65b1ca1e
|
if removing the img results in empty content, put the name value back
closes #57
|
6 years ago |
Aaron Parecki
|
3ac38f9dbf
|
add simple case of Known markup
for #57
|
6 years ago |
Aaron Parecki
|
150683e1a7
|
fix error when mailto links are encountered
|
6 years ago |
Aaron Parecki
|
85c2b9b15f
|
add failing test for `p-content` containing an `u-photo`
|
6 years ago |
Aaron Parecki
|
66adfbe2f8
|
run name/content dedupe before munging HTML
fix for #53
|
6 years ago |
Aaron Parecki
|
44770396f9
|
add test to ensure a content property is not returned unless it is defined
|
6 years ago |
Aaron Parecki
|
bdedef6e1e
|
adds a bunch of broken tests for #52
|
6 years ago |
Aaron Parecki
|
b686349ded
|
remove duplicate code
use parseHTMLValue function for event description
|
6 years ago |
Aaron Parecki
|
2f0ba989c5
|
Merge pull request #49 from Zegnat/patch-1
Add $base to default config
|
6 years ago |
Martijn van der Ven
|
7ddddb3d07
|
Add $base to default config
|
6 years ago |
Aaron Parecki
|
d67e76f4b5
|
more strict match for XKCD comics
allows the XKCD XML feeds to be parsed with the feed parser
|
6 years ago |
Aaron Parecki
|
66f4a8b007
|
check for "type" property on alternates
|
7 years ago |
Aaron Parecki
|
36cd121ee1
|
update picofeed dependency
using fork of picofeed with 0.1.38 tag on aaronpk repo
|
7 years ago |
Aaron Parecki
|
70e9f60c42
|
update jsonfeed detection
|
7 years ago |
Aaron Parecki
|
a5f9376f09
|
allow use of libxml_disable_entity_loader in appengine
the zend security model runs libxml_disable_entity_loader to disable it so it's fine to allow
|
7 years ago |
Aaron Parecki
|
a9b1001e62
|
switch to fork of picofeed with authorUrl support
* adds test of instagram-atom feed with individual authors per item
* dedupes atom/rss title if it's a prefix of the content
|
7 years ago |
Aaron Parecki
|
7872429f0c
|
prioritize url on the same domain
if an item has multiple URL values, return the one that is on the same domain
|
7 years ago |
Aaron Parecki
|
65d36a74de
|
always return arrays for photo and audio from XML feeds
|
7 years ago |
Aaron Parecki
|
12f27517f4
|
assume text/xml is an RSS feed
|
7 years ago |
Aaron Parecki
|
c2a8ee5a05
|
feed discovery only takes 1 http request so adjust timeout
|
7 years ago |
Aaron Parecki
|
206e27ea25
|
add feed discovery API
|
7 years ago |
Aaron Parecki
|
796defb389
|
Merge branch 'sebsel-master'
|
7 years ago |
Aaron Parecki
|
8e5163e6d3
|
Merge branch 'master' of https://github.com/sebsel/XRay into sebsel-master
# Conflicts:
# README.md
# composer.json
# composer.lock
|
7 years ago |
Aaron Parecki
|
745c5d4656
|
update readme
includes info on feed parsing
|
7 years ago |
Aaron Parecki
|
85b8a35212
|
normalize URLs when comparing
Treats `https://example.com` and `https://example.com/` as equivalent when comparing URLs. Closes #33
|
7 years ago |
Aaron Parecki
|
15743d411d
|
Find author when author is a property of the h-feed
closes #32
|
7 years ago |
Aaron Parecki
|
05f7d9c86c
|
implement h-feed and other microformats feed parsing
|
7 years ago |
Aaron Parecki
|
7b16371418
|
add basic support for JSONFeed
|
7 years ago |