Mirror of GitHub https://github.com/indieweb/indieweb-chat-archive

web 28572c3f4f logs as of Wed Mar 20 01:15:02 UTC 2019 1 minute ago
freenode 28572c3f4f logs as of Wed Mar 20 01:15:02 UTC 2019 1 minute ago
w3c 5b09d3c5fb logs as of Wed Mar 20 00:45:02 UTC 2019 31 minutes ago
.gitignore 3cf056c3cc 2011 1 year ago
README.md ddf6bf3fa2 Update README.md 11 months ago
clean-spam-from-nick.php 926139059d logs as of Wed Aug 22 20:15:02 UTC 2018 6 months ago
clean-spam.php a976111c9c logs as of Thu Aug 16 21:00:01 UTC 2018 7 months ago
first.json 68040702ff add json files 1 year ago
git-sync.sh 1748517b64 logs as of Fri Dec 1 21:45:01 UTC 2017 1 year ago
indieweb.json 36df509cf6 logs as of Tue Mar 19 09:30:02 UTC 2019 15 hours ago
spam-keywords.json 3496488358 logs as of Tue Mar 12 21:00:01 UTC 2019 1 week ago
w3c.json a645689e00 logs as of Fri Jan 4 09:45:02 UTC 2019 2 months ago

README.md

IndieWeb Chat Archive

This repo contains the full archive of IndieWeb chat log data files visible at https://chat.indieweb.org

Chat logs are added to this repo every 15 minutes.

File Format

Each channel's files can be read using QuartzDB. The files follow a simple format:

2017-12-01 23:15:06.218000 {"type":"message","timestamp":1512170106.218,"network":"irc","server":"freenode","channel":{"uid":"#indieweb","name":"#indieweb"},"author":{"uid":"Loqi","nickname":"Loqi","username":"Loqi","name":"Loqi","photo":null,"url":null,"tz":"US\/Pacific"},"content":"[@indiewebcamp] This week in the #indieweb https://indieweb.org/this-week/2017-12-01.html https://pbs.twimg.com/media/DP_z5rCVwAAGdTk.jpg (http://twtr.io/1Yx4r5CHSBC)","modes":[]}
  • Each line begins with the timestamp.
  • There will always be 26 characters followed by a space.
  • The timestamp is UTC and has 6 digits of precision for the seconds.
  • The rest of the line is a JSON-encoded string representing the IRC message and who sent it.

Spam removal

For a guide on how we deal with spam in these logs, see IRC#Spam on the wiki.