brin-bellway:

captainneverever:

Now that you’ve downloaded your blog and are waiting for the next step, what next?

Help out the Internet Archive (aka Wayback Machine) scrape tumblr for all the blogs!

Check out tracker.archiveteam.org/tumblr to see the progress so far.

The ArchiveTeamWarrior needs an internet connection and some space on your device. They want to save as much as they can.

(Clarification: the Internet Archive is not *running* this project–ArchiveTeam is a separate entity–but they *will* be hosting the results.)

(hat-tip @sophia-epistemia)


Tags:

#morning reblog #signal boost #101 Uses for Infrastructureless Computers #The Great Tumblr Apocalypse #The Last Tumblr Apocalypse

Tumblr tracker Dashboard

{{Title link: http://tracker.archiveteam.org/tumblr/ }}

nightpool:

Hey everybody, the ArchiveTeam tumblr project is up and running!

If you have resources, please install Archiveteam’s warrior program to contribute to the project! It’s very easy to set up on and install on any computer, there are step by step instructions at http://tracker.archiveteam.org/tumblr/

We’re already up to 11TB and 187 million blogs archived, but we’re going to need a lot more help to get all the NSFW content before the 17th!

main project discussion takes place irc at #tumbledown on efnet, and you can add blogs to be saved using this google form: https://goo.gl/RtXZEq

Where did you get the 187-million figure from? It makes sense that the ~65k figure on the tracker would just be the sex blogs, since all of the blogs I’ve seen go by on it have been sex blogs, but I didn’t see any information regarding non-sex blogs.

I have unlimited Internet, cheap electricity, and a cool climate, so I’m in a pretty good position for (small-scale) volunteer computing. I’ve been running a warrior for a couple days now. I’ve been leaving my laptop on overnight because if I’m interpreting the instructions right, you only get to pause a task for a few hours before it’s considered abandoned and re-assigned, and I didn’t want to lose work. (especially since my current task has been 22 hours and counting; some of these blogs are pretty big)

I think I’ll continue helping out with their other projects once this one is finished: archiving is (as anyone reading this blog has probably noticed) a pet cause of mine. Since it mostly just needs bandwidth and doesn’t take much CPU, I can even run it and World Community Grid at the same time without problems (anti-disease efforts are my other pet cause).


Tags:

#reply via reblog #signal boost #101 Uses for Infrastructureless Computers #The Great Tumblr Apocalypse #The Last Tumblr Apocalypse


{{next post in sequence}}

brin-bellway asked: Do you know of any good ways to backup a DW blog? So far, I have investigated: built-in exporter (doesn’t include comments); wget (doesn’t include access-locked posts); LJMigrate (gives an HTTP 307 error, which I have no idea how to deal with); most other tools on the list of DW-compatible LJ archivers (aren’t available at all anymore); printing every post to PDF and re-printing the relevant post with every new comment (severe, ongoing tedium).

{{previous post in sequence}}


farfromdaylight:

dreamwidth-help:

I’m an oldie who used to use Semagic but I haven’t done a backup in a while and I believe Semagic doesn’t work anymore. Let me pitch this to the crowd.

as far as i know there’s no great way to do it right now, though I would ask over on DW, they would have a better idea. iirc they do intend to build a native backup tool in the future but I probably read that in, like 2013, so it’s worth asking about again.

Wait they have a native exporter now??? Holy crap I had no idea that was a thing. The fact that it’s CSV/XML sucks but dang I’ll take it over nothing. Thanks @brin-bellway, this is gonna come in super handy for me.

Anyway as far as comments go I actually just use my email as an archive. Not ideal but it’s better than nothing. (You can also get your own comments emailed to you.) There might be a tool that does still work with DW but if there is I don’t know it, unfortunately.

I appreciate the effort, but I think we cross-posted. I just figured out how to fix the access-lock problem with wget [link].

I hope you find it handy too! :)


Tags:

#reply via reblog #Dreamwidth #101 Uses for Infrastructureless Computers


{{next post in sequence}}

brin-bellway asked: Do you know of any good ways to backup a DW blog? So far, I have investigated: built-in exporter (doesn’t include comments); wget (doesn’t include access-locked posts); LJMigrate (gives an HTTP 307 error, which I have no idea how to deal with); most other tools on the list of DW-compatible LJ archivers (aren’t available at all anymore); printing every post to PDF and re-printing the relevant post with every new comment (severe, ongoing tedium).

{{previous post in sequence}}


brin-bellway:

brin-bellway:

dreamwidth-help:

I’m an oldie who used to use Semagic but I haven’t done a backup in a while and I believe Semagic doesn’t work anymore. Let me pitch this to the crowd.

*

I talked to my dad last night, and he said that in theory I should be able to feed wget my Dreamwidth login cookies to give it the ability to scrape locked posts. Will try it later today and report back.

Looks like it worked! Here is my Dreamwidth post with more info.


Tags:

#reply via reblog #oh look an update #oh look an original post #Dreamwidth #101 Uses for Infrastructureless Computers


{{next post in sequence}}

brin-bellway asked: Do you know of any good ways to backup a DW blog? So far, I have investigated: built-in exporter (doesn’t include comments); wget (doesn’t include access-locked posts); LJMigrate (gives an HTTP 307 error, which I have no idea how to deal with); most other tools on the list of DW-compatible LJ archivers (aren’t available at all anymore); printing every post to PDF and re-printing the relevant post with every new comment (severe, ongoing tedium).

{{previous post in sequence}}


brin-bellway:

dreamwidth-help:

I’m an oldie who used to use Semagic but I haven’t done a backup in a while and I believe Semagic doesn’t work anymore. Let me pitch this to the crowd.

*

I talked to my dad last night, and he said that in theory I should be able to feed wget my Dreamwidth login cookies to give it the ability to scrape locked posts. Will try it later today and report back.


Tags:

#reply via reblog #oh look an update #Dreamwidth #101 Uses for Infrastructureless Computers


{{next post in sequence}}

brin-bellway asked: Do you know of any good ways to backup a DW blog? So far, I have investigated: built-in exporter (doesn’t include comments); wget (doesn’t include access-locked posts); LJMigrate (gives an HTTP 307 error, which I have no idea how to deal with); most other tools on the list of DW-compatible LJ archivers (aren’t available at all anymore); printing every post to PDF and re-printing the relevant post with every new comment (severe, ongoing tedium).

dreamwidth-help:

I’m an oldie who used to use Semagic but I haven’t done a backup in a while and I believe Semagic doesn’t work anymore. Let me pitch this to the crowd.

*


Tags:

#zeroth degree asks


{{next post in sequence}}

captainneverever:

Now that you’ve downloaded your blog and are waiting for the next step, what next?

Help out the Internet Archive (aka Wayback Machine) scrape tumblr for all the blogs!

Check out tracker.archiveteam.org/tumblr to see the progress so far.

The ArchiveTeamWarrior needs an internet connection and some space on your device. They want to save as much as they can.

(Clarification: the Internet Archive is not *running* this project–ArchiveTeam is a separate entity–but they *will* be hosting the results.)

(hat-tip @sophia-epistemia)


#signal boost #101 Uses for Infrastructureless Computers #The Great Tumblr Apocalypse #The Last Tumblr Apocalypse