archivist

joined 2 weeks ago
 

From Wikipedia:

It is the largest and oldest of the U.S. international broadcasters, producing digital, TV, and radio content in 48 languages for affiliate stations around the world.

From the AT wiki:

Under the second Trump administration, almost all of VOA's 1,300 journalists, producers and assistants were placed on administrative leave. There is risk that Voice of America will be shut down.

Archival has been ongoing for two weeks now, capturing millions of articles, reaching over 200 terabytes of data, including countless videos and images.

The grab is going a bit slowly, as Archive Team (AT) rate-limits how many items warriors can grab at once. As such, running more warriors on this project won't speed up archival right now.

14M items (articles, assets) have been archived, 16M are waiting to be processed, and 10M have failed to archive so far.
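For a rough sense of proportion, here is a quick back-of-the-envelope sketch in Python. It assumes the three figures above are disjoint and together cover all known items, which the tracker numbers may not strictly guarantee; the variable names are my own.

```python
# Rough figures from the post, in millions of items.
# Assumption: the three categories do not overlap.
counts = {"archived": 14, "waiting": 16, "failed": 10}

# Total known items under that assumption (in millions).
total = sum(counts.values())

# Share of each category as a whole-number percentage of the known total.
shares = {name: round(100 * n / total) for name, n in counts.items()}

for name, pct in shares.items():
    print(f"{name}: {counts[name]}M of {total}M ({pct}%)")
```

Under these assumptions, roughly a third of the known items are archived so far, and a quarter have failed at least once.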

[–] archivist@lemm.ee 1 points 4 days ago

It's odd that while there is one part that's film negative on purpose, some just seem to be negative for no reason...

 

It starts off with a stop motion part starring dominoes in front of a little building, and then transitions to a number of scenes featuring some fun camera trickery.

I find it fun that, some 60 years later, we made essentially the exact same thing as kids, only with a digital camera!

 

cross-posted from: https://lemm.ee/post/60191746

It’s a creative act to find and make sense of my own history, one that requires a leap of faith in order to fill in the silences, erasures, omissions, and genuine mysteries that old books and documents, records and artifacts, represent. A lot is left to the imagination. Much of what survives from the past asks more questions than we can answer. This is true for queer and trans archival traces, as it is for other aspects of humanity that are poorly accounted for in public records, or actively discriminated against through surveillance and omission in equal parts.

 

cross-posted from: https://lemm.ee/post/60203394

Dr. Brad Hafford shares his thoughts about modern and pencil-and-paper methods of recording archaeological data.

 


 


1
Roblox Assets Archival [New Project] (tracker.archiveteam.org)
submitted 4 days ago* (last edited 4 days ago) by archivist@lemm.ee to c/archiveteam@lemm.ee
 

I don't think there's info about this one on the wiki yet: https://wiki.archiveteam.org/index.php/Roblox

Looks like it will be done pretty quickly, as it was set to be the default project for warriors.

1
submitted 4 days ago* (last edited 4 days ago) by archivist@lemm.ee to c/archiveteam@lemm.ee
 

The archival started not long before the site was to be shut down, so there wasn't time to grab everything.

When the owners finally pulled the plug, blog posts started returning 403 errors, then later 410 errors. Images and JavaScript files remained downloadable for longer, but the JS files eventually started returning 410 as well. Images were still available for quite a bit longer.
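For anyone unfamiliar with those status codes, a minimal Python sketch of their standard meaning follows. It only uses the standard library's `http.HTTPStatus`; the helper names `describe` and `is_permanently_gone` are hypothetical, and this says nothing about how the warrior scripts actually handled these responses.

```python
from http import HTTPStatus


def describe(code: int) -> str:
    """Return a short label for an HTTP status code, e.g. '410 Gone'."""
    status = HTTPStatus(code)
    return f"{status.value} {status.phrase}"


def is_permanently_gone(code: int) -> bool:
    """410 says the resource was removed on purpose; 403 only says access is refused."""
    return code == HTTPStatus.GONE


for code in (403, 410):
    print(describe(code), "| permanently gone:", is_permanently_gone(code))
```

The switch from 403 to 410 is consistent with the server first refusing access and later declaring the posts intentionally removed.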

Today, only so-called "tag" items were being archived, possibly because we ran out of known images, or the team sniffed out that those were still available and valuable.

The last item my warrior grabbed was a tag item at 2025-04-02T10:39:21.085891703Z

8M-14M known items are left unarchived, with presumably many millions more not yet discovered.

1
deleted (lemm.ee)
submitted 5 days ago* (last edited 4 days ago) by archivist@lemm.ee to c/archaeology@mander.xyz
 

Ukrainian soldiers digging defensive fortifications stumbled upon an ancient Greek burial site in southern Ukraine.

Archived: archive.org, archive.ph

[–] archivist@lemm.ee 1 points 5 days ago

After a while, blog posts started returning 403 errors, then later 410. Images and JavaScript files remained downloadable for longer, but the JS files eventually started returning 410 as well. Now, only images are available, and the known ones are slowly being archived for as long as they remain downloadable.

[–] archivist@lemm.ee 6 points 6 days ago* (last edited 6 days ago)

Wasn't sure where to cross-post it on .ca! Québec, duh! Thanks.

Old "mundane" footage like this is always interesting, I would say!

[–] archivist@lemm.ee 1 points 6 days ago

There does seem to be some tracker rate limiting, but there certainly is a lot of work to be done.

3
SS Blog [New Archival Project] (tracker.archiveteam.org)
submitted 6 days ago* (last edited 6 days ago) by archivist@lemm.ee to c/datahoarder@lemmy.world
 

cross-posted from: https://lemm.ee/post/60023388

Archive Team has just begun the distributed archiving of the Japanese SS Blog, a blog hosting service, which is set to be discontinued on March 31, 2025.

And you can help! There isn't much time left, so we need as many people as possible running the warrior.

Resources:

  • The wiki page of the project (not much info)
  • The tracker (at the top of the page) has the simplest info on how you can help out
  • The GitHub page offers a Docker-based alternative for advanced users, and more info on best practices for this sort of archiving

Why help out?

The web is disappearing all the time, and a lot of previously easily accessible information is often lost for good. These Japanese blogs may not be very important to you, but they certainly are to a lot of people, and nobody knows what sort of information is found only here until they need it.

4
SS Blog [New Archival Project] (tracker.archiveteam.org)
submitted 6 days ago* (last edited 6 days ago) by archivist@lemm.ee to c/datahoarder@lemmy.ml
 

cross-posted from: https://lemm.ee/post/60023388


[–] archivist@lemm.ee 1 points 1 week ago

It's very convenient to have these archives always a click of a button away. Definitely recommend!

[–] archivist@lemm.ee 1 points 1 week ago

They are back now, but I could find no further info about it.

[–] archivist@lemm.ee 1 points 1 week ago

For a second I thought it might be another wave of DoS attacks.
