DPC: Digital Preservation Workflow Webinars and COW-a-thon, 8th and 10th of March 2022

Free to attend and open to all, the Digital Preservation Workflow Webinar series will showcase just some of the digital preservation workflow processes developed and implemented by DPC Member institutions.

Webinar programme
Episode 1 // Workflow presentations
Tuesday 8th March 2022, 1330–1430 UTC
‘Integrating digital archiving and e-thesis submission’ ...
On 21 January, the Netwerk Digitaal Erfgoed, Podiumkunst.net, and the Netwerk Archieven Design en Digitale Cultuur (NADD) kicked off 2022 with the Digitaal Erfgoed New Year's event. The online event, presented from the Bibliotheek Utrecht and featuring several premieres, can be watched back on YouTube.
If you work at an archival institution that is looking for a solution for registering/documenting restoration work and material care (in Mais Flexis), we would like to get in touch with you.
The turn to more data-intensive access methods for web and social media archives, as indicated by the use of big data and digital humanities methods to analyze social media content, calls for capturing social media in formats appropriate for these activities. Collections made up of structured data, usually in formats like JSON, CSV, and XLSX, are more amenable to computational methods such as network analysis, topic modelling, and many other visu...
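The point about amenability can be made concrete with a short sketch. The posts below are invented sample data, not output from any particular platform or API, but they show how little code separates a structured JSON export from a network-analysis input:

```python
import json
from collections import Counter

# A hypothetical structured-data export: one JSON object per post,
# with the kind of fields a social media API would typically return.
raw = """[
  {"author": "alice", "text": "Great talk!", "mentions": ["bob"]},
  {"author": "bob", "text": "Thanks both", "mentions": ["alice", "carol"]},
  {"author": "carol", "text": "Agreed", "mentions": ["alice"]}
]"""

posts = json.loads(raw)

# Because each post is already a structured record, a weighted edge list
# of who mentions whom (the input to a network analysis) is a few lines away.
edges = Counter(
    (post["author"], mention)
    for post in posts
    for mention in post["mentions"]
)

for (source, target), weight in sorted(edges.items()):
    print(f"{source} -> {target}: {weight}")
```

Doing the same against a rendered "look and feel" capture would first require extracting these fields from HTML, which is exactly the work a structured-data collection makes unnecessary.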
The WARC format is widely enough adopted to be considered one of the default formats for storing content captured from the web. It succeeded the ARC format as the main file format used by the Internet Archive, and it is maintained by the International Internet Preservation Consortium (IIPC). The rationale behind the WARC format is that a single file format for web archiving should preferably be able to hold not only the archived resources...
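To illustrate that rationale, the sketch below hand-builds and parses one minimal WARC record using only the standard library. The record is a simplified stand-in for illustration; in practice, archives should be read with a dedicated library such as warcio rather than parsed by hand:

```python
# One WARC record: a header block of named fields, a blank line,
# a payload of Content-Length bytes, and a trailing blank line.
# Different WARC-Type values (response, request, metadata, ...) let
# one file format hold the resources and the context of their capture.
record = (
    b"WARC/1.1\r\n"
    b"WARC-Type: response\r\n"
    b"WARC-Target-URI: http://example.com/\r\n"
    b"WARC-Date: 2022-01-01T00:00:00Z\r\n"
    b"Content-Length: 13\r\n"
    b"\r\n"
    b"Hello, world!"
    b"\r\n\r\n"
)

# Split the header block from the payload at the first blank line.
head, _, rest = record.partition(b"\r\n\r\n")
lines = head.decode("utf-8").split("\r\n")
version = lines[0]
headers = dict(line.split(": ", 1) for line in lines[1:])
payload = rest[: int(headers["Content-Length"])]

print(version)                  # e.g. WARC/1.1
print(headers["WARC-Type"])     # e.g. response
print(payload.decode("utf-8"))
```

Because every record carries its own type and length, a single WARC file can interleave archived resources with requests, metadata, and revisit records.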
The two general approaches to social media archiving presented here ("look and feel" and "structured data") also have implications for file format selection, which by extension has implications for preservation and collection quality. The choices made when capturing and preserving, part of which is selecting appropriate formats according to one's purpose, will affect the possible uses the collection can be put to and, by extension, the types...
This is a list of tools that capture social media content in the form of structured data, focusing on the information included (e.g., text, URLs, number of posts) rather than on the visual features of the content.
This is a list of sources referenced in this wiki. Care has been taken to include every source; however, additions and corrections for anything that might have been accidentally overlooked are of course welcome!
It is safe to say that most of the tools that output structured data are not the easiest or most intuitive to use. One notable exception is TAGS (Twitter Archiving Google Sheet). TAGS is in essence an app built on Google Sheets that uses the Twitter API to fetch structured data based on queries the user enters in the spreadsheet. TAGS makes use of an already authenticated Twitter API app for its operation, but you are able to use your ow...
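A rough Python analogue of what TAGS does behind the spreadsheet is: take the JSON a search API returns and flatten it into tabular rows. The response shape below is a simplified, invented stand-in, not the exact Twitter API payload, and the flattening here is not TAGS itself, only a sketch of the same idea:

```python
import csv
import io
import json

# A hypothetical search-API response (simplified stand-in, not the
# real Twitter API schema).
api_response = json.loads("""{
  "data": [
    {"id": "1", "author_id": "100", "created_at": "2022-01-05T10:00:00Z",
     "text": "Testing #webarchiving"},
    {"id": "2", "author_id": "101", "created_at": "2022-01-05T11:30:00Z",
     "text": "WARC or JSON? Why not both"}
  ]
}""")

# Flatten the JSON objects into spreadsheet-style rows, one per post.
out = io.StringIO()
writer = csv.DictWriter(out, fieldnames=["id", "author_id", "created_at", "text"])
writer.writeheader()
writer.writerows(api_response["data"])

print(out.getvalue())
```

The resulting CSV is exactly the kind of structured collection discussed above: immediately usable in a spreadsheet and straightforward to feed into computational analysis.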
Munin (Munin-Indexer) uses Docker to wrap different scraping and archiving tools together, offering a scraping solution for Facebook, Instagram, and VKontakte. It indexes and scrapes posts, then crawls and captures them, and finally uses pywb to display them.

Suitable for public social media content

The important thing to note about Munin is that it can only archive public posts, i.e., only posts that do not sit behind a log-in. Conseq...
Note: According to its GitHub page, this tool is no longer in active development at the time of this writing (January 2022). However, it is still available for download and still functions as expected. In a way, crocoite is a good example of a tool arising from the open-source community that could prove problematic to use in a professional setting because of a lack of ongoing support. As browser-based crawling seems to become central in th...
Initially known as Browsertrix, the Browsertrix Crawler is the latest and revamped version of what used to be a system of multiple browser-based crawlers that worked together to capture complex web content, such as social media. Browsertrix Crawler is built by the team behind the online web recording service Conifer and the desktop app ArchiveWeb.page (formerly known as Webrecorder and Webrecorder Player respectively) and uses the Chrome and C...
For those looking for large-scale harvesting solutions, Brozzler, like Browsertrix, is an interesting choice. Brozzler was developed and is still being maintained by the Internet Archive, and it is already used by organizations such as the Portuguese Web Archive. It is a browser-based crawler which uses Chrome or Chromium to access web content and harvest it in a WARC file. Brozzler is one of the newer-generation capturing tools which leverag...