Internet Archive - Explore the Science & Experts

The Experts below are selected from a list of 97920 Experts worldwide ranked by ideXlab platform

Klaus Graf - One of the best experts on this subject based on the ideXlab platform.

Über 500 mittelalterliche Handschriften der Riccardiana in Florenz online

Archivalia, 2021

Co-Authors: Klaus Graf

Abstract:

522 #Digitalisate mittelalterlicher Handschriften aus der Biblioteca Riccardiana in Florenz, darunter Ricc. 1035 (Boccaccios Abschrift von Dantes "Divina Commedia" mit 7 ihm zugeschriebenen Zeichnungen)!@IlluminatedDan1 #Dante700 https://t.co/O06NtCcQiq pic.twitter.com/yeJS6DBEVV — manuscripta.at (@mss_oeaw) January 18, 2021 Kein Browsen, keine Permalinks, Internet Archive Viewer. Bei Collocazione kann man die Signatur, wenn bekannt eingeben (233 für Ricc.233)

15 days free trial to Access Article
Geschichtliche Erinnerungen in Vilstaler Redensarten

Archivalia, 2021

Co-Authors: Klaus Graf

Abstract:

Die Arbeit von Hans Schlappinger 1925 https://Archive.org/details/schlappinger-vilstaler-redensarten ist leider nicht sonderlich relevant für meine Studien zu ereignisbezogenen Sprichwörtern. Und auch über das "Geschichtsbild des Volkes" (ein obsoletes Konzept) lernt man nicht allzu viel - Rudolf Kubitscheks Hinweis in seinem Artikel "Das Gedächtnis des Volkes" (Internet Archive, ohne Anmerkungen: Projekt Gutenberg) versprach zuviel

15 days free trial to Access Article
Osnabrücker Mitteilungen bis 1925 jetzt lückenlos online

Archivalia, 2021

Co-Authors: Klaus Graf

Abstract:

Mit Dank an die Scan-Spender (für den Band 17, 1892) und Herrn Dr. Unger kann ich den Abschluss meines Osnabrück-Projekts vermelden. Dank Dirk Carius hatte HathiTrust die vor 1925 liegenden Bände für US-Bürger*innen geöffnet, was es mir ermöglichte, sie mittels HathiHelper ins Internet Archive zu bringen. Liste der Bände: https://de.wikisource.org/wiki/Mitteilungen_des_Vereins_f%C3%BCr_Geschichte_und_Landeskunde_von_Osnabr%C3%BCck Der Geschichtsverein kann sich erst einmal auf die Bände nach..

15 days free trial to Access Article
Narrenschiff

Archivalia, 2021

Co-Authors: Klaus Graf

Abstract:

https://Archive.org/details/woodfrogswechsle00doug ARLIMA hat die falschen Metadaten im Internet Archive 2018 angemerkt - passiert ist - nichts

15 days free trial to Access Article
Manuscripts of the Muslim World

Archivalia, 2021

Co-Authors: Klaus Graf

Abstract:

And now, the entire @MmwProject collection is on Internet Archive! Check it out: https://t.co/KPvVaimXBf 401 complete Islamicate manuscripts from all participating project repositories with more from @columbialib to be added very soon. Thanks to @sims_mss staff! pic.twitter.com/d1TDyfJBB0 — MMWProject (@MmwProject) May 16, 2021 See also https://archivalia.hypotheses.org/14969 (2010

15 days free trial to Access Article

Noah Wardripfruin - One of the best experts on this subject based on the ideXlab platform.

hypermedia eternal life and the impermanence agent

Leonardo, 1999

Co-Authors: Noah Wardripfruin

Abstract:

We look to media as memory, and a place to memorialize, when we have lost. Hypermedia pioneers such as Ted Nelson and Vannevar Bush envisioned the ultimate media within the ultimate Archive—with each element in continual flux, and with constant new addition. Dynamism without loss. Instead we have the Web, where “Not Found” is a daily message. Projects such as the Internet Archive and Afterlife dream of fixing this uncomfortable impermanence. Marketeers promise that agents (indentured information servants that may be the humans of About.com or the software of “Ask Jeeves”) will make the Web comfortable through filtering—hiding the impermanence and overwhelming profluence that the Web’s dynamism produces. The Impermanence Agent —a programmatic, esthetic, and critical project created by the author, Brion Moss, a.c. chapman, and Duane Whitehurst—operates differently. It begins as a storytelling agent, telling stories of impermanence, stories of preservation, memorial stories. It monitors each user’s Web browsing, and starts customizing its storytelling by weaving in images and texts that the user has pulled from the Web. In time, the original stories are lost. New stories, collaboratively created, have taken their place.

15 days free trial to Access Article
hypermedia eternal life and the impermanence agent

International Conference on Computer Graphics and Interactive Techniques, 1999

Co-Authors: Noah Wardripfruin

Abstract:

We look to media as memory, and a place to memorialize, when we have lost. Hypermedia pioneers such as Ted Nelson and Vannevar Bush envisioned the ultimate media within the ultimate Archive—with each element in continual flux, and with constant new addition. Dynamism without loss. Instead we have the Web, where “Not Found” is a daily message. Projects such as the Internet Archive and Afterlife dream of fixing this uncomfortable impermanence. Marketeers promise that agents (indentured information servants that may be the humans of About.com or the software of “Ask Jeeves”) will make the Web comfortable through filtering—hiding the impermanence and overwhelming profluence that the Web's dynamism produces. The Impermanence Agent—a programmatic, esthetic, and critical project created by the author, Brion Moss, a.c. chapman, and Duane Whitehurst— operates differently. It begins as a storytelling agent, telling stories of impermanence, stories of preservation, memorial stories. It monitors each user's Web brows...

15 days free trial to Access Article

Raul Magallon Rosa - One of the best experts on this subject based on the ideXlab platform.

la biblioteca digital sobre donald trump fact checking frente a fake news

Estudios Sobre El Mensaje Periodistico, 2018

Co-Authors: Raul Magallon Rosa

Abstract:

espanolobjetivo de esta investigacion es analizar la biblioteca digital creada por la Fundacion del Internet Archive sobre Donald Trump. El archivo, incluye mas de 700 discursos, entrevistas y debates del Presidente de los EEUU, Donald Trump. EnglishThe aim of this research is to analyze the digital library created by Internet Archive Foundation on Donald Trump. The Archive includes more than 700 speeches, interviews and debates by US President Donald Trump.

15 days free trial to Access Article

Michele C Weigle - One of the best experts on this subject based on the ideXlab platform.

a framework for aggregating private and public web Archives

ACM IEEE Joint Conference on Digital Libraries, 2018

Co-Authors: Mat Kelly, Michael L. Nelson, Michele C Weigle

Abstract:

Personal and private Web Archives are proliferating due to the increase in the tools to create them and the realization that Internet Archive and other public Web Archives are unable to capture personalized (e.g., Facebook) and private (e.g., banking) Web pages. We introduce a framework to mitigate issues of aggregation in private, personal, and public Web Archives without compromising potential sensitive information contained in private captures. We amend Memento syntax and semantics to allow TimeMap enrichment to account for additional attributes to be expressed inclusive of the requirements for dereferencing private Web Archive captures. We provide a method to involve the user further in the negotiation of archival captures in dimensions beyond time. We introduce a model for archival querying precedence and short-circuiting, as needed when aggregating private and personal Web Archive captures with those from public Web Archives through Memento. Negotiation of this sort is novel to Web archiving and allows for the more seamless aggregation of various types of Web Archives to convey a more accurate picture of the past Web.

15 days free trial to Access Article
Profiling web Archive coverage for top-level domain and content language

International Journal on Digital Libraries, 2014

Co-Authors: Ahmed Alsum, Michele C Weigle, Michael L. Nelson, Herbert Sompel

Abstract:

The Memento Aggregator currently polls every known public web Archive when serving a request for an Archived web page, even though some web Archives focus on only specific domains and ignore the others. Similar to query routing in distributed search, we investigate the impact on aggregated Memento TimeMaps (lists of when and where a web page was Archived) by only sending queries to Archives likely to hold the Archived page. We profile fifteen public web Archives using data from a variety of sources (the web, Archives’ access logs, and fulltext queries to Archives) and use these profiles as resource descriptor. These profiles are used in matching the URI-lookup requests to the most probable web Archives. We define $$Recall_{TM}(n)$$ R e c a l l T M ( n ) as the percentage of a TimeMap that was returned using $$n$$ n web Archives. We discover that only sending queries to the top three web Archives (i.e., 80 % reduction in the number of queries) for any request reaches on average $$Recall_{TM}=0.96$$ R e c a l l T M = 0.96 . If we exclude the Internet Archive from the list, we can reach $$Recall_{TM}=0.647$$ R e c a l l T M = 0.647 on average using only the remaining top three web Archives.

15 days free trial to Access Article
visualizing digital collections at Archive it

ACM IEEE Joint Conference on Digital Libraries, 2012

Co-Authors: Kalpesh Padia, Yasmin Alnoamany, Michele C Weigle

Abstract:

Archive-It, a subscription service from the Internet Archive, allows users to create, maintain and view digital collections of web resources. The current interface of Archive-It is largely text-based, supporting drill-down navigation using lists of URIs. To provide an overview of each collection and highlight the collection's underlying characteristics, we present four alternate visualizations (image plot with histogram, wordle, bubble chart and timeline). The sites in an Archive-It collection may be organized by the collection curator into groups for easier navigation. However, many collections do not have such groupings, making them difficult to explore. We introduce a heuristics-based categorization for such collections.

15 days free trial to Access Article

Tsakalidis Adam - One of the best experts on this subject based on the ideXlab platform.

DUKweb: Diachronic word representations from the UK Web Archive corpus

2021

Co-Authors: Tsakalidis Adam, Basile Pierpaolo, Bazzi Marya, Cucuringu Mihai, Mcgillivray Barbara

Abstract:

Lexical semantic change (detecting shifts in the meaning and usage of words) is an important task for social and cultural studies as well as for Natural Language Processing applications. Diachronic word embeddings (time-sensitive vector representations of words that preserve their meaning) have become the standard resource for this task. However, given the significant computational resources needed for their generation, very few resources exist that make diachronic word embeddings available to the scientific community. In this paper we present DUKweb, a set of large-scale resources designed for the diachronic analysis of contemporary English. DUKweb was created from the JISC UK Web Domain Dataset (1996-2013), a very large Archive which collects resources from the Internet Archive that were hosted on domains ending in `.uk'. DUKweb consists of a series word co-occurrence matrices and two types of word embeddings for each year in the JISC UK Web Domain dataset. We show the reuse potential of DUKweb and its quality standards via a case study on word meaning change detection.Comment: 24 pages, 6 figures The arXiv submission was replaced to include the following comment. This version of the article has been accepted for publication, after peer review (when applicable) but is not the Version of Record and does not reflect post-acceptance improvements, or any corrections. The Version of Record is available online at: http://dx.doi.org/10.1038/s41597-021-01047-

15 days free trial to Access Article
DUKweb: Diachronic word representations from the UK Web Archive corpus

'Organisation for Economic Co-Operation and Development (OECD)', 2021

Co-Authors: Tsakalidis Adam, Basile Pierpaolo, Bazzi Marya, Cucuringu Mihai, Mcgillivray Barbara

Abstract:

Lexical semantic change (detecting shifts in the meaning and usage of words) is an important task for social and cultural studies as well as for Natural Language Processing applications. Diachronic word embeddings (time-sensitive vector representations of words that preserve their meaning) have become the standard resource for this task. However, given the significant computational resources needed for their generation, very few resources exist that make diachronic word embeddings available to the scientific community. In this paper we present DUKweb, a set of large-scale resources designed for the diachronic analysis of contemporary English. DUKweb was created from the JISC UK Web Domain Dataset (1996-2013), a very large Archive which collects resources from the Internet Archive that were hosted on domains ending in `.uk'. DUKweb consists of a series word co-occurrence matrices and two types of word embeddings for each year in the JISC UK Web Domain dataset. We show the reuse potential of DUKweb and its quality standards via a case study on word meaning change detection

15 days free trial to Access Article
DUKweb (Diachronic UK web)

British Library, 2020

Co-Authors: Basile Pierpaolo, Tsakalidis Adam

Abstract:

We present DUKweb, a set of large-scale resources useful for the diachronic analysis of contemporary English. The dataset is derived from JISC UK Web Domain Dataset (1996-2013), which collects resources from the Internet Archive that were hosted on domains ending in ‘.uk’. The dataset includes co-occurrences matrices for each year and two types of word vectors by year, Temporal Random Indexing vectors and word2vec embeddings

15 days free trial to Access Article

Discover everything there is to know about the scientific topic Internet Archive with ideXlab!

Klaus Graf - One of the best experts on this subject based on the ideXlab platform.

Über 500 mittelalterliche Handschriften der Riccardiana in Florenz online

Geschichtliche Erinnerungen in Vilstaler Redensarten

Osnabrücker Mitteilungen bis 1925 jetzt lückenlos online

Narrenschiff

Manuscripts of the Muslim World

Noah Wardripfruin - One of the best experts on this subject based on the ideXlab platform.

hypermedia eternal life and the impermanence agent

hypermedia eternal life and the impermanence agent

Raul Magallon Rosa - One of the best experts on this subject based on the ideXlab platform.

la biblioteca digital sobre donald trump fact checking frente a fake news

Michele C Weigle - One of the best experts on this subject based on the ideXlab platform.

a framework for aggregating private and public web Archives

Profiling web Archive coverage for top-level domain and content language

visualizing digital collections at Archive it

Tsakalidis Adam - One of the best experts on this subject based on the ideXlab platform.

DUKweb: Diachronic word representations from the UK Web Archive corpus

DUKweb: Diachronic word representations from the UK Web Archive corpus

DUKweb (Diachronic UK web)