Internet Archive

14,000,000 Leading Edge Experts on the ideXlab platform

Scan Science and Technology

Contact Leading Edge Experts & Companies

The Experts below are selected from a list of 97920 Experts worldwide ranked by ideXlab platform

Klaus Graf - One of the best experts on this subject based on the ideXlab platform.

  • Über 500 mittelalterliche Handschriften der Riccardiana in Florenz online [More than 500 medieval manuscripts from the Riccardiana in Florence online]
    Archivalia, 2021
    Co-Authors: Klaus Graf
    Abstract:

    Tweet from manuscripta.at (@mss_oeaw), January 18, 2021: "522 digitized medieval manuscripts from the Biblioteca Riccardiana in Florence, including Ricc. 1035 (Boccaccio's copy of Dante's 'Divina Commedia' with 7 drawings attributed to him)! @IlluminatedDan1 #Dante700 https://t.co/O06NtCcQiq pic.twitter.com/yeJS6DBEVV" No browsing, no permalinks, Internet Archive viewer. Under "Collocazione" you can enter the shelfmark, if known (233 for Ricc.233).

  • Geschichtliche Erinnerungen in Vilstaler Redensarten [Historical memories in sayings from the Vils valley]
    Archivalia, 2021
    Co-Authors: Klaus Graf
    Abstract:

    Hans Schlappinger's 1925 work https://Archive.org/details/schlappinger-vilstaler-redensarten is unfortunately not particularly relevant to my studies of event-related proverbs. Nor does one learn very much from it about the "people's view of history" (an obsolete concept): Rudolf Kubitschek's reference to it in his article "Das Gedächtnis des Volkes" (Internet Archive; without notes: Projekt Gutenberg) promised too much.

  • Osnabrücker Mitteilungen bis 1925 jetzt lückenlos online [Osnabrücker Mitteilungen up to 1925 now online without gaps]
    Archivalia, 2021
    Co-Authors: Klaus Graf
    Abstract:

    With thanks to the scan donors (for volume 17, 1892) and to Dr. Unger, I can announce the completion of my Osnabrück project. Thanks to Dirk Carius, HathiTrust had opened the volumes predating 1925 to US citizens, which enabled me to bring them into the Internet Archive using HathiHelper. List of volumes: https://de.wikisource.org/wiki/Mitteilungen_des_Vereins_f%C3%BCr_Geschichte_und_Landeskunde_von_Osnabr%C3%BCck For the time being, the historical society can concentrate on the volumes after..

  • Narrenschiff [Ship of Fools]
    Archivalia, 2021
    Co-Authors: Klaus Graf
    Abstract:

    https://Archive.org/details/woodfrogswechsle00doug ARLIMA pointed out the incorrect metadata in the Internet Archive in 2018. What has happened since: nothing.

  • Manuscripts of the Muslim World
    Archivalia, 2021
    Co-Authors: Klaus Graf
    Abstract:

    Tweet from MMWProject (@MmwProject), May 16, 2021: "And now, the entire @MmwProject collection is on Internet Archive! Check it out: https://t.co/KPvVaimXBf 401 complete Islamicate manuscripts from all participating project repositories with more from @columbialib to be added very soon. Thanks to @sims_mss staff! pic.twitter.com/d1TDyfJBB0" See also https://archivalia.hypotheses.org/14969 (2010

Noah Wardrip-Fruin - One of the best experts on this subject based on the ideXlab platform.

  • Hypermedia, Eternal Life, and the Impermanence Agent
    Leonardo, 1999
    Co-Authors: Noah Wardrip-Fruin
    Abstract:

    We look to media as memory, and a place to memorialize, when we have lost. Hypermedia pioneers such as Ted Nelson and Vannevar Bush envisioned the ultimate media within the ultimate archive, with each element in continual flux and with constant new addition. Dynamism without loss. Instead we have the Web, where “Not Found” is a daily message. Projects such as the Internet Archive and Afterlife dream of fixing this uncomfortable impermanence. Marketeers promise that agents (indentured information servants that may be the humans of About.com or the software of “Ask Jeeves”) will make the Web comfortable through filtering, hiding the impermanence and overwhelming profluence that the Web’s dynamism produces. The Impermanence Agent (a programmatic, esthetic, and critical project created by the author, Brion Moss, a.c. chapman, and Duane Whitehurst) operates differently. It begins as a storytelling agent, telling stories of impermanence, stories of preservation, memorial stories. It monitors each user’s Web browsing, and starts customizing its storytelling by weaving in images and texts that the user has pulled from the Web. In time, the original stories are lost. New stories, collaboratively created, have taken their place.

  • Hypermedia, Eternal Life, and the Impermanence Agent
    International Conference on Computer Graphics and Interactive Techniques, 1999
    Co-Authors: Noah Wardrip-Fruin
    Abstract:

    We look to media as memory, and a place to memorialize, when we have lost. Hypermedia pioneers such as Ted Nelson and Vannevar Bush envisioned the ultimate media within the ultimate archive, with each element in continual flux and with constant new addition. Dynamism without loss. Instead we have the Web, where “Not Found” is a daily message. Projects such as the Internet Archive and Afterlife dream of fixing this uncomfortable impermanence. Marketeers promise that agents (indentured information servants that may be the humans of About.com or the software of “Ask Jeeves”) will make the Web comfortable through filtering, hiding the impermanence and overwhelming profluence that the Web's dynamism produces. The Impermanence Agent (a programmatic, esthetic, and critical project created by the author, Brion Moss, a.c. chapman, and Duane Whitehurst) operates differently. It begins as a storytelling agent, telling stories of impermanence, stories of preservation, memorial stories. It monitors each user's Web brows...

Raul Magallon Rosa - One of the best experts on this subject based on the ideXlab platform.

  • La biblioteca digital sobre Donald Trump. Fact-checking frente a fake news [The digital library on Donald Trump: fact-checking versus fake news]
    Estudios Sobre El Mensaje Periodistico, 2018
    Co-Authors: Raul Magallon Rosa
    Abstract:

    The aim of this research is to analyze the digital library on Donald Trump created by the Internet Archive foundation. The archive includes more than 700 speeches, interviews and debates by US President Donald Trump.

Michele C Weigle - One of the best experts on this subject based on the ideXlab platform.

  • A framework for aggregating private and public web archives
    ACM IEEE Joint Conference on Digital Libraries, 2018
    Co-Authors: Mat Kelly, Michael L. Nelson, Michele C Weigle
    Abstract:

    Personal and private Web archives are proliferating due to the increase in the tools to create them and the realization that the Internet Archive and other public Web archives are unable to capture personalized (e.g., Facebook) and private (e.g., banking) Web pages. We introduce a framework to mitigate issues of aggregation in private, personal, and public Web archives without compromising potentially sensitive information contained in private captures. We amend Memento syntax and semantics to allow TimeMap enrichment, so that additional attributes can be expressed, including the requirements for dereferencing private Web archive captures. We provide a method to involve the user further in the negotiation of archival captures in dimensions beyond time. We introduce a model for archival querying precedence and short-circuiting, as needed when aggregating private and personal Web archive captures with those from public Web archives through Memento. Negotiation of this sort is novel to Web archiving and allows for the more seamless aggregation of various types of Web archives to convey a more accurate picture of the past Web.

  • Profiling web archive coverage for top-level domain and content language
    International Journal on Digital Libraries, 2014
    Co-Authors: Ahmed Alsum, Michele C Weigle, Michael L. Nelson, Herbert Van de Sompel
    Abstract:

    The Memento Aggregator currently polls every known public web archive when serving a request for an archived web page, even though some web archives focus on only specific domains and ignore the others. Similar to query routing in distributed search, we investigate the impact on aggregated Memento TimeMaps (lists of when and where a web page was archived) by only sending queries to archives likely to hold the archived page. We profile fifteen public web archives using data from a variety of sources (the web, archives’ access logs, and fulltext queries to archives) and use these profiles as resource descriptors. These profiles are used in matching the URI-lookup requests to the most probable web archives. We define $Recall_{TM}(n)$ as the percentage of a TimeMap that was returned using $n$ web archives. We discover that only sending queries to the top three web archives (i.e., an 80% reduction in the number of queries) for any request reaches on average $Recall_{TM}=0.96$. If we exclude the Internet Archive from the list, we can reach $Recall_{TM}=0.647$ on average using only the remaining top three web archives.

  • Visualizing digital collections at Archive-It
    ACM IEEE Joint Conference on Digital Libraries, 2012
    Co-Authors: Kalpesh Padia, Yasmin Alnoamany, Michele C Weigle
    Abstract:

    Archive-It, a subscription service from the Internet Archive, allows users to create, maintain and view digital collections of web resources. The current interface of Archive-It is largely text-based, supporting drill-down navigation using lists of URIs. To provide an overview of each collection and highlight the collection's underlying characteristics, we present four alternate visualizations (image plot with histogram, wordle, bubble chart and timeline). The sites in an Archive-It collection may be organized by the collection curator into groups for easier navigation. However, many collections do not have such groupings, making them difficult to explore. We introduce a heuristics-based categorization for such collections.
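The $Recall_{TM}(n)$ measure defined in the 2014 profiling abstract above can be illustrated with a small Python sketch. It is not the authors' implementation: the function names `recall_tm` and `query_reduction`, the toy archive names, and the memento identifiers are all hypothetical, and the ranking of archives is simply given as input rather than derived from profiles.

```python
from typing import Dict, List, Set


def recall_tm(holdings: Dict[str, Set[str]], ranked: List[str], n: int) -> float:
    """Fraction of the full TimeMap recovered by querying only the top-n archives.

    holdings maps an archive name to the set of memento identifiers it holds
    for one URI; ranked lists archive names from most to least probable.
    Both structures are illustrative assumptions, not a real Memento API.
    """
    full_timemap = set().union(*holdings.values())
    if not full_timemap:
        return 0.0
    partial_timemap: Set[str] = set()
    for name in ranked[:n]:
        partial_timemap |= holdings.get(name, set())
    return len(partial_timemap) / len(full_timemap)


def query_reduction(total_archives: int, queried: int) -> float:
    """Share of queries saved by polling only `queried` of `total_archives`."""
    return (total_archives - queried) / total_archives


# Toy data (hypothetical): one large archive and two small ones.
toy_holdings = {
    "archive_a": {f"a{i}" for i in range(8)},
    "archive_b": {"b0"},
    "archive_c": {"c0"},
}
toy_ranking = ["archive_a", "archive_b", "archive_c"]

if __name__ == "__main__":
    for n in (1, 2, 3):
        print(n, recall_tm(toy_holdings, toy_ranking, n))
    # Three of fifteen archives queried, as in the abstract.
    print(query_reduction(15, 3))
```

With fifteen profiled archives, querying only the top three leaves 12 of 15 queries unsent, which matches the 80% reduction quoted in the abstract.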

Tsakalidis Adam - One of the best experts on this subject based on the ideXlab platform.

  • DUKweb: Diachronic word representations from the UK Web Archive corpus
    2021
    Co-Authors: Tsakalidis Adam, Basile Pierpaolo, Bazzi Marya, Cucuringu Mihai, Mcgillivray Barbara
    Abstract:

    Lexical semantic change (detecting shifts in the meaning and usage of words) is an important task for social and cultural studies as well as for Natural Language Processing applications. Diachronic word embeddings (time-sensitive vector representations of words that preserve their meaning) have become the standard resource for this task. However, given the significant computational resources needed for their generation, very few resources exist that make diachronic word embeddings available to the scientific community. In this paper we present DUKweb, a set of large-scale resources designed for the diachronic analysis of contemporary English. DUKweb was created from the JISC UK Web Domain Dataset (1996-2013), a very large archive which collects resources from the Internet Archive that were hosted on domains ending in '.uk'. DUKweb consists of a series of word co-occurrence matrices and two types of word embeddings for each year in the JISC UK Web Domain Dataset. We show the reuse potential of DUKweb and its quality standards via a case study on word meaning change detection. Comment: 24 pages, 6 figures. This version of the article has been accepted for publication, after peer review (when applicable), but is not the Version of Record and does not reflect post-acceptance improvements or any corrections. The Version of Record is available online at: http://dx.doi.org/10.1038/s41597-021-01047-

  • DUKweb: Diachronic word representations from the UK Web Archive corpus
    'Organisation for Economic Co-Operation and Development (OECD)', 2021
    Co-Authors: Tsakalidis Adam, Basile Pierpaolo, Bazzi Marya, Cucuringu Mihai, Mcgillivray Barbara
    Abstract:

    Lexical semantic change (detecting shifts in the meaning and usage of words) is an important task for social and cultural studies as well as for Natural Language Processing applications. Diachronic word embeddings (time-sensitive vector representations of words that preserve their meaning) have become the standard resource for this task. However, given the significant computational resources needed for their generation, very few resources exist that make diachronic word embeddings available to the scientific community. In this paper we present DUKweb, a set of large-scale resources designed for the diachronic analysis of contemporary English. DUKweb was created from the JISC UK Web Domain Dataset (1996-2013), a very large archive which collects resources from the Internet Archive that were hosted on domains ending in '.uk'. DUKweb consists of a series of word co-occurrence matrices and two types of word embeddings for each year in the JISC UK Web Domain Dataset. We show the reuse potential of DUKweb and its quality standards via a case study on word meaning change detection.

  • DUKweb (Diachronic UK web)
    British Library, 2020
    Co-Authors: Basile Pierpaolo, Tsakalidis Adam
    Abstract:

    We present DUKweb, a set of large-scale resources useful for the diachronic analysis of contemporary English. The dataset is derived from the JISC UK Web Domain Dataset (1996-2013), which collects resources from the Internet Archive that were hosted on domains ending in ‘.uk’. The dataset includes co-occurrence matrices for each year and two types of word vectors by year: Temporal Random Indexing vectors and word2vec embeddings.
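To make "co-occurrence matrices for each year" concrete, here is a minimal Python sketch of how yearly co-occurrence counts could be built from tokenized text. It is not the DUKweb pipeline, whose details the abstracts above do not give: the function name `cooccurrence_by_year`, the window size, the symmetric (unordered) pair counting, and the toy documents are all assumptions made for illustration.

```python
from collections import Counter, defaultdict
from typing import Dict, Iterable, List, Tuple

Pair = Tuple[str, str]


def cooccurrence_by_year(
    docs: Iterable[Tuple[int, List[str]]], window: int = 2
) -> Dict[int, Counter]:
    """Count unordered word pairs that occur within `window` tokens of each other.

    docs yields (year, tokens) pairs. The result maps each year to a sparse
    matrix stored as a Counter keyed by alphabetically sorted word pairs.
    """
    counts: Dict[int, Counter] = defaultdict(Counter)
    for year, tokens in docs:
        for i, left in enumerate(tokens):
            # Look ahead only, so each pair of positions is counted once.
            for j in range(i + 1, min(i + window + 1, len(tokens))):
                pair: Pair = tuple(sorted((left, tokens[j])))
                counts[year][pair] += 1
    return counts


# Toy corpus (hypothetical), one tiny document per year.
toy_docs = [
    (1996, ["web", "archive", "page"]),
    (2013, ["web", "page", "web"]),
]
toy_counts = cooccurrence_by_year(toy_docs, window=1)

if __name__ == "__main__":
    for year in sorted(toy_counts):
        print(year, dict(toy_counts[year]))
```

Keeping one sparse counter per year lets the same word pair be compared across years, which is the kind of comparison a diachronic analysis relies on.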