{"id":29673,"date":"2022-07-22T16:12:25","date_gmt":"2022-07-22T16:12:25","guid":{"rendered":"https:\/\/zeru.com\/blog\/?p=29673"},"modified":"2022-07-22T16:12:25","modified_gmt":"2022-07-22T16:12:25","slug":"is-there-a-twitter-archive-2","status":"publish","type":"post","link":"https:\/\/zeru.com\/blog\/is-there-a-twitter-archive-2","title":{"rendered":"Is There a Twitter Archive?"},"content":{"rendered":"<div id=\"ez-toc-container\" class=\"ez-toc-v2_0_43 counter-flat ez-toc-counter ez-toc-light-blue ez-toc-container-direction\">\n<p class=\"ez-toc-title\">Contents<\/p>\n<label for=\"ez-toc-cssicon-toggle-item-69df939c3e5d0\" class=\"cssicon\"><span style=\"display: flex;align-items: center;width: 35px;height: 30px;justify-content: center;direction:ltr;\"><svg style=\"fill: #999;color:#999\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\" class=\"list-377408\" width=\"20px\" height=\"20px\" viewBox=\"0 0 24 24\" fill=\"none\"><path d=\"M6 6H4v2h2V6zm14 0H8v2h12V6zM4 11h2v2H4v-2zm16 0H8v2h12v-2zM4 16h2v2H4v-2zm16 0H8v2h12v-2z\" fill=\"currentColor\"><\/path><\/svg><svg style=\"fill: #999;color:#999\" class=\"arrow-unsorted-368013\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\" width=\"10px\" height=\"10px\" viewBox=\"0 0 24 24\" version=\"1.2\" baseProfile=\"tiny\"><path d=\"M18.2 9.3l-6.2-6.3-6.2 6.3c-.2.2-.3.4-.3.7s.1.5.3.7c.2.2.4.3.7.3h11c.3 0 .5-.1.7-.3.2-.2.3-.5.3-.7s-.1-.5-.3-.7zM5.8 14.7l6.2 6.3 6.2-6.3c.2-.2.3-.5.3-.7s-.1-.5-.3-.7c-.2-.2-.4-.3-.7-.3h-11c-.3 0-.5.1-.7.3-.2.2-.3.5-.3.7s.1.5.3.7z\"\/><\/svg><\/span><\/label><label for=\"ez-toc-cssicon-toggle-item-69df939c3e5d0\"  class=\"cssiconcheckbox\">1<\/label><input type=\"checkbox\"  id=\"ez-toc-cssicon-toggle-item-69df939c3e5d0\" ><nav><ul class='ez-toc-list ez-toc-list-level-1 ' ><li class='ez-toc-page-1'><a class=\"ez-toc-link ez-toc-heading-1\" href=\"https:\/\/zeru.com\/blog\/is-there-a-twitter-archive-2\/#Is_There_a_Twitter_Archive\" title=\"Is There a Twitter Archive?\">Is There a Twitter 
Archive?<\/a><\/li><li class='ez-toc-page-1'><a class=\"ez-toc-link ez-toc-heading-2\" href=\"https:\/\/zeru.com\/blog\/is-there-a-twitter-archive-2\/#Information_about_the_Library_of_Congresss_planned_digital_archive_of_all_public_tweets\" title=\"Information about the Library of Congress&#8217;s planned digital archive of all public tweets\">Information about the Library of Congress&#8217;s planned digital archive of all public tweets<\/a><\/li><li class='ez-toc-page-1'><a class=\"ez-toc-link ez-toc-heading-3\" href=\"https:\/\/zeru.com\/blog\/is-there-a-twitter-archive-2\/#Challenges_in_accessing_the_archive\" title=\"Challenges in accessing the archive\">Challenges in accessing the archive<\/a><\/li><li class='ez-toc-page-1'><a class=\"ez-toc-link ez-toc-heading-4\" href=\"https:\/\/zeru.com\/blog\/is-there-a-twitter-archive-2\/#Size_of_the_archive\" title=\"Size of the archive\">Size of the archive<\/a><\/li><\/ul><\/nav><\/div>\n<h1><span class=\"ez-toc-section\" id=\"Is_There_a_Twitter_Archive\"><\/span>Is There a Twitter Archive?<span class=\"ez-toc-section-end\"><\/span><\/h1>\n<p>In order to answer the question, &#8220;Is there a Twitter archive?&#8221; we have to understand what the data will consist of. A tweet contains about 150 pieces of metadata, such as the time stamp, location, and unique numerical ID. These elements are also reflected in the archive, along with any replies, favorites, or retweets that were made. 
For example, if someone tweeted &#8220;I love cats,&#8221; the archive would record the text along with the author&#8217;s name, profile URL, and follower count.<\/p>\n<h2><span class=\"ez-toc-section\" id=\"Information_about_the_Library_of_Congresss_planned_digital_archive_of_all_public_tweets\"><\/span>Information about the Library of Congress&#8217;s planned digital archive of all public tweets<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p>The planned digital archive of all public tweets will contain only text, not images, videos, or animated GIFs. Twitter&#8217;s changing nature has limited the scope of this project. While the Library of Congress had hoped to create a repository for both tweets and images, it has run into access problems. The tweet archive will remain embargoed until the Library overcomes these problems.<\/p>\n<p>The Library of Congress has been creating archival collections of websites since 2000 and had collected 525 terabytes of web archives as of March 2014, growing at a rate of five terabytes per month. Because public tweets have become a permanent part of the record of cultural and world events, the Library of Congress saw a need to archive this data.<\/p>\n<p>The Library of Congress is working with Twitter to create this digital archive. The goal is to make the archive accessible to researchers who would benefit from its content. The first tweet was posted in March 2006, and the service has since grown to more than 50 million tweets a day. As a result, the archive will capture the public tweets of millions of people. 
Tweets will be made available six months after they were originally posted.<\/p>\n<h2><span class=\"ez-toc-section\" id=\"Challenges_in_accessing_the_archive\"><\/span>Challenges in accessing the archive<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p>The Library of Congress has long experience preserving massive amounts of digital information: it has archived presidential and congressional campaign sites since 2000 and collected more than 525 terabytes of web archive data. Yet it has encountered several unique technical challenges in providing access to the Twitter archive. The size of the archive &#8211; 21 billion tweets containing more than 50 fields of metadata each &#8211; makes it particularly difficult to access.<\/p>\n<p>Researchers have tried to mine Twitter data for useful insights, but they have encountered several challenges. One of the biggest is that Twitter only allows researchers to access the latest 3,200 tweets from an account. This means that researchers must make assumptions about keywords and topics and may be unable to verify the validity of their sampled data. Other social networks have far stricter licensing policies and do not allow researchers to download a full archive at all.<\/p>\n<p>Each tweet in the archive carries about 150 pieces of metadata. These include a unique numerical ID, a timestamp, and a location stamp, as well as a list of replies, favorites, and retweets. The record also includes account details such as the author&#8217;s follower count. The Library of Congress may be able to provide direct access to individual data elements in the Twitter archive. However, several challenges must still be addressed before the Twitter archive can be made publicly available.<\/p>\n<h2><span class=\"ez-toc-section\" id=\"Size_of_the_archive\"><\/span>Size of the archive<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p>The Library of Congress has recently confirmed the size of the Twitter archive. 
According to the Library, each tweet carries about 150 pieces of metadata, including a unique numerical ID, a timestamp, and a location stamp. It also contains IDs for replies, favorites, and retweets, as well as the language. As of the time of writing, the archive contains over one hundred and forty billion tweets. However, there is no practical way to access all of these data elements.<\/p>\n<p>Although the Library of Congress is experienced in preserving large amounts of digital information, this project presents unique challenges for the agency, above all the sheer size of the Twitter archive. The 2006 to 2010 portion alone contains 21 billion tweets, each with over 50 fields of metadata. The Library of Congress received the data in early 2012, and Gnip was selected to handle the delivery of the archive to users.<\/p>\n<p>While there are no plans to restrict access to the Twitter archive, the size of the repository is estimated to be a quarter of the global output of news. This figure is even higher if retweets are removed. Twitter messages are already published on the web, and the Library of Congress wants to preserve them for future generations. However, it is unclear whether the Library of Congress will provide users with access to their own tweets and what controls they will have.<\/p>\n<p> <iframe width=\"387\" frameborder=\"0\" src=\"https:\/\/www.youtube.com\/embed\/b0884Xak6So\" height=\"216\" allowfullscreen=\"true\" style=\"margin:0px auto; display: block;\"><\/iframe><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Is There a Twitter Archive? In order to answer the question, &#8220;Is there a Twitter archive?&#8221; we have to understand what the data will consist of. A tweet contains about 150 pieces of metadata, such as the time stamp, location, and unique numerical ID. 
These elements are also reflected in the archive, along with any [&hellip;]<\/p>\n","protected":false},"author":2,"featured_media":30689,"comment_status":"closed","ping_status":"","sticky":false,"template":"","format":"standard","meta":[],"categories":[4],"tags":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO Premium plugin v19.7 (Yoast SEO v21.1) - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Is There a Twitter Archive? - Zeru<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/zeru.com\/blog\/is-there-a-twitter-archive-2\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Is There a Twitter Archive?\" \/>\n<meta property=\"og:description\" content=\"Is There a Twitter Archive? In order to answer the question, &#8220;Is there a Twitter archive?&#8221; we have to understand what the data will consist of. A tweet contains about 150 pieces of metadata, such as the time stamp, location, and unique numerical ID. 
These elements are also reflected in the archive, along with any [&hellip;]\" \/>\n<meta property=\"og:url\" content=\"https:\/\/zeru.com\/blog\/is-there-a-twitter-archive-2\" \/>\n<meta property=\"og:site_name\" content=\"Zeru\" \/>\n<meta property=\"article:published_time\" content=\"2022-07-22T16:12:25+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/zeru.com\/blog\/wp-content\/uploads\/Is-There-a-Twitter-Archive_29673.png\" \/>\n\t<meta property=\"og:image:width\" content=\"940\" \/>\n\t<meta property=\"og:image:height\" content=\"748\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"author\" content=\"Lizzie Yates\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Lizzie Yates\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"4 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/zeru.com\/blog\/is-there-a-twitter-archive-2\",\"url\":\"https:\/\/zeru.com\/blog\/is-there-a-twitter-archive-2\",\"name\":\"Is There a Twitter Archive? 
- Zeru\",\"isPartOf\":{\"@id\":\"https:\/\/zeru.com\/blog\/#website\"},\"datePublished\":\"2022-07-22T16:12:25+00:00\",\"dateModified\":\"2022-07-22T16:12:25+00:00\",\"author\":{\"@id\":\"https:\/\/zeru.com\/blog\/#\/schema\/person\/61005d9ec00b94bc50fbaf11b78aa55e\"},\"breadcrumb\":{\"@id\":\"https:\/\/zeru.com\/blog\/is-there-a-twitter-archive-2#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/zeru.com\/blog\/is-there-a-twitter-archive-2\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/zeru.com\/blog\/is-there-a-twitter-archive-2#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/zeru.com\/blog\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Is There a Twitter Archive?\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/zeru.com\/blog\/#website\",\"url\":\"https:\/\/zeru.com\/blog\/\",\"name\":\"Zeru\",\"description\":\"Blog\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/zeru.com\/blog\/?s={search_term_string}\"},\"query-input\":\"required name=search_term_string\"}],\"inLanguage\":\"en-US\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/zeru.com\/blog\/#\/schema\/person\/61005d9ec00b94bc50fbaf11b78aa55e\",\"name\":\"Lizzie Yates\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/zeru.com\/blog\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/zeru.com\/blog\/wp-content\/uploads\/19-150x150.jpg\",\"contentUrl\":\"https:\/\/zeru.com\/blog\/wp-content\/uploads\/19-150x150.jpg\",\"caption\":\"Lizzie Yates\"},\"description\":\"A content marketing strategist with the Zeru team for a little over 5 years, Lizzie Yates specializes in everything digital media with a particular focus on social media and technology. Her passion? 
To follow how the social media sites like Instagram, YouTube, Facebook, Twitter, and TikTok are maturing over time, and what businesses can do to keep up. She shares her insights on our blog in a true outpouring of knowledge and expertise. Her knowledge about technology and social media is vast, and she is always willing to share her insights with businesses to help them stay up-to-date with the latest trends.\",\"url\":\"https:\/\/zeru.com\/blog\/author\/writer\"}]}<\/script>\n<!-- \/ Yoast SEO Premium plugin. -->","yoast_head_json":{"title":"Is There a Twitter Archive? - Zeru","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/zeru.com\/blog\/is-there-a-twitter-archive-2","og_locale":"en_US","og_type":"article","og_title":"Is There a Twitter Archive?","og_description":"Is There a Twitter Archive? In order to answer the question, &#8220;Is there a Twitter archive?&#8221; we have to understand what the data will consist of. A tweet contains about 150 pieces of metadata, such as the time stamp, location, and unique numerical ID. These elements are also reflected in the archive, along with any [&hellip;]","og_url":"https:\/\/zeru.com\/blog\/is-there-a-twitter-archive-2","og_site_name":"Zeru","article_published_time":"2022-07-22T16:12:25+00:00","og_image":[{"width":940,"height":748,"url":"https:\/\/zeru.com\/blog\/wp-content\/uploads\/Is-There-a-Twitter-Archive_29673.png","type":"image\/png"}],"author":"Lizzie Yates","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Lizzie Yates","Est. reading time":"4 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/zeru.com\/blog\/is-there-a-twitter-archive-2","url":"https:\/\/zeru.com\/blog\/is-there-a-twitter-archive-2","name":"Is There a Twitter Archive? 
- Zeru","isPartOf":{"@id":"https:\/\/zeru.com\/blog\/#website"},"datePublished":"2022-07-22T16:12:25+00:00","dateModified":"2022-07-22T16:12:25+00:00","author":{"@id":"https:\/\/zeru.com\/blog\/#\/schema\/person\/61005d9ec00b94bc50fbaf11b78aa55e"},"breadcrumb":{"@id":"https:\/\/zeru.com\/blog\/is-there-a-twitter-archive-2#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/zeru.com\/blog\/is-there-a-twitter-archive-2"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/zeru.com\/blog\/is-there-a-twitter-archive-2#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/zeru.com\/blog"},{"@type":"ListItem","position":2,"name":"Is There a Twitter Archive?"}]},{"@type":"WebSite","@id":"https:\/\/zeru.com\/blog\/#website","url":"https:\/\/zeru.com\/blog\/","name":"Zeru","description":"Blog","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/zeru.com\/blog\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"en-US"},{"@type":"Person","@id":"https:\/\/zeru.com\/blog\/#\/schema\/person\/61005d9ec00b94bc50fbaf11b78aa55e","name":"Lizzie Yates","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/zeru.com\/blog\/#\/schema\/person\/image\/","url":"https:\/\/zeru.com\/blog\/wp-content\/uploads\/19-150x150.jpg","contentUrl":"https:\/\/zeru.com\/blog\/wp-content\/uploads\/19-150x150.jpg","caption":"Lizzie Yates"},"description":"A content marketing strategist with the Zeru team for a little over 5 years, Lizzie Yates specializes in everything digital media with a particular focus on social media and technology. Her passion? To follow how the social media sites like Instagram, YouTube, Facebook, Twitter, and TikTok are maturing over time, and what businesses can do to keep up. She shares her insights on our blog in a true outpouring of knowledge and expertise. 
Her knowledge about technology and social media is vast, and she is always willing to share her insights with businesses to help them stay up-to-date with the latest trends.","url":"https:\/\/zeru.com\/blog\/author\/writer"}]}},"_links":{"self":[{"href":"https:\/\/zeru.com\/blog\/wp-json\/wp\/v2\/posts\/29673"}],"collection":[{"href":"https:\/\/zeru.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/zeru.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/zeru.com\/blog\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/zeru.com\/blog\/wp-json\/wp\/v2\/comments?post=29673"}],"version-history":[{"count":1,"href":"https:\/\/zeru.com\/blog\/wp-json\/wp\/v2\/posts\/29673\/revisions"}],"predecessor-version":[{"id":29679,"href":"https:\/\/zeru.com\/blog\/wp-json\/wp\/v2\/posts\/29673\/revisions\/29679"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/zeru.com\/blog\/wp-json\/wp\/v2\/media\/30689"}],"wp:attachment":[{"href":"https:\/\/zeru.com\/blog\/wp-json\/wp\/v2\/media?parent=29673"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/zeru.com\/blog\/wp-json\/wp\/v2\/categories?post=29673"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/zeru.com\/blog\/wp-json\/wp\/v2\/tags?post=29673"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}