The Bigger Picture: Visual Archives and the Smithsonian
Posts tagged with: Web/Tech
- The beauty of the mechanical - Photographer, Kevin Twomey, has a series of images of the inside workings of mechanical calculators. [via PetaPixel]
- The Getty's Online Scholarly Catalogue Initiative (OCSI) serves as a platform for the sharing of free art catalogues, including the Freer and Sackler Galleries catalog, The World of the Japanese Illustrated Book. [via OpenCulture]
- On Halloween this year, the Smithsonian American Art Museum and the National Museum of American History redidicated Alexander Calder's, Gwenfritz, as was reinstalled in it's original location on the west lawn of NMAH. [via O Say Can You See? blog, NMAH]
- A reimagined National Mall, as told by artist, Sam Durant's Proposal for White and Indian Dead Monument Transpositions, Washington, D.C., which is on exhibition at the Los Angeles County Museum of Art. [via Unframed blog, LACMA]
- Imagine that - You are now able to search every tweet on Twitter, all some half trillion of them and get results in under 100ms. [via InfoDocket]
- The Great War is a video series that will document how World War I unfolded, week-by-week, for the next 4 years. [via OpenCulture]
- Talk about a handful - A look at raising red pandas by hand at the National Zoo. [via Smithsonian Science]
- A new look for the Smithsonian as it announces its plans for renovating the South Mall portion of the Smithsonian which includes the Smithsonian Castle, the Sackler Gallery of Art, the National Museum of African Art, the Arts and Industries Building, the Hirshhorn Museum and Sculpture Garden, and the Enid A. Haupt Garden. [via Newsdesk, SI]
- Up for a redesign - the new look of the Internet Archive. [via Internet Archive Blogs]
- Reality check - Digital files decay - Here's how The Getty takes care of their digital files. [via The Getty Iris]
- Parchment during medieval times was quite the expensive purchase - As a result there are a number of creative ways people employed to repair tears, holes, and other imperfections. [via Colossal]
- A look at the digital preservation practices of 148 cultural institutions provides a basis for the current state of the digital preservation landscape. [The Signal: Digital Preservation, LOC]
- Never before seen in it's entirety by the public,The Textus Roffensis, a 12th century legal encyclopaedia compiled by a single scribe at Rochester Cathedral, in Kent, in the 1120s has been digitized and made available online. [via InfoDocket]
- I see you - a new satellite image of the National Portrait Gallery portrait commission, One of Many, One, by Jorge Rodriguez-Gerada. [via AirSpace blog, NASM]
- Yes, you heard right, the Smithsonian is on its way to raising $1.5 Billion to support its museums, research centers, and programs. [via The Torch, SI]
- Getting toned, book style - Toning Japanese paper hinges for reattaching boards to leather bindings. [via Unbound blog, Smithsonian Libraries]
- Announced this week - The papers of Nobel laureate Toni Morrison will reside at Princeton University Library. [via InfoDocket]
- Not just go-go or punk - The new D.C. Vernacular Music Archive at George Washington University encompasses the variety of music found in our nation's capital. [via DCist]
- Challenge accepted - Flickr created a site to tell you if your picture has a park or a bird in it in response to a challenge laid out in the XKCD webcomic. [via PetaPixel]
- A new tool is coming from Rhizome that allows you preserve the dynamic content found on social media sites called Colloq. [via Bits blog, The New York Times]
- Walk in the steps of Jane Goodall, the Jane Goodall Institute and Google teamed up to bring their Street View Trekker cameras to Gombe National Park in Tanzania and allow you to explore and experience it. [via PetaPixel]
- A bold plan from the National Archives - Digitize their analog records, all 12 billion pages of them. [via AOTUS blog, NARA]
- An epic road trip - Collecting on the road with Jason Stieber, National Collector, Archives of American Art. [via Smithsonian Collections Blog]
- Now availble - DigDC, a new online archive of Washington D.C. history created by the D.C. Public Library’s Special Collections department. [via Washington Post]
- Documenting events as the are happening - A conversation with Howard Besser and the efforts of Activist Archivists in saving the records of the "Occupy" movement. [via The Signal: Digital Perservation, LOC]
- From the stacks - Exhibits writer-editor, David Romanowski, talks about his adventures in doing research in the National Air and Space Museum Archives' Technical Files for the Hawaii by Air exhibition. [via AirSpace blog, NASM]
- Some thoughts on archival appraisal in the age of distant reading and computational analysis of large sets of electronic records. [via The Signal: Digital Preservation, LOC]
- Gale/Library Journal 2014 Library of the Year - Edmonton Public Libary - presents this cool video timeline of their 101 year history. [via InfoDocket]
Last week, we celebrated two years of using Archive-It for documenting the Smithsonian Institution's web presence. Previously, we had been using an in-house software and hardware installation in order to crawl websites and had cobbled together various less-than-ideal methods for capturing social media. Our hope was that a subscription to Archive-It would allow us to capture our web presence in a more efficient manner as well as allow us to provide better access to our crawled web content.
So how are we doing?
The Smithsonian currently has a total of 349 distinct websites and blogs. In the last year, we've crawled 170 of them or approximately 49% of the total. Altogether, we've crawled 327 websites and blogs, about 94% of the total, since we began using Archive-It two years ago. In addition, a significant number have been crawled more than once. Of those that have yet to be crawled, the majority have underlying code that make them nearly impossible to crawl using the technology currently available to us.
By this point, we had hoped to be crawling our websites and blogs annually. Although we haven't reached that goal, we've certainly improved from approximately one-half of our websites in 2 ½ years prior to using Archive-It, to nearly all of our websites and blogs in less than two years with Archive-It. And there's the added bonus of most of our crawled content from the last two years being available online via our Smithsonian Institution Websites Collection on Archive-It.
We continue to take steps to improve our efficiency. One of our next steps will be to evaluate the websites we've already crawled to determine which ones do not need to be crawled again because they are no longer being updated. An example might be an online exhibition that was launched in its final format and was never intended to be modified. The fewer websites that need to be crawled, the more frequently we'll be able to capture those that do.
- Web Archiving Update, The Bigger Picture, Smithsonian Institution Archives
- Smithsonian Now Using Archive-It to Crawl Websites, The Bigger Picture, Smithsonian Institution Archives
- Connecting the Dots: Issues with Preserving Complex Websites, The Bigger Picture, Smithsonian Institution Archives