Deeper analysis of 856 fields

janusman - May 12, 2008 - 21:25
Project: Millennium Integration
Version: 6.x-2.x-dev
Component: Code
Category: feature request
Priority: normal
Assigned: Unassigned
Status: active
Description

Right now the module automatically downloads tables of contents (TOCs) from certain Library of Congress URLs when the URL appears in the 856 field of the original record. This does not work in all cases, however; for example, it fails for http://www.loc.gov/catdir/toc/hol041/2003056462.html. It also does not yet handle other sources of TOCs, or other book information that one might want to import into the biblio record.
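
As a starting point for discussion, here is a minimal Python sketch of how an 856 URL might be recognized as a Library of Congress TOC page. This is not the module's current logic (the module is PHP); the function name `is_loc_toc_url` and the rule of matching the host plus a `/catdir/toc/` path prefix are assumptions based only on the example URL above.

```python
from urllib.parse import urlparse

# Hosts assumed to serve LOC table-of-contents pages (assumption, not from the module).
LOC_HOSTS = {"www.loc.gov", "loc.gov"}


def is_loc_toc_url(url):
    """Return True if the URL looks like a Library of Congress TOC page.

    Hypothetical helper: matches on host and the /catdir/toc/ path prefix.
    """
    parts = urlparse(url)
    host = (parts.hostname or "").lower()
    return host in LOC_HOSTS and parts.path.startswith("/catdir/toc/")
```

A broader matcher like this would at least accept the failing URL above; whether the downloaded page can then be parsed is a separate question.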

#1

janusman - January 7, 2009 - 20:55

Notes from discussion in #code4lib:

Ideas:
* How about importing reviews?
* How about indexing PDFs or other HTMLs that are linked to in 856s?
* How about showing thumbnails or other surrogates of linked-to jpg, gif, png, etc. files?
* How about having an inline/popup viewer for some linked-to files? (e.g. PDFs?)
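
Several of these ideas depend on first knowing what kind of file an 856 link points to. A rough Python sketch of that step, guessing from the file extension only (the function name `classify_link` and the category labels are made up for illustration; a real implementation would probably also need to check the Content-Type returned by the server):

```python
import posixpath
from urllib.parse import urlparse

# Extensions taken from the ideas above; the grouping is an assumption.
IMAGE_EXTS = {".jpg", ".jpeg", ".gif", ".png"}
HTML_EXTS = {".html", ".htm"}


def classify_link(url):
    """Guess what an 856 link points to, based only on its file extension."""
    path = urlparse(url).path
    ext = posixpath.splitext(path)[1].lower()
    if ext in IMAGE_EXTS:
        return "image"  # candidate for a thumbnail
    if ext == ".pdf":
        return "pdf"    # candidate for indexing or an inline viewer
    if ext in HTML_EXTS:
        return "html"   # candidate for indexing
    return "other"
```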

#2

janusman - January 7, 2009 - 23:47

edsu @#code4lib provided this link to a crawl of (millions of?) LOC records, extracting the 856s.
http://inkdroid.org/bzr/beat/marclinks.txt

#3

janusman - May 20, 2009 - 15:38
Title: Download tables of contents from LOC » Deeper analysis of 856 fields
Version: 5.x-1.2 » 6.x-2.x-dev

Changing issue subject and version.

It feels like the most useful contribution would be to tag items, using taxonomy, as being available online or physically in a library. This could be done by checking for 856 fields with a second indicator of 0, which indicates that the link points to an online version of the item itself.
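
A minimal Python sketch of that check, to make the idea concrete. The record layout (a list of dicts with `tag`, `ind1`, `ind2` and `subfields` keys), the function names and the two term labels are all assumptions for illustration; the module itself is PHP and stores records differently. Note that treating "no 856 with second indicator 0" as "in library" is also an assumption that would need checking against real records.

```python
def has_online_version(fields):
    """Return True if any 856 field has a second indicator of 0."""
    return any(
        f.get("tag") == "856" and f.get("ind2") == "0"
        for f in fields
    )


def availability_term(fields):
    """Pick a taxonomy term name for the record (labels are hypothetical)."""
    return "Available online" if has_online_version(fields) else "In library"


# Hypothetical parsed record: one 856 pointing at the resource itself.
record = [
    {"tag": "245", "ind1": "1", "ind2": "0", "subfields": {"a": "Example title"}},
    {"tag": "856", "ind1": "4", "ind2": "0",
     "subfields": {"u": "http://example.org/fulltext.pdf"}},
]
term = availability_term(record)
```

The tag check matters here: other fields, such as the 245 in this example, can also carry a second indicator of 0.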

 
 
