Deeper analysis of 856 fields
janusman - May 12, 2008 - 21:25
| Project: | Millennium Integration |
| Version: | 6.x-2.x-dev |
| Component: | Code |
| Category: | feature request |
| Priority: | normal |
| Assigned: | Unassigned |
| Status: | active |
Jump to:
Description
RIght now the module handles automatic downloading of Tables of Contents from certain Library of Congress URLs, if the URL is in the 856 field of the original record. This does not work for all cases, however, like: http://www.loc.gov/catdir/toc/hol041/2003056462.html and does yet handle other sources of TOCs or other book information that might want to be imported into the biblio record.

#1
Notes from discussion in #code4lib:
http://www.loc.gov/catdir/beat/hnet.html
(ex: http://www.h-net.org/review/hrev-a0a1r8-aa)
Base URL : http://www.loc.gov/catdir/samples/*
Link in record sometimes is .html file that links to the PDF file.
(ex: http://www.loc.gov/catdir/samples/cam031/94044080.html)
http://roytennant.com/proto/856/
"856$3 values from public libraries in Canada" at http://paste.lisp.org/display/73246
Ideas:
* How about importing reviews?
* How about indexing PDFs or other HTMLs that are linked to in 856s?
* How about showing thumbnails or other surrogates of linked-to jpg, gif, png, etc. files?
* How about having an inline/popup viewer for some linked-to files? (e.g. PDFs?)
#2
edsu @#code4lib provided this link to a crawl of (millions of?) LOC records, extracting the 856s.
http://inkdroid.org/bzr/beat/marclinks.txt
#3
Changing issue subject and version.
It feels the most useful contribution would be to actually tag items as being available online or physically in a library using taxonomy, by checking for 856 tags with a second indicator of 0 to indicate online versions.