RLG DigiNews
BROWSE ISSUES
SEARCH
RLG
   
RLG DigiNews Banner
  April 15, 2002, Volume 6, Number 2
ISSN 1093-5371       

 


Table of Contents


Feature Article 1 
RLG DigiNews: Taking Stock at Five Years, by the Editorial Staff

          Link Analysis in RLG DigiNews, by Richard Entlich

Feature Article 2
Digital Copiers and Scribal Musings, by Fred Stielow

Conference Report

The Final Cedars Workshop: a report from Manchester, UK, by Michael Day

Highlighted Web Site

Open Archival Information System

FAQ

Where Are They Now? By Peter Botticelli and Richard Entlich

Calendar of Events


Announcements

feature article one

                    print this article print this article

RLG DigiNews: Taking Stock at Five Years
The Editorial Staff
preservation@cornell.edu

Introduction
Five years ago, RLG published the first issue of RLG DigiNews. A lot has changed since then—and a good bit has remained the same. We're using this anniversary issue as a case study to reflect on those changes. This feature article discusses key turning points for RLG DigiNews from the access and preservation perspectives. Our FAQ asks "where are they now" as it follows up on two projects that were announced in the first issue. In the June 2002 issue, we'll report on several more. The fate of these projects, like the other changes that the editorial staff of RLG DigiNews has witnessed, are revealing of both the opportunities and the obstacles that line the shores of a swiftly moving technological sea.

RLG DigiNews had its roots in an RLG electronic group-based document, "Diginotes," compiled by members of PRESERV as a way to keep pace with the rapidly developing field of digitization. In the two "issues" distributed via email to a special RLG discussion list, "Diginotes" contained announcements on, and citations to, "library imaging technology and applications." Though "Diginotes" ceased after two compilations, the need for timely information on the topic of digitization did not.

Responding to member requests for assistance and information, a new Web-based document (RLG DigiNews) was born in early 1997. RLG joined forces with the staff of the Cornell University Library Department of Preservation and Conservation to provide a "substantive, informative, and timely response to the expressed desire of preservation specialists for an easy-to-understand, broadly conceived information stream on selected worldwide efforts in the converging fields of preservation and digitization." The editors promised to capitalize on the "enhanced functionality of a Web-based publication, by providing hot links to featured documents and enhanced searching capability." The publication has matured considerably in the past five years, as reflected in the changing masthead (figure 1) and the list of publishing milestones presented below.

1997 RLG DigiNews masthead 1998 RLG DigiNews masthead
1999 RLG DigiNews masthead 2000-2001 RLG DigiNews masthead
2002 RLG DigiNews masthead


Figure 1: The masthead has evolved to reflect the changing look of the Web


Publishing Milestones

  • RLG DigiNews began as a quarterly newsletter, but became a bimonthly publication in its second year. The Council on Library and Information Resources provided initial support for the two extra issues each year.
  • One FAQ and at least one Highlighted Web Site (HWS) have appeared in every issue. In recognition of the increasing popularity of FAQs, we moved this feature beginning with the August 1999 issue from its less obvious location in the midst of news and announcements to a more prominent place after the HWS.
  • Technical Reviews by editorial staff appeared only in Volume 1. These were replaced by periodic Technical Features, for the most part written by external authors.
  • The first Conference Report appeared in the December 1999 issue, reflecting the increasing importance and regularity of key meetings on digital imaging and preservation.
  • Beginning in 2000, each issue included coverage of some aspect of digital preservation. Relevant articles and items were flagged by a new icon that incorporated the infinity symbol typically associated with preservation, e.g., denoting the use of permanent/durable paper.
  • The Editor's Interview, introduced in August 2000, has provided an opportunity for focused discussions with key people on current hot topics.
  • For the first issue, searching was limited to use of a Web browser's "Find" or "Search" function, but by the second issue, viewers could browse the tables of contents or use keyword searching. Author and title indexes were subsequently added.


Access and Use

The initial intended audience for RLG DigiNews included "managers of digital initiatives with a preservation component or rationale." Since then, the reader base has grown dramatically, with the number of hits more than tripling from 1997 to 2000, from just over 20,000 hits to over 70,000. RLG reports that this publication is one of its most popular electronic resources. Lars Aronsson's Telecom History Timeline mentions the founding of RLG DigiNews as a historic event in 1997. The publication meets the Americans with Disabilities Act requirements for accessibility.

Each issue attracts thousands of readers on five continents. Back issues have a long shelf life. Usage of many early issues has not diminished substantially over time, and in some cases has increased. The October 15, 1998 issue, for instance, had more hits in 2001 than it did in 1999. Some issues remain significantly more attractive to users than others. The two most popular issues of 1999 featured lead articles on digital imaging and preservation microfilm (February) and digitization costs (October), proof that both topics continue to spark interest.

A Google index search identified over 1,000 links from various Web sites to RLG DigiNews. The publication shows up frequently on resource pages of consultants and faculty for digital preservation, digital imaging, and library conservation. Library portals around the world, including those in Australia, Canada, China, Europe, Israel, New Zealand, South Africa, South America and the United Kingdom, link to the publication. Twenty-one features have been highlighted in Current Cites, which each month selects "only the best items to annotate" from the current literature in information technology in print and digital form. Preservation "safekeeping" arrangements for many RLG DigiNews articles have been made with Australia's PADI, a subject gateway to select digital preservation resources. (see below)

 

map showing world readership of RLG DigiNews

Figure 2: A map showing the world readership of RLG DigiNews. The majority of readers come from North America (74%) and Europe (19.5%). Click on the pin click here to view map showing world readership of RLG DigiNews to see a larger map.

Chart showing number of visitors coming from various organizations

Figure 3: This chart shows the number and percentage of visitors coming from various Web domains, such as .COM, .NET,.ORG and .EDU for a three month period.

graph tracking first time visitors

Figure 4: Tracking First Time Visitors for three months reveals a peak in readership at the point of publication.

Preservation

While the access statistics for RLG DigiNews are gratifying, we were also interested in determining the health of the journal itself, especially since so many of the back issues continue to receive high use. The five-year anniversary offered a convenient milestone for reviewing our plans for long-term access to the content. Cornell staff prepared a list of key preservation considerations as a basis for self-examination and for identifying potential risk factors for the publication. Using these as a guide, the editorial staff at Cornell and RLG sought to assess the publication's preservation readiness.

1. Organizational commitment

  • What is RLG's commitment to maintain and continue the publication?
  • What is the funding stream; for how long is it secure?
  • Does RLG have a preservation strategy/plan in place?
Robin Dale, Program Officer, Member Programs and Initiatives, Research Libraries Group: RLG intends to maintain and continue the publication for as long as it continues to be a valuable resource to the community. The funding stream is a line item within the yearly budget and is continued from year-to-year. It is anticipated that this stream will remain in place for as long as RLG continues the publication. Regarding a preservation strategy/plan, we have several procedures in place. I make regular backup copies of the material to at least two different media, each of which is stored in a different physical location. The copies are regularly refreshed. This is in addition to the technical infrastructure and back-up described in question 2 below and the third-party archival arrangements described in question 9. Finally, RLG DigiNews issues, along with other selected RLG publications, are part of a testbed digital archive project currently underway at RLG. With all of these strategies in place, we feel comfortable with the security of RLG DigiNews.

2. Technical infrastructure

  • Where do the bits reside?
  • What kind of hardware/software/server is used?
  • Is it backed up, 24/7 supported?
Robin Dale: RLG DigiNews is an integrated part of the RLG corporate web site content. With a 1.544 Mbs connection to our Genuity internet service provider, RLG's corporate web server, named Lyra, sits on a 1.5 Mbs LAN. Presently, the server is a SUN UltraSparc 2, running the Solaris 2.5.1 (Unix) operating system. This is a single-CPU rated at 400 Mhz speed, with 250 Mb internal memory and 20 Gb external disk storage. It runs 24 X 7 X 365, and is continuously monitored for availability. The system and data are fully backed up weekly, using Veritas NetBackup, involving a rotating four-cycle process and off-site tape storage. The server platform will soon be upgraded in connection with an overall corporate website make-over project, and will be run on a SUN 220R server, which is a dual-CPU rated at 440 Mhz each, with 2 Gb of internal memory and 40 Gb of external disk storage. The new Lyra corporate web server will be running the Sun Solaris 2.8 (Unix) operating system. Trained RLG and Stanford University technical staffs are on call at all hours to ensure fast response to any special system needs. Vendor support from SUN is designated as "Silver", meaning we have contracted for "within four hours" on-site response for 8 hours daily on weekdays, with 24-hour support at all times by telephone.

3. Data Fixity

  • What means are in place to secure the files and protect them from unauthorized change and use, data corruption, etc?
Robin Dale: The Lyra server is a bastion host, meaning it must be accessible to the
general Web-using public. Because of this, it is also locked down from an information security standpoint. Access to other servers is tightly controlled, as well as the capability to use the Lyra server as a "Trojan horse" to access other sites. All Lyra server changes are made by system administrators who are permitted differing degrees of capability under user identification and password protection. The system also runs PERL 5.6 and a Web Indexer. Other software in support of the RLG corporate web site on the Lyra server includes Digital Certificate Issuing software for credit card transactions, a Virtual Web Server for access to detailed customer accounting reports under customer id and password control, an anonymous FTP server and a POP Mail server. The system uses the Apache web server software. Upon installation, the Lyra server was configured with the RLG-customized "security hardening" kit, which includes software such as YASP, tripwire-like features, and monitoring software to ensure the security of the server environment. The server platform and all network connections enjoy the physical security of being located in Forysthe Hall on the Stanford University campus, having very secure physical access and operational integrity characteristics.

4. Format stability, reliability, and complexity

  • What formats are used? What versions?
  • Do they adhere to common, open standards?
  • Is the coding correct and the data validated (e.g., HTML validator, parser)?
  • Does the journal rely on experimental approaches (e.g., technology that may be very short-lived)?
  • Is format control exercised by the editorial staff (E.g., does the staff do the mark-up or do contributors, does the journal establish and maintain format requirements)?
  • How complex are the formats in terms of variations, computation, volume?
Cornell editorial staff: RLG DigiNews currently uses GIF and JPEG for images and the current version of HTML for text markup. Current RLG DigiNews manuscript submission policy expands the acceptable article submission formats from ASCII to include Microsoft Word and RTF (Rich Text Format).. Current versions are used. We adhere to open common standards. Mark-up by the editorial staff is consistent and follows established standards. The RLG staff adheres to established procedures for validating the content before posting each issue. The journal tracks trends in Web site design and management, but uses technology that is readily available to avoid inhibiting use or maintenance. The staff does the mark-up using format requirements that adhere to RLG requirements and good practice. The formats used are not very complex. The content is intended for easy use in international settings.

5. Authenticity and Provenance

  • What is the policy on correcting mistakes? Is the original version maintained or the changes noted?
The Editorial Policy for RLG DigiNews is as follows: "Upon discovery and notification to RLG, the error is corrected and a note is inserted into the text to explain the reason why the text was corrected, as well as the date of the correction."

6. Redundancy

  • Is the publication mirrored? If so, where and in how many places?
  • Is there a formal agreement for mirroring in the works?
Robin Dale: The publication is not currently mirrored though we are considering some possible arrangements. Discussions with specific institutions are in the preliminary phase, though implementation of any agreement probably won't take place until at least the end of 2002.


7. Metadata

  • Technical (are the technical approaches well documented, e.g. use of javascript, the guts of the technical application, dependency on external programs and scripts, documentation on changes):
Cornell editorial staff: Yes, the source code includes scripts and these are well-documented. We document changes in policy and practice.
  • Navigation (nature and extent of descriptive and structural metadata, e.g., SGML, Table of contents, consistency of approach, etc.):
RLG DigiNews uses Dublin Core metadata elements and keywords for descriptive metadata. Each issue contains an embedded table of contents to help users navigate through the content. The mark-up is consistent from issue to issue, and changes in the structure and presentation of the content are noted at the time they are implemented.
  • Resource discovery: How can people find the journal?
  • What search engines and abstracting services pick them up?
  • What practices promote/inhibit resource recovery (e.g., use of metatags)?
  • Does the journal provide indexing/searching features itself?
Each issue of the journal is announced on major professional electronic mailing lists. RLG DigiNews is actively promoted by RLG and features prominently on the RLG Web site. As noted earlier RLG DigiNews is well-represented on institutional, organizational, and personal web sites devoted to digital imaging and preservation information. RLG DigiNews uses META tags for Dublin Core data elements, for keywords, and for high-level content elements. Consistent and correct formatting also promotes resource discovery. The RLG DigiNews site provides Author and Title indices, as well as links to back issues and basic full-text search capability. RLG also permits crawling of its site by major search engines, including Google, to facilitate resource discovery by users.

8. External dependencies

  • Does the publication use consistent/persistent link naming?
  • What's the nature and extent of dead links?
  • Are links really dead or just moved?
  • What policy does the publication follow, if any, when including external links, e.g., linking priorities, extent of monitoring and updating?
Cornell editorial staff: RLG DigiNews does not use any of the persistent link approaches. Each issue is a single document from which individual articles can be printed, so each article does not have a unique identifier. The naming of issues is consistent. RLG DigiNews incorporates many links into every issue, including the Highlighted Web Site (HWS). All of the back issues of RLG DigiNews are available on the RLG site and the staff uses link analyzers to monitor the site, though past links are not corrected if the sites are moved or removed. For instance, of the 34 HWS in the first five volumes of RLG DigiNews, 27 are still active and seven have moved. Of those seven, two have merged into one site, one has linked to a new site through a redirect, and one link has become corrupted.

The sidebar by Richard Entlich discusses the issue of link integrity.

9. External, third party archives

  • What external archives cover the journal and how complete is the coverage; to what extent could the journal be recreated from these archives?

Robin Dale: The Internet Archive can capture pages and sites that might not be saved otherwise and may be a piece of a retention program but should not be viewed as a substitute for a digital preservation program for RLG DigiNews. The Internet Archive holds copies of all issues of RLG DigiNews with the exception of the last, Volume 6, Number 1, 15 February 2002. This is because the most recent crawl of the site was on 7 February 2002. Since crawls of the site tend to take place every 8 to 9 weeks or sometimes longer, I'd imagine that this "missing issue" will be covered soon. Other than that, the journal could be recreated.

Cornell Staff Note: The Internet Archive recently launched the "Wayback Machine," an online tool to search the archives's vast holdings. When we used the Wayback Machine to search for RLG DigiNews we discovered some interesting results, as indicated by the table below. Note that the first copy listed for the April 1997 issue was from May of that year, while the first copy obtained for the RLG DigiNews home page was from December 1998, a year-and-a-half after the journal was first published.

  First Capture
RLG Diginews home page Dec 2, 1998
April 1997 issue May 3, 1997
April 1998 issue Aug 15, 2000
April 1999 issue Dec 9, 2000
April 2000 issue Aug 17, 2000
April 2001 issue Apr 18, 2001


Table 1. The Wayback Machine capture dates for RLG DigiNews issues


Besides the Internet Archive, the National Library of Australia's Preserving Access to Digital Information (PADI) initiative has established the Safekept program. Being selected for the Safekept program provides an opportunity for organizations to establish, review and/or enhance their Web site preservation programs. The SafeKept program identifies a nine-step program for insuring the preservation of the selected Web sites, to which contributing organizations must adhere. The Safekept program has many, but not all issues of RLG DigiNews marked within its databases. This "minimalist" approach was borne out of an agreement with PADI that RLG would provide for/maintain all of RLG DigiNews in many ways and formats and therefore it wasn't absolutely necessary for NLA to do the same.

10. Look and Feel

  • Is the old design and any functionality of early issues maintained (e.g., interface changes)?
Cornell editorial staff: Yes. The back issues include the masthead that was in use when the issue was published. Users can view changes in the journal's presentation and format. Earlier we noted the various changes and introductions to the journal.

11. Virtual content

  • How much of the content is virtual (e.g., created at the point of access, or generated on-the-fly)? How well can this virtual content be maintained?
Cornell editorial staff: The content of RLG DigiNews is captured in static HTML documents. Recent issues contain a script that monitors use of the site, but there is no virtual content to be maintained.

12. Ability to retain extended (added-value) services of the journal

  • Is preserving its function as a ready reference database or other information services supported?
Robin Dale: The functionality of RLG DigiNews is easy to maintain. The search capability is upgraded periodically to continue to provide basic searching using current technology. Features that provide access to the content are based primarily upon links that support navigation between issues and identifying topics and authors of interest.

What do our readers think?
In our February 2002 issue, we included a readers' survey to help guide our planning for the next five years of RLG DigiNews. Responses came from 233 readers, including 66 written comments. Overall, we received an abundance of valuable information, for which we thank everyone who offered their time and thoughtful feedback.

RLG DigiNews is the single most valuable source of information that we have. Its current focus is right on target. a quote by nancy kushigian


We were pleased to discover that 87 percent of respondents claimed that the content of RLG DigiNews is "just about right" in its usual level of technical detail. We found it interesting that two-thirds of respondents favor digital imaging features over other parts of the journal. However, respondents were split on the type of content they prefer; 39 percent favor policy recommendations, while 28 percent prefer information on technical standards and best practices, and 27 percent favor equipment reviews. Preferences may be changing, as we found that the number of new readers (those who have discovered the journal within the last two years) were slightly more numerous than responses from long-time readers.

We were also interested to learn that 60 percent of respondents discovered RLG DigiNews through listservs, while only 23 percent learned about it from colleagues and just 10 percent through other publications and Web sites. This is no doubt due to posting announcements of the release of new issues on 20 listservs worldwide.

In sum, as we begin our sixth year of publication, the state of RLG DigiNews appears healthy. Although we can't forecast technological advances and readers' interests for 2008, we can expect at least as many changes as the past five years have witnessed. The editorial staff will maintain flexibility, will periodically take the pulse of our readers, and will look forward to writing the 10th anniversary article!

Link Analysis in RLG DigiNews
Richard Entlich

Once an issue of RLG DigiNews goes "to press" its content is considered fixed. Subsequent changes are only made to correct significant errors, and those are always documented within the issue. Although such a policy guarantees the editorial integrity of the publication, it also means that, over time, the links to external Web sites will gradually degrade.

How bad a problem is "link rot?" Pretty bad, as any regular Web surfer will quickly tell you. Not only does Web site content change and move around a lot, but domain names lapse and are reassigned on a regular basis.


We conducted an analysis of links to external Web sites from the first five years of RLG DigiNews issues. The results are shown below. When all the issues for each year of publication are averaged together, there is a nearly linear increase in the annual percentage of bad links, from about 10% in our most year of publication (2001) up to about 40% in our first year (1997).


A graph showing external link status from 1997 to 2001

There were a total of 1236 unique external links in the 28 issues of RLG DigiNews published prior to this issue. How did we determine the validity of so many links? For the most part, we did what most people would do when faced with such a large task: we took advantage of automation and employed a software link checker. Writing a link checker must either be a very popular assignment in computer science classes or else part of a rite of passage into the world of open source computing, because there are dozens of link checkers available. Many are freeware or shareware or free online services. A good list of available products and services (with a few of its own bad links) is available at http://www.elsop.com/wrc/comp_ls.htm.


As one might expect with such an abundance of products on the market, there are some substantial differences in features and performance. We tested only a few products, but found some considerably more flexible and powerful than others. The differences have an impact not only on ease of use and reporting capability, but on the validity of the results.


The operational basis for link checkers is fairly straightforward. Much like a browser, a link checker connects to Web sites using http (hypertext transfer protocol) and gathers data from the server. However, instead of rendering the data into a viewable Web page, it extracts status information that is part of every http exchange, but not always displayed to the end user. Most everyone has seen a message reading "404 Not Found" when trying to access a URL that's no longer available. The 404 is one of dozens of http status codes defined in the protocol. A "successful" http transaction usually produces a status code such as 200 (meaning "OK") or perhaps 301 or 302 (for a redirect to a new location). Status codes for properly completed http transactions are not normally displayed to users.

How accurate a picture of a Web site's link status does a simple compilation of http status codes provide? That depends a lot on one's definition of a "good"' or "bad" link. For our purposes, we defined a good link as one that leads to (roughly) the content intended by the original reference, either directly or through an automated redirection or refresh. We even accepted situations where one additional manual click would be required to find the original content, as long as the path to that material was fairly evident on the first page brought up.

Unfortunately, even given this fairly liberal definition of "good," most link checkers relying on simple status codes will significantly overestimate the number of good links on a site. There are many situations in which a site that appears good to a link checker may in fact fall short of our definition of good.
  • Status code does not equal content. Just because a URL is still good doesn't mean the content hasn't changed dramatically. The domain name registration may have lapsed and been purchased by another entity. There may be no connection whatsoever with the original content or content provider.
  • Putting out mixed messages. While displaying html content that clearly says "bad," the status code sent out via http says "good." This happens most frequently with sites that have substituted a custom error message for missing pages and failed to associate the proper status code with it. It's obvious on manual examination that these pages are bad, but without special effort, a link checker won't detect it. Depending on how they obtain their status codes, different link checkers may vary in their reporting on such sites.
  • Moved or lost in transit? There are various ways for sites to indicate that content has moved to a new location. One method (called a meta-refresh) is usually reported as good as long as the page with the "we've moved" message loaded correctly. There are also different kinds of redirects. Redirects produce their own status codes, so at least one is alerted that something may be amiss, but a link checker may assume that as long as the redirect succeeds, the page is good. Frequently this is not the case. Also, since assignment of redirect codes requires manual intervention by site operators, it is prone to misapplication.
The figures in the graph above were not taken at face value from a link checker. We did some manual checking of the results in order to verify their accuracy. We manually checked all reported codes for permanent or temporary redirections. We found that 25% of the permanent redirections and nearly 30% of the temporary redirections did not pass our definition of good. We also checked certain other reported server problems. We adjusted the results in the graph to reflect these findings.

We did not manually check all the links reported as good by the link checker, but we did test a subset, just to get an idea of the error rate. Of the 68 links reported good in the three issues of RLG DigiNews for 1997, 14 (a little over 20%) did not pass muster. This discrepancy is not accounted for in the graph above, so, according to our definitions, the percentages of bad links in RLG DigiNews it shows should be higher.

Despite these limitations, however, link checkers can form an essential part of a Web maintenance program, whether for your own site, or an external site that contains resources important to you. But it is important to understand that link check reports cannot necessarily be taken at face value. More expensive link checkers provide configuration and customization options that can help produce more accurate status assessments. Given the abundance of available link checkers, our advice is to take advantage of freeware as well as trial versions of shareware and commercial products to find the product that meets your needs.

—Peter Botticelli, Robin Dale, Carla DeMello, Barbara Berger Eden, Richard Entlich, Anne R. Kenney, and Nancy McGovern

feature article two

                     print this article print this article

Digital Copiers and Scribal Musings

Fred Stielow
Walter P. Reuther Library
ai6050@wayne.edu


A funny thing happened while shopping for photocopiers for the Walter P. Reuther Library of Wayne State University. They are apparently becoming obsolete—going the way of the phonograph record, punch card, and fountain pen. The industry is quietly converging on digital scanning and the microchip. This new technology goes beyond a simple replacement. It has significant implications for the future of libraries and archives along with interesting echoes from their past.


Historical Framework
Libraries and archives have long reflected the information technology of their times. Ancient librarians were experts in reprography. Their duties centered on the recording and recopying of information, including specialized skills for sealing Sumerian cuneiform and the manufacture of secret inks for Egyptian papyrus. In the Dark Ages, the focus was on parchment codices and copying cycles to preserve the knowledge of the ancients. This was the Opus Deum—or God's work—that defined the scribal librarian.

A scribe in his study. Valerius Ma imus, "Faits et dits mémorables." Bruges, 1479

Figure 1: A scribe in his study. Valerius Ma imus, "Faits et dits mémorables." Bruges, 1479

Gutenberg's invention altered the job. Archives branched off into a parallel institution for hand-written materials. Librarians largely abandoned the hands-on skill of the copyist. They lost control of the means of production. Arguably the world's first capitalists, publishers appeared on the scene to decide what would appear in print and, incidentally, they redefined librarianship. Publishers made reading portable and developed the crucial trappings for the modern book—e.g., the title page, tables of contents, indexes. The new industry also helped spawn the concept of authorship and the concomitant need to protect the right to profit through copyright (1).


Library involvement with reproduction techniques returned to a degree at the turn of the twentieth century. A new profession had just appeared in response to the rise of mass culture, which followed the introduction of wood-pulp paper and advances in the printing press. The first generation of college-trained librarians benefited from printing advances for their professional tools, especially the Library of Congress's (LC) introduction of inexpensive catalog card sets. The typewriter with supplementary mimeographs and carbon copies also entered the consciousness of information specialists. These technologies encouraged the development of new descriptive tools, especially the invention of archival finding aids at LC's Manuscript Division (2).

Millionaire Maker ad for an Edison Rotary Mimeograph

Figure 2: Taken from the Atlantic Monthly Advertiser in The North American Review. / Volume 82, Issue 494 from Cornell's Making of America Collection


Reprography blossomed anew in the 1960s. The advent of photocopiers brought unexpected change. "Xeroxing" became a new verb in short order. We could do away with messy mimeographs and carbons. The ability to make facsimiles helped repositories to deter theft and mutilation, as well as assisting with preservation. Photocopiers even impacted the law. They joined the cassette recorder as the technical impetus for the 1976 Copyright Act: Title 17 US Code. The largely unfettered "right of sale" was revised to protect authors, but still give libraries and archives reasonable abilities to make copies. Section 107 established the principle of "Fair Use." Special dispensation was granted in Section 121 to reproduce for the blind or others with disabilities. Most importantly, Section 108: Reproductions by Libraries and Archives gave the institutions the authority to make preservation copies and allow users to make their own copies on their premises. In exchange, the law called for clear procedures, postings, and rights management policies (3).


Technology and repository interests moved forward. The first generation of specialized "wet" papers and awkward processing matured to plain paper copiers. Machines added production capacity and ever more bells and whistles. Repositories developed the copy center—a new business service and revenue stream. The new equipment brought many institutions face-to-face for the first time with the intricacies of repair contracts and licensing arrangements. In the 1980s, an onslaught of facsimile machines pushed toward the provision of electronic means of distribution. Although somewhat constricted by the danger of viruses and copyright implications, many libraries also responded to the delivery potentials of downloading to computer disks and in the 1990s to e-mailing through the Internet.


Coming Impacts

The most recent advance in reprography is the move from analog photographic process to digital scanning. This is a robust and differentiated marketplace with competition among Bell & Howell, Canon, Ricoh, and 3-M. The Reuther Library is considering three types of equipment. One is a volume machine replete with a hopper feeder, collation bins, and automatic stapler. Secondly, our reference room will trade in its microfilm reader/printer for a significantly enhanced product. Third, the conservation lab will get a "Cadillac." The new machine has an oversize scanning area and can cradle delicate materials. Its microprocessor will correct for curvatures in the page and remove the image of a finger holding down the page. The technology offers other equally intriguing prospects and implications:

  • Although the first copy is slower than with analog machines, warm-up time is shorter.
  • The "scan once, print many" feature reduces the physical scanning for large jobs and allows for efficient storage for recurring jobs.
  • Storage is in the computer's temporary RAM (random access memory) as standard TIFF files, which offer crisp and exact output (utilizing non-lossy compression).
  • Identifying marks and time stamps can be automatically embedded in the image. You may also be able to embed electronic watermarks or signatures.
  • Files are available for faxing or e-mailing.
  • Machines can be tied into a local area network for remote control and collective action; e.g., several copiers could be tied together for large jobs.
  • The printer is separate and can be networked into a LAN as a remote station for a variety of jobs.
  • Use can be monitored and for the first time we can actually capture or identify the images being scanned.
  • Images can be exported for fine-tuning by imaging software, converted to text by optical character recognition (OCR) systems, saved as PDF files to preserve the original layout, or to JPEGs for posting on the Web.
Policies, Law, and Copy Centers
Like photocopiers before, digital copiers usher in concerns. The first centers on the law. Legal may be more significant than economic factors and have become more prominent in response to the Web. The Digital Millennium Copyright Act (DMCA) of 1998 brought implied challenges to Fair Use and Section 117 exclusions for computer programs. I predict that digital copiers will heighten the tension. They will likely stimulate direct attacks against archival and library exceptions under Section 108. And, the additional kinds of management tools available with the equipment may lead to calls for heightened policing and screening of users (4).


Copying policies thus need to be dusted off and reconsidered. On one hand, repositories must insure that the increased ability to monitor does not inadvertently jeopardize users' rights to privacy. On the other, a proactive stance on copyright protection is in order. Even if the repository is only concerned with replacing existing photocopiers, it must be on guard. Its policies should clearly proclaim:
  • Scanning files are temporary, a one-shot application to reproduce an in-kind copy on paper.
  • Patron copy stations are limited to printed output.
  • In-house duplication services, especially for items in copyright, are similarly print only.
I doubt that we will be able to stop at one-time duplication. Digitization and "scan once, print many" features are simply too powerful to curtail. The service bureau would lose the economies of storing files for subsequent uses; e.g., for the production of course packets. Direct user access to scanned files offers great possibilities, but would need to be carefully handled for copyright clearance. Moreover, successful legal challenges to Fair Use exemptions could even force our institutions to use the monitoring capacities of digital copiers to assist with the collection of royalties.

Management

Other administration topics come to the table. Management must look at the bottom line. At present, digital equipment is more expensive than analog. But we can expect that prices will decline. Per-item charges appear to be lower, and the new machines are touted as more reliable. Although I have become fond of our copier repair people, the thought of not seeing them so frequently is interesting. You may want to pull out your spreadsheets, monitor prices, and consider delaying or phasing in purchases.


Digital copiers offer interesting synergies. For example, the duplication of newspaper clippings and the like for vertical files automatically becomes a scanning exercise. Instead of a dedicated project, local treasures of archival materials are copied and stored as byproducts of duplication orders.


As the inevitable cycle of replacing old copiers proceeds, every repository gains the technology to build its own digital archives. This change brings different copyright questions. Do you want to protect the institution's ownership, especially when publishing to the Web? You could choose to embed an ownership mark in the image or a digital watermark (Note: digital signatures or watermarks in the original TIFF file may be lost or degraded in transit to a JPEG or PDF on the Web). Photographic and pictorial images add to the mix. While one could mount detailed images for downloading, most institutions may be comfortable with JPEG thumbnails. These provide users with serviceable access to the image, but not high-quality reproductions.


Given the lure of the Web, management will need to consider allied formatting questions. One can argue that a letter from Thomas Jefferson demands a faithful reproduction. Adobe's PDF format presents a different option for duplicating a page image and helping to insure the authenticity. Yet, many researchers are only looking for online searching without regard to original layout. Are there justifiable reasons for maintaining an in-house genealogy collection, archival finding aids, or guides to area resources in print view? HTML or XML documents will likely suffice. You may also want to ask:
  • How much do you put on the Web?
  • Is the aim complete public access or to tease people to research in-house?
  • Are there financial prospects to be examined and are you afraid to "give away the store?"
  • Are there privacy, legal, or political factors that mitigate against mounting certain materials?
Preservation processing with digital copiers leaves us on murky ground. The purist understands that reformatting is for access and not true archival preservation. Results are without artifactual values such as by-products of aging, aesthetics, physical features (like bindings) along with the autographs or provenance. Digital images are also not directly readable by humans and will not last as long as their paper alternatives. Still, digital copiers have preservation management implications. They insure that some version will persist and reduce wear and tear on the original documents.

Digital preservation schemes for published materials are blossoming. The U.S. Government is pushing for digital preservation through the Portable Document Delivery Format and Federal Information Processing Standard. Discussions rage on metadata standards and the implications of XML and RDF. OCLC and RLG are looking at long-range retention through NASA's OAIS (Open Archival Information System) reference model. The call is sounded for elite "trusted" repositories (5).


What will happen to the preceding scenarios with the advent of digital copiers? Will digital copiers democratize preservation and disburse it among thousands of local initiatives, much like the haphazard legacy of the scribal era? Will the process of saving the documentary heritage become the province of the copy center? Will users find the documentary heritage through Web software search engines, which do not pay attention to the Dublin Core or other specialized preservation tagging?


Concluding Thoughts
In sum, the onset of digital copiers opens a number of questions. This quiet invasion surely demands that all repositories examine their preservation and copying policies. History seems to offer some helpful perspectives from the introduction of the photocopier. But, I am most struck by the potential for digital archives. A new technology in the Web era may even restore librarianship with a scribal mission, which it lost in the print revolution.

Footnotes
(1)
Elizabeth Eisenstein, The Printing Press as an Agent of Change, 2 vol. (New York: Cambridge, 1979) remains the key volume on the Guttenberg revolution e.g., See: "How Revolutionary was the Print Revolution, American Historical Review, 107 (February 2002): 84-128. (back)
(2) The mass culture and nationalization thesis was put forth in my article, "Censorship During the Initial Phase of Library Professionalization," Journal of Library History, 18 (1983): 37-54. (back)
(3) Copyright: Title 17 US Code (back)
(4) Library of Congress Copyright Office, Study Required by Section 104 of the Digital Millennium Copyright Act, 2001. (back)
(5) JSTOR; Making of America, http://moa.umdl.umich.edu/ & http://moa.cit.cornell.edu/moa; OAIS; Kizer Walker, "Integrating a Free Digital Resource: The Status of Making of America in Academic Library Collections, RLG DigiNews, Feb 15, 2002; Robin Dale & Meg Bellinger, "Collaboration between RLG and OCLC With Digital Archiving Initiatives," RLG DigiNews, Dec. 15, 2001; OAIS. (back)

feature article two

                     print this article print this article

The Final Cedars Workshop: a report from Manchester, UK

Michael Day
UKOLN
University of Bath
m.day@ukoln.ac.uk


On 25-26 February 2002, 70 invited delegates met at UMIST in Manchester for the final Cedars Workshop. Digital preservation has been a high strategic priority for the Consortium of University Research Libraries (CURL), as evidenced by the Cedars (CURL Exemplars in Digital Archiving) project, which was a collaboration between three CURL institutions, the universities of Cambridge, Leeds and Oxford. The project was due to end in March 2002 after four years. The outcomes of the project were therefore of particular importance to CURL libraries and most of them were represented at the workshop. Publishers have also been regarded as key stakeholders in digital preservation and constituted another group of invitees, as well as playing an active role in the programme for the workshop. Other key decision-makers were also invited. The main purposes of the workshop were:

  • to disseminate information about the Cedars project to the main beneficiaries of the project,
  • to put that work into a wider context, recognising some of the key developments in digital preservation which have developed in parallel with the Cedars project,
  • and last but not least, to look forward to what should happen after the end of the project.

The workshop was structured around five main areas of Cedars activity, metadata; intellectual property rights (IPR); preservation strategies; collection management; and the distributed archiving prototype. Written guides to each of these have been prepared and will be made available from the Cedars Web-site. The guides to metadata, IPR, and collection management are also available in hard copy. Because there is an overlapping interest between publishers and libraries in metadata and IPR, the sessions on these were held on the first day and presentations on the work that Cedars had done combined with a look at the publishers' perspective and the wider context made a stimulating first day. The second day concentrated on the work of Cedars with presentations on the wider perspective. Sessions on preservation strategies, collection management, and distributed archiving led to lively and informed discussion.

The opening session on metadata introduced preservation metadata and linked this to some key concepts defined in the Open Archival Information System (OAIS) reference model (1). Issues raised included the comment in the presentation by Geeti Granger (John Wiley & Sons Ltd.) that the development of metadata systems for Wiley's digital archive had involved making considerable changes to the working practices of publishers and their suppliers. Another problem was that the exact costs of generating and maintaining metadata are unknown. Granger noted that even if these costs could be estimated, it would be difficult to calculate the exact return on investment. Later discussions included the comment that "submission metadata" required from publishers should, where possible, be able to be easily mapped from the information that publishers already hold. It was also observed that it is probably time to stop developing detailed preservation metadata specifications and to begin evaluating their use in real implementations.


Issues raised in the session on IPR included the comment that contracts would increasingly provide the main framework of discourse rather than legislation. Also that content was sometimes tied up with "'look-and-feel"' and that the authenticity of the content would sometimes depend on this being preserved. The presentation by Andrew Charlesworth (University of Hull Law School) stressed that copyright was traditionally seen as a social bargain in that the state provided particular powers to rights holders with the aim that society as a whole could benefit. One of these benefits is that copyright works will eventually pass into the public domain. In the digital world, this aspect of the social bargain was in danger of being neglected by some rights holders. In addition, the typical time-span of copyright lasting until 70 years after the death of authors is likely to mean that digital materials are no longer accessible by this time. Despite this, he also argued that many academic publishers, librarians and others involved in digital preservation were becoming increasingly aware of the benefits of collaboration.


The technical strategies session included an account of work in the CAMiLEON (Creative Archiving at Michigan and Leeds: Emulating the Old on the New) project (2) on software longevity and an introduction to the concept of "migration on demand." Margaret Hedstrom (University of Michigan) argued that different preservation strategies would need to work together. She said that strategies would need to take account of technical complexity, e.g. the potential loss of the significant properties of a resource through migration, and should be scalable. There was also a need for more awareness of collection management issues, the implications of rights management decisions on preservation strategies and more information on costs and effective economic models.

The following session concerned collection management. Amongst the many issues raised was the need for increased staff awareness of digital preservation issues. As in other sessions, it was noted that it was almost impossible to get precise information on costs. In his presentation, Neil Beagrie (JISC Programme Director for Digital Preservation and Secretary of the Digital Preservation Coalition) noted that we would have to adapt to high volumes of information which would emphasise the importance of distributed solutions and the automation of ingest and metadata capture processes.


The last workshop session on "distributed archiving" included a description of the workings of the Cedars prototype. Lynne Brindley (The British Library) gave the final presentation. She reviewed comments that she had made at the Cedars Preservation 2000 conference (3) and remarked on progress since then. She argued that we need to be more precise as to what roles and responsibilities should be undertaken by particular types of institution and that we should be more creative in thinking about how to fund digital preservation. Brindley also outlined the role of the newly formed Digital Preservation Coalition and urged all to support this initiative.


A more detailed summary of the Cedars Workshop will be made available from the Cedars project Web-site.

Footnotes
(1) Consultative Committee for Space Data Systems, Reference model for an Open Archival Information System (OAIS), CCSDS 650.0-R-2. Red Book, Issue 2, July 2001. (back)
(2) CAMiLEON project (back)
(3) Robin Dale and Neil Beagrie, Digital preservation conference: report from York, UK, RLG DigiNews, 4 (6), 15 December 2000. (back)

highlighted web site

Highlighted Web Site

Open Archival Information System (OAIS)

This page is the official source for information on the development of the OAIS, an emerging international standard for archiving digital files representing all subject types and formats. Links are provided to published documents and presentations on the OAIS project, including the key document produced thus far, the “Reference Model for an Open Archival Information System,” which is currently under review as a draft standard by ISO. In addition, the site links to the recently released Producer-Archive Interface Model, which defines a methodology for ingesting documents into an OAIS-compliant repository.

The site also links to the home page for the Consultative Committee for Space Data Systems Panel 2, which has been charged with developing the OAIS standard for use by working digital archives. One follow-up effort to watch is work on XML standards for data archiving by the Object Oriented Data Technology group at the NASA Jet Propulsion Laboratory.

Given the widespread acceptance of OAIS as a framework for constructing digital repositories, these pages are an essential resource for libraries and archives as they make plans for preserving digital information assets.



faq

                     print this article print this FAQ

FAQ

We often read about new projects and programs in RLG DigiNews, but what about past efforts? What results have been produced in the five years since RLG DigiNews began publishing?

The speed of innovation and the proliferation of digital library projects in recent years make it difficult to track the progress of new efforts once we report on them. So, for this issue, we decided to revisit a number of projects we announced in our first issue, in April 1997. While these are not necessarily representative of the digital library field as a whole, we found that the experiences described below offer valuable lessons for institutions contemplating new projects today.

National Agriculture Library: USDA Digital Publications Preservation Program

The April 1997 issue of RLG DigiNews, announced a collaborative effort to preserve digital publications created by the United States Department of Agriculture (USDA). A meeting in March of that year resulted in an action plan, later published as the "Framework for the Preservation of and Permanent Public Access to USDA Digital Publications" that led to the creation of the USDA Digital Publications Preservation Program, hosted by the U.S. National Agricultural Library (NAL).

In the past five years, the USDA Digital Publications Preservation Steering Committee, led by the Director of the National Agricultural Library, has begun to tackle a wide range of issues related to digital preservation. One planning group conducted an inventory of USDA Economic Research Service digital publications, in which 7,000 digital objects were identified. Analysis of this inventory will inform the Steering Committee about the current status of USDA digital publications, and the costs of conducting a Department-wide inventory. Another planning group hosted a conference on metadata issues, including preservation, for USDA employees responsible for creating, managing, making accessible, and preserving USDA digital publications.

A third planning group developed guidelines for USDA agencies that create digital publications, including a metadata template developed by NAL. These guidelines are designed to inform agencies on how to produce digital publications in a way that will best serve access and preservation.

In June 2001 the publication guidelines and metadata template were sent to the USDA Office of the Chief Information Officer as recommendations for policy and best practice. NAL plans to place the metadata template online for USDA agencies' use. Once the guidelines are adopted as formal USDA regulations, the Steering Committee will devise specific policies for managing digital publications across their life cycle.

Currently, the Steering Committee is evaluating the draft ISO standard for an Open Archival Information System (OAIS) and its potential applications for USDA digital publications and NAL's own digital library efforts. In any case, the USDA Digital Publications Preservation Steering Committee sees its ongoing educational efforts as key to building consensus and obtaining sufficient funding to ensure the long-term preservation of the USDA's growing number of digital publications. As concerns over creation, management, and persistence of digital publications have grown, NAL Preservation Officer Evelyn Frangakis notes that her program has been regularly receiving calls for advice from creators of digital publications across the USDA. Additional information about the USDA program may be obtained by contacting the NAL Preservation Office.

Digital Library News

In August 1997, RLG DigiNews announced a new publication called Digital Library News (DLN), sponsored by the IEEE Computer Society Task Force on Digital Libraries and the Advances in Digital Libraries conference. It was intended to offer a "periodic electronic snapshot for the Digital Libraries field" and to "gather notices of ongoing events and new developments" in digital libraries. DLN eventually published three issues, dated June/July 1997, January 1998 and July 1998. Then it fell silent.

What happened to Digital Library News? As is often the case, the Web site for DLN provides no clues. It contains an archive of the three published issues, a copy of the original publication announcement and (clearly outdated) information on how to subscribe to email distribution of DLN. A little slide show about the IEEE task force indicates that by November 1997, over 1,500 people had subscribed to DLN. The final issue includes a solicitation for more contributions and no hint that it might be the publication's swan song.

We contacted DLN's former editor, Susan Feldman, for some insight into its mysterious disappearance. As it turns out, the circumstances of DLN's shutdown were fairly mundane: a task taken on largely as a volunteer (IEEE provided expenses only, no salary) proved too time-consuming, and no one could be found to take over the editing.

That might be the end of the story, but it raises more questions than it answers. How typical is this kind of online publishing cessation? What happens to Web journals after they cease publishing? Which ones merit special attention for preservation? How much can be learned simply by visiting Web sites for the answers?

To put the demise of DLN into some perspective, we conducted a small study on the status of Web-based journals that began publication in 1997. We based our investigation on the 1997 Directory of Electronic Journals, Newsletters and Academic Discussion List, published by the Association of Research Libraries (ARL).

The 1997 ARL directory included about 3,750 electronic publications, so we narrowed the field to about 65 titles, including only those that were launched in 1997 and that were identified by ARL as journals (as opposed to 'zines or newsletters). Here's a summary of what we found:

Year No. of 65 titles still publishing at end of period % of titles still publishing at end of period
1997
65
100.0
1998
56*
86.2
1999
53
81.6
2000
46
70.8
2001
40
61.5
2002
(through early April)
22
33.8

* The number missing by 1998 includes six titles whose whereabouts could not be found and which are presumed to have had a brief publishing history.


Assuming most publications that produced content in 2001 are still functioning, more than half of the publications launched in 1997 appear to be still viable. Nevertheless, DLN has plenty of company. Of the titles we reviewed, a dozen either didn't survive in any form, or appear to have stopped publishing by 1999.

Determining at what point an online publication has ceased to be a going concern can be tricky. In most cases, all one can do is try to judge from the length of the hiatus and other signs of neglect. As with DLN, it is rare for a Web publication to openly acknowledge that it is no longer operating. There are a few exceptions. The Annihilation Fountain, which was described as "a journal of culture on the edge" published 12 issues and then wrote its own edgy obituary.

Quietly online, after an enjoyable run of 3 years, The Annihilation Fountain passed away. Life got in the way. No longer be able to find the time and not wanting TAF to whither (any more than it already has), TAF has opted for euthanasia.


Others are less willing to throw in the towel. The Journal of Credibility Assessment and Witness Psychology, a "peer-reviewed archival scientific journal focused on the scientific study of credibility assessment and witness psychology" has published three issues, the most recent in 1999. A note on its Web site says "JCAAWP is still in business. We have been suffering from a lack of publishable articles. Your submissions would be welcomed." The site was last updated in December 2000.

Even more unusual than a publication that acknowledges its death is one that has written a last will and testament. Consider Videre: A Journal of Computer Vision Research, "a refereed, archival internet journal of computer vision" which posted this note on its site:

Videre has ceased publication as of the end of Volume 1. The journal's content will be archived here on the MIT Press server for the forseeable future. If we decide that we cannot continue hosting this material, we will ensure its continued online availability through partnerships with academic libraries.


One reason publicly acknowledging death might be important is to provide a signal that help is needed to preserve the publication. As indicated earlier, we were unable to find any archive for six of the 65 titles. In a few cases the Internet Archive captured fragments, but typically not whole publication runs.

Does it matter if the output of these short-lived electronic publications disappears? Should Web publications be given more attention than print publications of similar stature?

Such judgments are for collection development specialists and cultural historians to make. We will note, however, that a search of OCLC's WorldCat shows that five colleges and universities thought enough of DLN to catalog it. We don't know whether they have learned that DLN stopped publishing four years ago or would be concerned if its archive disappears from its server at Rutgers.

Clearly, more analysis of electronic publishing patterns and the factors that lead to loss of content is needed. As part of Project Prism, Cornell is studying these and related issues in a Digital Information Longevity Study.

—PKB, RGE

calendar of events

Calendar of Events

Multimedia Archive Preservation: Practical Workshop
May 22-24, 2002
London, UK

This workshop will incorporate the combined experience of ten major European broadcast archives in developing a solution to digital preservation.

Off the Wall and Online: Providing Web Access to Cultural Collections
May 30-31, 2002
Lexington,MA

The Northeast Document Conservation Center (NEDCC) is sponsoring this conference, that will explore the powerful potential of digital technology to transform education, outreach, and marketing activities in museums and other cultural institutions of all sizes. The faculty will include museum professionals, technical experts, educators, and administrators, who will discuss new digital access possibilities as well as the complexities of producing digital products.

The Fifth International Symposium on Electronic Theses and Dissertations (ETD)
May 30- June 1, 2002
Provo, Utah
The theme this year is: Pioneering on the Electronic Frontier: E-theses and Intellectual
Solidarity. Topics include: ETD costs: direct cost versus hidden costs, digital libraries and ETD access issues, and multimedia content in ETD.

Digitisation Summer School
June 30th - July 5th 2002
Glasgow, Scotland

Participants in the course will examine the advantages of developing digital collections of heritage materials, and will investigate issues involved in creating and managing access to such collections. The lectures will be supplemented by seminars and practical exercises.

Fourth Annual UCLA/Getty Museums, Libraries and Archives: Summer Institute for Knowledge Sharing
July 29 - August 1, 2002
Los Angeles, CA
The institute is an intensive four days of instruction and dialogue for professionals involved in creating, sharing and preserving electronic information in museums, libraries, and archives.

Creating Electronic Texts and Images-A Practical "Hands-on" Exploration of the Research, Preservation and Pedagogical Uses of Electronic Texts and Images in the Humanities
August 18 - 23, 2002
New Brunswick, Canada
This course will center around the creation of a set of electronic texts and digital images. Topics include: XML tagging and conversion, using the Text Encoding Initiative Guidelines, e-books, the basics of archival imaging, and the form and implications of XML.


Symposium 2003- Preservation of Electronic Records: New Knowledge and Decision-Making September 15 - 18, 2003
Ottawa, Canada
Submissions are now being invited for this symposium. The purpose is to build awareness of digital preservation by bringing expert and leading edge opinions to a larger audience including small and medium-sized archives, libraries, and museums. The focus will be on making decisions and finding practical solutions that can be implemented immediately, especially for the materials that are at risk of being lost within the next 10 to 20 years. For further information contact: cci-icc_publications@pch.gc.ca.

announcements

Announcements

The Illinois OAI Protocol Metadata Harvesting Project Repository
This project is testing the viability of using the Open Archives Initiative (OAI) protocol for harvesting metadata, and exposing it with a search interface to enhance resource discoverability for materials that represent cultural heritage. The repository includes metadata records donated by over 26 institutions. Please send comments to Joanne Kaczmarek.

Evaluation Initiative for XML Document Retrieval
The DELOS Network of Excellence for Digital Libraries invites participation in an evaluation initiative for XML document retrieval. The widespread use of XML in digital libraries, catalogues, scientific data repositories, and across the Web has prompted the development of appropriate searching and browsing methods for XML documents. This initiative provides an opportunity for participants to evaluate their retrieval methods using uniform scoring procedures and a forum for participating organizations to compare their results. The invitation is open to all research groups with an interest in XML retrieval.


California Digital Library Opens Online Repository for Working Papers

The California Digital Library has launched a Web site and associated digital services to store and distribute academic research results and working papers. The eScholarship Repository includes a set of author and reader services for the rapid dissemination of scholarship authored or sponsored by faculty from the University of California. Its initial focus will be on working papers from the humanities and social sciences.


OAIster Project at the Digital Library Production Service of the University of Michigan Libraries

The OAIster project is one of the Andrew W. Mellon Foundation Metadata Harvesting Initiative grants. The goal is to create a wide-ranging repository of free, useful, previously difficult-to-access digital resources irrespective of subject area or format, and that will be easily searchable by anyone.

Digital Document Quarterly (DDQ)

Worth a look is the Digital Document Quarterly. This online publication will address issues such as document qualities that can be conferred and managed automatically, including integrity, confidentiality, and authenticity.

The Tibetan and Himalayan Digital Library

Making use of new digital Web based technologies to collaboratively solicit, generate, publish and integrate all aspects of scholarship on the geographical, cultural and linguistic regions associated with Tibet and the Himalayas, this site will be of interest to our readers.

publishing information

Publishing Information

RLG DigiNews (ISSN 1093-5371) is a newsletter conceived by the members of the Research Libraries Group's PRESERV community. Funded in part by the Council on Library and Information Resources (CLIR) 1998-2000, it is available internationally via the RLG PRESERV Web site. It will be published six times in 2002. Materials contained in RLG DigiNews are subject to copyright and other proprietary rights. Permission is hereby given for the material in RLG DigiNews to be used for research purposes or private study. RLG asks that you observe the following conditions: Please cite the individual author and RLG DigiNews (please cite URL of the article) when using the material; please contact Jennifer Hartzell, RLG Corporate Communications, when citing RLG DigiNews.


Any use other than for research or private study of these materials requires prior written authorization from RLG, Inc. and/or the author of the article.


RLG DigiNews
is produced for the Research Libraries Group, Inc. (RLG) by the staff of the Department of Preservation and Conservation, Cornell University Library. Co-Editors, Anne R. Kenney and Nancy Y. McGovern; Production Editor, Barbara Berger Eden; Associate Editor, Robin Dale (RLG); Technical Researchers, Richard Entlich and Peter Botticelli; Technical Coordinator, Carla DeMello.


All links in this issue were confirmed accurate as of April 12, 2002.


Please send your comments and questions to preservation@cornell.edu.

end of issue

   
 
RLG DigiNews
BROWSE ISSUES
SEARCH
RLG