Authority Control in the 21st Century: An Invitational Conference


Authority Records: Characteristics of Change

Noelle Van Pulis

The Ohio State University


Table of Contents


If you have examined your program, you will notice a title change. The original title sounded like I was going to tell a story. Once I started preparing for today, a better title became more obvious.


Introduction

New catalog (INNOPAC) - July 1994 (part of statewide OhioLINK)

New method of authority file updating (BNA) - 1994-

Establishment of procedures

Staffing needs


Scope

PCC statistics - looked at most recent 3 years as indicative of current activity.

OSUL notifications - used the NOV 1994 notification records because they represented the largest group received and also, since the service was new, were a group for which we kept track of the bib. updates as part of developing procedures; also have additional records not examined for this presentation.

Looked at types of records changed, fields changed and some examination of the nature of the change.

Changes have different implications depending on one's point of view. There are maintenance issues (for the authority file and the bib. records), as well as issues for use of the catalog and the authority file.


PCC: Pattern of Change (Names & Series)

NEW n%CHA n%
NACO
FY 9374,7638314,89517
FY 9481,3028217,77018
FY 9596,6558023,79420
OSUL
FY 947868810712
FY 957308315017

First table shows the activity at the national level for contribution of new and changed authority records for names and series. LC reported these separately, but I grouped them because series are received as part of the BNA name notification service.

Note that while the level of contribution overall is growing, there is a trend toward increasing change. This was true of our own institution as well.

However, it was not true of everyone, as the next table shows.


PCC: Pattern of Change - Varies (Names & Series)

NEW n%CHA n%
LC CO
FY 94383102897
FY 951067139093
UofVa
FY 944177513825
FY 959167040230

For the LC Cooperative Cataloging staff, there is a definite difference, in contributing more changed records than new ones. There was a slight decline in the rate of change.

I arbitrarily picked out one other NACO participant whose activity was somewhat different. In the case of the University of Virginia, there was a larger overall percentage of contribution of changed records, but they also showed a trend toward an increasing percentage of changed vs. new records.

Some of this, perhaps much, could be due to the increasing activity in series authorities, which tend to be more dynamic.


PCC: Pattern of Change (Subjects)

NEW n%CHA n%
FY 931,0318419816
FY 949188121419
FY 9519547274528
day 9119383060

Again, with subject authority records, the pattern is similar, with a greater jump this past FY in changes.

The number of name changes reported by PCC is about 1,982/mo.

OSUL received an average of 1,233/mo. for 9 of those months.

The first 3 months of LC's FY are excluded because they represent special notification from initial and gap loads rather than ongoing monthly notifications.

The number of subject changes reported by PCC is about 62/mo.

This is very odd in that OSUL received about 556 subject notifications/mo. I have not determined an explanation as yet.
NAME =NACOFY 9523,7941,982/mo
BNA notifJA-SE 9511,0961,233/mo
SUBJ =NACOFY 9574562/mo
BNA notifJA-SE 955,008556/mo

Calhoun & Oskins exam. of one mo. 1991 data of changes to LC AFs = N 60% new, 38% changed (not 80/20); S 38% new, 60% changed which was very different from the current trend.


The OSUL Environment: OSCAR

As part of the development of a state-wide system, OhioLINK, OSUL migrated from locally-developed system, LCS, to OhioLINK Innovative Interfaces system.

Following a contest, renamed the local system OSCAR. Early this year, a duplicate of OSCAR, named FELIX, was created for staff functions primarily in automation and technical services.

The initial load of bib. and auth. recs. took place in Spring 1994, followed by gap loads. We have just received back from BNA bib. and auth. recs. for cataloging done Sept. 1994 to Dec. 1995, and these have not yet been loaded.

The initial and gap groups of bib. records excluded some materials: e.g., ERIC docs., OSU theses, and short records used for circulation of items without full cataloging in m-r form. The group included CRL loaded locally, as well as OSU Law Library and Ohio Historical Society.

ARs received from the bib. processing excluded those without cross-references or significant notes. These factors are relevant when looking at the ratio of bib. recs. to ARs, which results in about 17 ARs for every 100 bib. recs.


Authority Records in Oscar

NAMESUBJMeSH
Initial '94304,900117,3715,546
FY 94/9597,40121,865541
FY 95/96n/an/an/a
NEW =402,301139,2366,087
FY 94/9516,83211,451n/a
FY 95/9613,3035,076n/a
NOTIF =30,13516,527
#BIBRs#ARsAR/BIB
March 22, 19963,214,218552,07417%
Total ARsName402,301
Subject139,236
MeSH6,087
Total547,624
Deletes-10,600
Totalca. 537,024

Notif added rather than overlay = higher #

Some are correct adds

Some dups. not yet deleted

[chart of AR Loads FY 95/96]
Typical mo.NAMES:range 801 for JAN 1996 changes
to 1778 for SEP 1995 changes
SUBJS:range 337 for JAN 1996 changes
to 766 for SEP 1995 changes

JAN appears to suffer from post-holiday slump, while SEP is a month of high energy and perhaps fuller staffing following summer vacations.

Note that for these notification loads, about 72% are NAMES (inc. SER) and 28% are SUBJ.


1994 NOV Notifications for Name & Subject Headings

NAME ARsSUBJECT ARs
100 Pers1543100 Pers337
110 Corp597110 Corp184
111 Conf78111 Conf8
130/1xx|t Series226130/1xx|t Series1
130/1xx|t UniTi83130/1xx|t UniTi28
151 Geog26151 Geog58
150 Topic0150 Topic1073
TOTAL2553TOTAL1689

Looking at the sample month examined in detail, NOV 1994, here is the breakdown by the 1xx tag.

For this paper, all 2,553 NAME ARs were examined. For SUBJ ARs, only topical headings tagged 150 were included, a total of 1,073.

One tangential finding came from looking at the characteristics of the mr LCSH described by Vizine-Goetz & Markey in 1989.


LCSH-mr Characteristics (1987) (Vizine-Goetz & Markey, 1989)

Tag of estab. hdg.Total occ.% of records% of Nov 94 notifs
100, 110, 111, 13012,2847.7533
150128,36780.2964
15119,21812.023
TOTAL158,869100.06

This table shows data about the 1987 mr LCSH, published in an article by Vizine-Goetz & Markey in 1989.

Not surprisingly, the largest number of records is for topical headings, tagged 150.

The last column shows that although sh tagged 100-130 represented only about 8% of the file, changed records are about 33% of those received in NOV 1994, and 64% of changed records were for topical headings, which represented about 80% of the file in 1987.

Geographic names apparently are fairly stable, at least for the time covered by the NOV 1994 notifications.


PCC Source of Change DLC or NACO (% of 040)

TAG:100110111130s 1xx|t130u 1xx|t151
DLC656455879381
NACO35364513719
XXXXXXXXXXXXXXXXXXXXXX
150n/a|dDLC

Looking at the NOV 1994 notification records from BNA, this table shows whether the source of change was the Library of Congress or a NACO participant by examining the last |d in field 040.

Personal and corporate names show a similar pattern, with about 1/3 of changes by NACO participating libraries and 2/3 by LC. Conferences and meeting name changes, small in number, were made by NACO members in 45% of the cases.

For series ARs, changes are made primarily by LC, although this ratio could change with increasing participation by NACO members in series authority contribution.

Uni. titles seem to be avoided by NACO participants, at least in terms of changes to current records, as 93% of changes were made by LC.

Geographic name changes also show a pattern different from the other areas, with 81% by LC and 19% by NACO members.

The last |d in all SUBJ ARs is DLC, so this kind of comparison was not possible for that group.


Percentage of Changes by Type: NAME Notifications

c/a/d100110111130s130u151
PersCorpConf1xx|t1xx|tGeog
1xx871712120
4xx606254528173
5xx4373520212
642000700
663000000
66716010412
670796654374577
680000000
unkn9441048
(c/a/d = change/add/delete)

Each notification record was received in printout form, as well as on tape. The BNA report includes phrases indicating which fields were changed, e.g., 1xx changed, or 4xx added. These appear in a field tagged 699 at the end of each record. Each change is counted only once, so one 699 note for 4xx added could mean that 1 or more 4xx fields were added. These figures show what percentage of each type of record had at least one change to a field with that tag.

Some records lacked 699 fields. These Changes were counted as "unknown". Possibly these were 053 additions, or another change.

Most changes for NAME records were the addition of 4xx's, or see from cross-references. This was especially true of uniform titles, with 81% of records reviewing showing the addition of 1 or more 4xx fields.

This analysis also showed that for personal name records, 79% had a field 670 added. In many cases, this was the ONLY change, and usually contained what one might call biographical information - b. or d. date, fuller forms of names, or other identifying information not needed to estab. the hdg. but to serve a potential reference need.


Percentage of Changes by Type: SUBJECT Notifications

c/a/d150 Topic
1xx4
4xx23
5xx16
67016
6801
unknown63

One obvious difference between the name and subject notifications is the higher percentage of unknown changes, with names ranging from 4 - 10 % for the different types of records. I believe the high percentage of unknown changes to subj. notifications is the addition of geographic subdivision.

I checked several of the many hdgs. in this report in printed volumes of LCSH beginning with the letter "p" and this was the only identifiable change to the authority record. Examples include:

As for NAMES, most of the 4xx changes were the addition of a 4xx field. Unlike NAME records, however, the 5xx changes were about evenly divided among changes (5.5%), adds (4.5%), and deletes (5.5%).

[this detailed breakdown is available on spreadsheets]


1xx Changes: Selected Months

1995TOTAL n1xx n1xx %
(Records)(Changed)
Jul Name1,67029917.90
Jul Subj5606411.40
Aug Name1,29116112.50
Aug Subj3793910.30
Sep Name1,77821311.97
Sep Subj7668911.60
Oct Name1,32018213.80
Oct Subj4906813.90

This table shows the pattern of percentage of changes to 1xx headings.

The figures are taken from record counts on disks of 1xx changed records provided by BNA, and the record count of the full files received on tape.

I bring this up for comparison to NOV 1994, where the range of percentages for changes to 1xx hdgs. went from a low of 7 % for corp. names to a high of 17% for conference or meeting names.

For subjects for NOV 1994, the 1xx changes reported were only for 4% of records examined, but this group included only records tagged 150.

The inclusion in this chart of all subj records, including those tagged 100-130 and 151, may explain the large difference in percentages.

These figures for name changes are comparable to that found by Calhoun & Oskins in a 1992 report of Rates and types of changes to LC authority files, where data analysis from 1 month in 1991 showed that slightly more than 1 change in 10 is a change to 1xx hdg.


Maintenance from Notifications

The two areas of maintenance which have required most of our attention since we began notification loads are de-duping and updating of existing bib. records when a 1xx changed after BNA authorities processing.

Duplicates resulted when, during what is called “overlay”processing, the notification record does not replace the existing record but is added instead. Occurs when there are, in a small number of cases, sequential updates on the same tape. If “target record is busy” being replace by the first notification, the second and later version is added. This past month, we began receiving files of notification records which had been de-duped by BNA so that we only have the latest version for overlay.

Duplicates also result when the same AR is present in both NAME and SUBJ indexes, and the system did not know which one to overlay, and so added the update as a new record. We have to date deleted more than 10,000 duplicate records. This problem is expected to be fixed with the next Innovative Interfaces release, coming soon to a catalog near you.


1xx Changes: 3-Month Overview

Updates:# of hdgs.# of bibs.
1994 NOV
Global142136
FSEn/a141
1994 DEC
Global144139
FSEn/a54
1995 Jan
Global137132
FSEn/a53

This is a sample of local update transactions based on 1xx changed hdgs. from NOV and DEC 1994 and JAN 1995. There does seem to be a pattern emerging.

Global updates are used for both NAMES and SUBJS. Likewise, single updates to bib. records also are used in both cases.

The effort required for each 1xx change depends on the situation for each hdg. Variants must be identified, there are often language difficulties, and subj. hdg. rotation in INNOPAC makes it more difficult to identify occurrences of a hdg.


Sample 1xx Changes: Names

Here are just two examples of NAME changes. The corporate name change reflects rule changes, the personal name change is probably the result of a marriage and usage of a later name, hyphenated name.

I did not attempt to compile a complete list, so these are not given as typical, but I suspect they are not uncommon.


Sample 1xx Changes: Subjects

These changes to subject headings likewise are not surprising.

One indicates a trend toward fewer inverted hdgs.

The value of the last change is unclear to me, but the change in terminology presumably is meaningful in the business vocabulary.

The changes to established headings represented a small number of all changes to ARs examined. Most changes were to 4xx, 5xx and 670 fields.


Sample 4xx Additions: Names (P)

Here are a few examples of the types of 4xx additions.

The first name is for a living individual, who used his middle name very publicly. With the change to this record, he also can be found by his real first name, Andrew.

The second name is for an individual who is gone but not forgotten. In this case, a reference was added from a form by which he was commonly known. There were other cases of names for persons gone a thousand years, but they tended to be very long Arabic names involving diacritics so I passed up on those to fit things on one slide and avoid characters not part of the PowerPoint set.

The last case again arises from usage. Mr. Macpherson sometimes referred to himself by his nickname, which was added as a cross-reference.


Sample 4xx Additions: Uni Ti

Two examples from uniform title changes present very different cases.

In the example for the Bible, the reference is for a specific title in another language.

The case for the musical work is typical of these uniform titles, bringing works together under Selections. This also popped up several times in the group examined for collected works of cartoonists published in book form.


Sample 4xx Additions: Topics

These additions of see from references to subjects illustrate two types of changes. The plural form, cakes, to cake, helps a user who may approach the catalog with that term in mind.

One example of this type that comes to mind was brought to my attention recently by one of our reference librarians, trying to help a patron locate material about turkeys, but who could only find items about the country. I explained the principle of plurality about things, which does not completely hold up where these two edible items are concerned. In case you were wondering, Pies is plural, but Pastry is singular. And no, there are no cross-references from Pie or Pastries.

In scanning for examples, there were many instances of the second type, adding 4xx’s for vocabulary from other domains. Many of the records were for business topics, so there was a lot of activity in that area in that particular month.


Sample 670 Additions: Names (P)

I mentioned earlier that for personal names, about 80% of the records had 670 fields added. Some of these 670 justified the addition of 4xx or 5xx fields, but for many records, the 670 addition was the only change.

The 670 provided in all cases additional information about the person, in the form of dates or other identifying information. Often, when the change was made by LC, the information came from the CIP data sheet, a source which is discarded.

The value of the 670 information is that it forms part of an online reference tool, which is readily accessible during the cataloging process. One does not have to trot off to reference shelves somewhere, or connect to a different online source.


Sample 5xx Additions: Names

Most of you are aware of the reasons for additions of 5xx references to ARs.

This just highlights the obvious reasons, and to save space and time, I just presented them with record numbers in parens.

As the data analysis showed earlier, only about 4% of personal name records had a change or addition to a 5xx field, with 37% and 35% for corp. and conf. names respectively, and 20% for series. For geog. names, the percentage of records with this change was 12% . And for subjects, 16% of the records had a changed or added 5xx.


Catalog Use

At the beginning of this presention, I mentioned three areas of implications for changes to ARs: maintenance of the local catalog catalog use, and authority file use.

I have described a bit about how the load of notification records has resulted in maintenance for our local system, in terms of the de-duping (for now), and then subsequent updating of the existing bib. recs.

For catalog use, 1xx changes mean, generally, that currency of the access point is improved. Cataloging rules and common usage are reflected in the changes, and changes such as subj. simplification.

The added cross references serve as added entry vocabulary and navigation aids, both to public and staff users. Hildreth (1982) said that cross-refs. facilitate term selection browsing and Rice said that when there are more access points, serendipity is less accidental and more systematic. In my own 1988 study, 14% of users seaches matched a sub. xref.

5xx refs. are of course a navigational aid to both public and staff users.


Authority File Use

Cataloging practioners, especially authorities coordinators, know the value of the elements of the authority file which I have talked about today - the 4xx, 5xx and 670 fields.

Without these, an authority record is skeletal, which may be adequate for many names and subjects if the established hdg. is sufficient in the broader bibliographic context.

But the cross-references of the AF serve as added access points and navigational aids to us as well, enabling us to expedite cataloging when a heading is established and the record can be found quickly.

The value of the 670 was mentioned earlier. As an online reference tool, the AF is highly accessible (not to mention free) and is as current as we choose to make it.


Characteristics of Change

I have described the characteristics of change from one view, to show a sample of the volume and types of changes received on BNAs notification service

Another, broader view of the characteristics of change to ARs is that it is inevitable, predictable, and beneficial.

Usage is a driving force. Cultural changes, personal preferences and inconsistencies in name usage, corp. changes, vocabulary changes all can affect the AF. Cataloging rules and to some extent cataloger judgement are another element driving change. The AF reflects and documents rules and standards, both AACR and LCSH, and is being extended to include international views.

At issue is are we making the right changes or enough changes.

Is more better? more of the same? or something different or additional?

Finally, it is worth remembering that changes to the AF do not sit in isolation, but are reflected in whole or part in local catalogs. When ARs are enhanced, in many cases the direct result is improved access in local systems as well.


Return to Proceedings Home Page


Advanced Search | Careers at OCLC | Feedback | Privacy Policy | ISO 9001 Certificate | ©2003 OCLC Online Computer Library Center, Inc.