Friday, January 26, 2007

Reflections on ALA MW and OR 2007

Attending ALA and Open Repositories back-to-back has provided an excellent perspective on the full spectrum of activities in the library world. The average attendee of ALA is a librarian with an ML(I)S, trained in the traditional library model, with many years of professional experience. While there are plenty of librarians at OR, most of the presenters at OR come from a computer science background. The common factor among all of these professionals is that they are dedicated to the idea of stewardship of knowledge: preserving it, providing access to it, understanding the culture and context of knowledge.

On the morning of the last day of this week-long conference extravaganza, I discussed the major themes of these conferences with a colleague who is the incoming Head of Cataloging and Metadata Services at an ARL library. Her comments are in italicized text.


RDA is controversial

Frankly, it is being criticized from multiple perspectives, including traditional catalogers as well as digital library developers. Personally, I think that it will be helpful for recognizing the common ground between long established cataloging practices and emerging organically-digital practices. However, I have a feeling that because it will be so comprehensive and complex, it will not be fully and wholly adopted by most institutions or even communities of practice. Rather, practitioners will pick and choose parts of it to adopt for their own needs. Perhaps this is not a sign of weakness in RDA, but a reflection of the increasingly divergent granularity, needs, and practices in the whole "information space."

I would say RDA is struggling, and it is frustrating to see so much effort by highly knowledgeable professionals go into developing - and then defending - what is essentially a very incremental change in the way we do business. RDA's core cataloging audience feels that it makes too many unwarranted changes, and the metadata community feels that it doesn't move far enough from the MARC model to be relevant outside the traditional catalog. What RDA does right: It separates the bibliographic metadata from ISBD formatting. Our systems should enable us recombine parts of the bibliographic record for different needs, and it's good that our content rules are changing to reflect this (we need to lean on our OPAC providers to recognize this as well). What RDA does wrong: It is hopelessly complex. One idea floating around at ALA was the notion of making RDA a set of basic rules, and then allowing communities of practice (i.e. map catalogers) to build out more complex rules for collections and institutions that need them. I wish RDA had gone this way instead of trying to be all things to all constituents, because this would have been a more agile solution in the short term and more sustainable in the long term.


Open Repositories come of age

Open Repositories 2007 was an exciting conference. The OR community is vibrant, bold, and innovative. It was evident to me that DSpace, Fedora, and Eprints are maturing. These repository systems are enjoying wide adoption, rapid development, and are quickly increasing in the utility they bring to adopters. The OR program featured a rich spectrum of topics which were uniformly interesting. There was a spirit of cooperation and discovery among all participants and a genuine excitement about the future.

It was interesting to see FRBR in the context of a digital library (The Eprints Application Profile: a FRBR approach to modelling repository metadata, Julie Allinson, UKOLN). The FRBR discussions at ALA and other library conferences are very catalog-focused, and it would be nice to have more sharing across information communities. In some ways, it seems like digital libraries are better positioned to adopt the FRBR model because they are dealing with smaller collections and immature systems.


Institutional repositories: systems or services?

I'm not crazy about the "Web 2.0" moniker, but I think the user-oriented services that can now be applied to all kinds of different systems - library websites, OPACs, IRs - are the most exciting development out there right now. In terms of IRs, the technologies have existed for a while, but they have not allowed a lot of flexibility from the metadata and customization (library) side and are incredibly difficult for the end user to adopt. The next generation of these systems have a dramatically improved service layer that makes it possible for "regular" libraries (i.e. without a squad of programmers) and end users to make much richer use of the technology.


Top topics of the week:
  • Web interfaces for repositories, especially Manakin (DSpace) and Fez (Fedora)
  • Needs of librarians vs. needs of users in repositories (they are vastly different)
  • Repository as system vs. repository as service (the second is more important)
  • The direction of RDA and applying the FRBR model to repositories
  • Metadata is important, but there is tension between simplicity and complexity. And who the heck is going to do it?
  • The development of fully realized cyberinfrastructure, emphasizing preservation (iRODS)
  • Incorporating intellectual property management into the repository workflow
  • Storing data sets in repositories, and what to do with it after you have it

Thursday, January 25, 2007

Manakin coming-out at OR 2007

The hosts of the Open Repositories 2007 conference are the University of Texas and Texas A&M, and they are taking full advantage of the conference to promote the projects of the Texas Digital Library. One TDL project, in particular, will clearly have a major impact on the repository scene.

While DSpace has been a successful project by any measure, many have lamented that the standard user interface is difficult to customize. The Digital Initiatives office at Texas A&M took this challenge head-on by developing a fully-customizable, XML-based user interface for DSpace. Dubbed Manakin, it is one of the major stories of this conference.

The two important concepts to understand with Manakin are aspects and themes. Aspects are functional bits of code that work directly with the DSpace API. Themes are the combinations of documents (such as a CSS) that format the data coming out of DSpace in order to customize its look for the web. Manakin takes data from DSpace and formats it into a basic document called a DRI-document. The Manakin aspects and themes add code to the DRI-doc to produce the formatted XHTML that displays in a web browser.

Manakin allows for three levels of customization, allowing for different levels of knowledge, resources, and need in the institution:

1. Style: CSS/XHTML
2. Theme: XSL/CSS/XHTML
3. Aspect: Java/Cocoon development

Adam Mikeal, Lead Software Developer at Texas A&M, demonstrated the great potential for Manakin. A&M digitized the Geologic Atlas of the United States, a series of map folios produced between 1894 and 1945. They loaded it into DSpace, but it was clear that DSpace did not reveal the full value of the collection. It did not allow retrieval based on the geographical nature of the collection, nor did it provide an ideal display of the items in the collection. To address some of these problems, they used Manakin to develop a custom user interface for the collection that shows the geographical coverage of these folios through the use of a web 2.0 style mash-up. In other words, this new interface integrates a custom designed web site, data from a DSpace collection, and the Yahoo!Map API to create an innovative way to display a unique and valuable collection digitized by the library. In all, this project provides an excellent example of how libraries can use emerging technologies to provide improved information services to our patrons.

Wednesday, January 24, 2007

Implementing RDA

The RDA Update Forum on Saturday morning provided additional insight into the JSC's plans for implementation.

Beacher Wiggins, Director of Cataloging at the Library of Congress, discussed RDA implementation from the point of view as an administrator of a major cataloging department. He was concerned about the resources that will be required to transition to RDA, and he emphasized that the implementation time line needs to go according to schedule: the longer it lasts, the more painful the transition will be. He hopes the implementation will not be as challenging as the transition to AACR2.

Marjorie Bloss, RDA Project Manager, discussed implementation in terms of project management. Her goals are to deliver RDA on time and under budget, to meet the stakeholder needs, and to manage outreach. RDA will primarily be an online product, with a "suite" of different versions developed later. A demonstration version of RDA is available online at www.rdaonline.org. There is a lot of discussion about how training for RDA will be developed and provided. At this time it is hard to say how training will be handled until it is finished. She believes that Part A (Description) will require more training for experienced catalogers than Part B (Access Point Control). Trainees should become familiar with the FRBR and FRAD models prior to training.

Tuesday, January 23, 2007

Rheinhold's Philosophizing Monkey

Given that I catalog German monographs, I decided to attend the Germanists Interest Group of the Association of College and Research Libraries. I had never attended this group before; I learned of the group from John Marner of Texas A&M. I don't know what I was expecting, but even if I had made an educated guess, I think I would have been delightfully surprised by the presentation given by Dr. Axel Schmetzke (University of Wisconsin - Stevens Point) on Sunday.

Dr. Schmetzke's presentation, Hugo Rheinhold's Philosophizing Monkey used the story of a small statuette to demonstrate how the web can be used to support research on obscure topics, especially by creating an "invisible college" of interested researchers.

Dr. Schmetzke has developed a highly detailed website recording the history of the monkey statue. He provided multiple perspectives on the statues meaning, including depictions of monkeys in art, influences on the statues composition (such as Rodin's La Pensure), the statue's reference to Darwin's theory of evolution, and Rheinhold's social and political activities. The scope of his research on this small piece is staggering and impressive, synthesizing the intellectual traditions of many scholarly disciplines.

In the conclusion of this highly entertaining presentation, Dr. Schemtzke discussed his research process, touching on web resources in particular. He maintained that while web research is powerful, it can't replace print-based research; rather the two are complementary and equally important.

Saturday, January 20, 2007

RDA and the Future of Cataloging

The first rule of ALA conferences is...be flexible. I'm sitting in one of many, many hallways of the Washington State Convention and Trade Center, right across from room 615, location of the ALCTS Cataloging Norms Discussion Group. This is the first meeting I'm supposed to be blogging, but I'm not even in the room. So many people showed up for the meeting that the audience is spilled into the hallway. Fortunately, this morning I attended another excellent meeting, the Electronic Resources Interest Group (ERIG), so I'll write about that instead.

The ERIG meeting featured four speakers, all of whom are leaders in cataloging and metadata practice: Barbara Tillet, Murtha Baca, Sara Shreeves, and Mary Woodley. While I could write at length about any of their presentations, the most significant in my mind was Barbara Tillet's presentation on RDA. She provided a summary of the development of Resource Description and Access, the sequel to AACR2. In her incredibly rich presentation, she provided a vision of the future of cataloging. Essentially, RDA is being developed with the digital environment in mind and will represent a major change in direction from traditional cataloging. It is currently slated for release in 2009. Here are just some of the changes she presented:
  • RDA is a standard for the content of bibliographic records, but not a standard for the display of bibliographic information. Because of this, the traditional punctuation currently used in cataloging records (a.k.a. ISBD punctuation) will no longer be mandatory.
  • RDA will move away from using terminology and practices that were developed for card catalogs. In some cases, the terminology is changing. For example, the term "access points" will be used instead of "headings." "Authority control" will change to "access point control." "Preferred title" will be used instead of "uniform title."
  • Some traditional cataloging practices will go away, including the main entry. The general material designation (GMD) is going away, but this information will be captured in more detail in various fields that describe an item's format.
  • The rules for transcribing information will be greatly changed to a "Take what you see" approach. This means that catalogers will be encouraged to transcribe descriptive information mostly verbatim, including capitalization, abbreviations, even errors.
  • The rules for transcription will vary for different types of resources. Transcription is more important for resources than others. For example, it is very important for unique print resources, such as rare books, but less so for digital resources.

Wednesday, January 17, 2007

Brian's blogging schedule for ALA

Besides commenting on general news and observations, I will be blogging on three meetings at ALA:

ALCTS-CCS Cataloging Norms Discussion Group
Date: 1/20/2007 1:30 PM -- 3:30 PM
Topic: "How Catalogers and System Developers Work Creatively with Metadata"

I was very impressed by the lineup at this meeting, which includes Terry Reese, the developer of the MarcEdit program, and Casey Bisson, who recently won a Mellon Grant for a project to integrate an OPAC with the Word Press content management system.

ALCTS-CCS RDA Update Forum
Date: 1/20/2007 4:00 PM -- 6:00 PM

Resource Description and Access is the planned successor to the Anglo-American Cataloging Rules. This is a significant project in the cataloging/metadata world, yet it is somewhat controversial.

ALCTS Networked Resources and Metadata Interest Group
Date: 1/21/2007 8:00 AM -- 10:00 AM

I am the chair for this interest group. We have an excellent lineup of speakers on metadata practices, including Diane Hillman, author of Metadata in Practice.

Click on the links above to view a description of the session. If anything else of particular interest comes up in the course of the conference, I will definitely be commenting on them as well.

Tuesday, January 16, 2007

Preparing for ALA Midwinter

Over the next three days, I will be preparing for ALA Midwinter. Part of that preparation is testing out this blog. If you are reading this, then the test was successful! Besides that, most of my preparation is for the Networked Resources and Metadata Interest Group. I am the chair of this group, and we have a meeting at every Midwinter. This year, we have some exciting speakers lined up for a discussion followed by a business meeting. In addition to NRMIG, I will be attending other meetings and programs, as well as viewing the exhibits by vendors. I will post my agenda here on the blog soon.

New Fondren Library Conference Blog

Hi all,

Welcome to the new blog for the Fondren Library Staff Travel and Development Committee. This will be a space where staff at the library can blog events at conferences they are attending. Look for posts soon!