DOI: Basics and Resources

A Digital Object Identifier (DOI) is a unique and persistent identifier which provides a link to an object on the Internet via a registration and indexing service.

Why learn about DOIs?

DOIDOIs are often used in citations and discovery services to provide permanent links to online articles, ebooks, images, reports, and other types of online objects.  Publishers now regularly assign a DOI to each journal article or ebook (or ebook chapter) they publish.  Many times, when a direct link or OpenURL lookup fails, an item still can be located using its published DOI.  Even if an object changes location on the Internet, its DOI will remain the same (and point to the new location).

The Basics

The Digital Object Identifier system is an ISO standard (ISO 26324) officially maintained by the International DOI Foundation (IDF).  The IDF provides the infrastructure to support DOIs by governing independent DOI Registration Agencies.  It was created in 1997 with DOI becoming a standard in 2012.

Note that according to the IDF, a DOI is a “digital identifier of an object” rather than an “identifier of a digital object”.

DOI Format

Each Digital Object Identifier has a unique alphanumeric string made up two parts:

{prefix}/{local name}

The prefix is a numeric string beginning with 10. and followed by several more numbers, usually four (ex. 10.1103).  The prefix is assigned by a Registration Agency to a publisher, institution, organization, or other types of Registrant.

The local name (suffix) follows the forward slash and is a unique alphanumeric string (for that prefix).  The local name for an object is chosen by the Registrant using whatever naming scheme they want
(ex. PhysRevLett.116.061102).

If you’re curious about why the DOI has its particular structure, see the Handle.Net Registry Technical Manual.

DOI URL

A DOI handle can be converted into a useful URL by adding it as a path after the domain https://doi.org.

So, for the previous prefix and local name (suffix) examples above, the full linkable URL would be:

https://doi.org/10.1103/PhysRevLett.116.061102

Note that older DOI URLs use the http://dx.doi.org format which was deprecated in 2016 (but is still supported).

DOI Uses

As stated, the Digital Object Identifier often points to digital content such as journal articles and ebooks (or individual chapters or even data).  Because DOIs provide a permanent link to these scholarly works, they are now frequently included—and sometimes required by journal publishers—in citations.  Of course, the various style guides have rules on how to format them in citations.  Here is an example of an APA citation with a DOI:

APA citation with a DOIAll of the style guides offer help for adding DOIs to citations.  Major ones include:

Publisher sites and databases often include DOIs in the online article or ebook itself.  This makes it easy to cite or to make a persistent link in a reading list.  For example, if an article has a DOI, Elsevier displays it as a link after the title and author(s).

DOI in an Elsevier articleDigital Object Identifiers are part of the metadata indexed in library discovery services.  EBSCO Discovery Service, Primo, Sierra, Summon, and WorldCat Discovery all display a DOI field on search results or on item detail screens (if available).  Sometimes they are links.  Including the DOI is useful when the search result link is broken as it provides a secondary method for locating the resource.  For example, Summon displays the item’s DOI as a link when you expand the “Preview” section.

Summon result with DOIDOI Registration

For many years librarians and researchers have been using Digital Object Identifiers.  More and more librarians are becoming the creators and maintainers of DOIs as yet another type of metadata to enhance discovery and access.  As we have seen, DOI links point to a URL that serves as a central index and redirection service.  DOI URLs are translated into the actual URL of the target object.  So how does this index get built?

The central DOI index is maintained by several DOI Registration Agencies.  The most important of these is Crossref.  Publishers and other institutions become members and pay to create DOIs for their items.  With the rise of university publishing and institutional repositories, more universities are enhancing their scholarly output with DOIs to aid in their discovery and sharing.  Increasingly, librarians are assisting university faculty with creating DOIs as part of the publishing process.  Cataloging and scholarly communications librarians are frequently tasked with creating and managing the DOIs for these repositories.

Resources

Here are official resources to learn about DOIs:

  • DOI.org – Website of the International DOI Foundation (IDF).
  • DOI Handbook – Official source of information about the DOI system.
  • Driven by DOI – Watch videos and download brochures.

Library-related DOI registration agencies:

  • Crossref – Popular DOI agency for scholarly publishing.
  • DataCite – “[L]eading global provider of DOIs for research data.”
  • mEDRA – Multilingual European Registration Agency.

Useful DOI tools:

  • DOI Citation Formatter – Crosscite will format an entered DOI in one of hundreds of citation styles.
  • DOI Resolver – Google Chrome extension to create links and citations from a DOI.
  • Custom DOI resolver – Firefox extension to turn a non-linked DOI into a URL.

COUNTER: Basics and Resources

COUNTER (Counting Online Usage of NeTworked Electronic Resources) is an organization and a set of standards to report usage of library electronic resources by vendors and publishers.

Why understand COUNTER reports?

When database vendors and publishers use different methods of counting and reporting usage of their platforms and items, it is difficult for librarians to determine accurate use counts (and rejections).  Thus, it is hard to calculate the cost-per-use.  Different figures often make database comparison difficult when deciding between the value of two similar products.

COUNTER provides a way to standardize and compare database usage statistics across vendors and time.

COUNTER reports can also show demand for titles not owned by reporting access denied to books (BR3) and journals (JR2).  These reports can be helpful for library resource acquisition decisions.

The Basics

COUNTERThere are four types of item-type usage reports each with their unique statistics:

  • Books – Electronic or print monographs, including reference works.
  • Journals – Serials including conference proceedings and newspapers.
  • Media – Non-text items such as images and video.
  • Databases – Collections of online data.

The Title report is a combination of book and journal counts.

The Consortium report counts all usage across a group of institutions.

COUNTER Report Request Screen
COUNTER Report Request Screen

COUNTER statistics are reported on a monthly basis and reports can typically cover any custom time period in addition to calendar year.

Reports are distributed electronically.  Sometimes they are available from vendors as immediate downloads, but other times they must be requested and emailed.  “COUNTER reports are available in two formats: delimited files, which are readable using Excel and similar spreadsheet tools, and XML, which is delivered using SUSHI.”  The two types of delimited files are comma-separated value (CSV) and tab-separated value (TSV) files.

Download an example COUNTER report.

If the vendor allows, you can avoid the manual running and downloading of COUNTER reports.  COUNTER offers an automated way to accomplish this using  the SUSHI protocol.  “The Standardized Usage Statistics Harvesting Initiative (SUSHI) protocol is designed to simplify the gathering of usage statistics by librarians, and it uses a series of XML schemas to do this.”

The current version in widespread use is COUNTER 4, however some vendors still offer archived usage statistics using version 3.  COUNTER 5 is available and we will see vendors moving to it gradually.  EBSCO just announced that it is making its usage reports available in COUNTER 5 format.

While version 5 is the future, COUNTER 4 is still by far the most common version offered by vendors today.  Below is a list of all COUNTER 4 reports with a brief description of each.  Vendors, depending on their size and content, usually offer a subset of these reports.

Continue reading “COUNTER: Basics and Resources”

Proxy Servers: Basics and Resources

A proxy server is a service that provides authentication and mediation between database or publisher websites and the end user by routing Internet traffic through its system.

Why learn about proxy servers?

A proxy server is a service that libraries use to authenticate their users to provide access to many online databases and publisher websites.  Using a proxy service allows library resource vendors to authenticate users from a single point-of-access regardless of where they are located, on-campus or from their home computer.

The Basics

For our examples, we’ll use the popular EZproxy product from OCLC.

Authentication

To avoid having to provide users with an individual or institutional login and password, most database and publisher websites authenticate users by IP address.  Sometimes vendors will limit access to a range of IP addresses—on a single campus, for example.  But for users outside of the physical campus, you must provide a known IP address (or set of IP addresses). This is accomplished by routing users through a proxy server so that the access requests come from its IP address(es) which are recognized by the vendor.  The content is then returned to the proxy server and routed back to the original user.

EZproxy process
Source: http://www.oclc.org/support/services/ezproxy/documentation/learn/overview.en.html

Because libraries can’t let everyone access their resources via EZproxy, they must authenticate their users before access.  EZproxy allows user login itself, but EZproxy also provides a method of authentication using your institution’s single sign-on (SSO) server.

Resource Access

EZproxy is accessed using an HTTP request.  To access a website via EZproxy, you must prepend the EZproxy server URL to the database or publisher’s website address.  A typical EZproxy URL looks like this:

http://ezproxy.yourlib.org/login?url=

To this proxy URL, we add the URL for the website we wish to access through EZproxy. For example:

http://ezproxy.yourlib.org/login?url=http://www.credoreference.com

The above URL is referred to as a “pre-proxy” link.  Once a website is accessed via EZproxy, the address changes to a “post-proxy” URL.  For example:

http://search.credoreference.com.ezproxy.yourlib.org

As you perform a search or click on links on a database or publisher site, you are submitting your requests to your EZproxy server which passes them on to the original website.  Data is returned to the EZproxy server which sends it back to your browser.  That is why the post-proxy URL ends with .ezproxy.yourlib.org (ignoring the path).

You might notice some post-proxy URLs use hyphens instead of dots between parts of the original website’s address.

https://refworks-proquest-com.ezproxy.libproxy.db.erau.edu.

The short explanation is that the EZproxy server uses a wildcard security (SSL) certificate for *.ezproxy.yourlib.org which allows one subdomain before the EZproxy server domain (ezproxy.yourlib.org).  The hyphens “trick” the server into seeing the original website as a single subdomain. This is done only for original websites that use HTTPS.

EZproxy Configuration

EZproxy has many settings that are configured during initial installation (using hyphens with HTTPS, for example).  You also set the maximum number of virtual hosts (typically from 5,000 to 20,000).

For each resource you access through EZproxy, you must configure it separately in a block of code called a stanza.  In the stanza for a database or publisher website you must specify at the minimum a Title (T) and starting URL (U).  Other basic stanza directives are Host (H), HostJavaScript (HJ), Domain (D), and DomainJavaScript (DJ).  Below is a basic database stanza:

Title SPIE Digital Library
URL  http://www.spiedigitallibrary.org
DJ   spiedigitallibrary.org

The Title directive is used to identify the resource (and is used for the link text on the default EZproxy list page).  The URL is used to match the EZproxy request link with the appropriate stanza to apply.  Once accessed, any further links on the resource site will be compared to any D or DJ lines and if there is a match, will be proxied and given access (including any JavaScript if a DJ line is encountered).

If a database or publisher website includes multiple domains or subdomains or uses both HTTP and HTTPS, you need to add Host (H) or HostJavaScript (HJ) directives to account for them.  More advanced directives are used to manage website cookies, set domains that should never be proxied, find and replace HTML code, and many others.  OCLC publishes an EZproxy Reference Manual to list these directives.

Here is a more advanced database stanza:

Option DomainCookieOnly
Title Engineering Village
URL http://www.engineeringvillage.com
HJ https://www.engineeringvillage.com
HJ engineeringvillage.com
HJ www.engineeringvillage.com
HJ www.engineeringvillage2.org
HJ www.ei.org
HJ acw.elsevier.com
DJ ei.org
DJ engineeringvillage.com
DJ referexengineering.elsevier.com
Option Cookie

OCLC publishes a list of recommended database stanzas for many of the most popular databases.  Of course, websites are frequently updated and these changes often require revised or completely new stanzas.  These stanzas are found in the config.txt file.

Resources

Here are some resources to learn more about EZproxy.

 

Discovery Services: Basics and Resources

A discovery service is an online library searching tool that provides an all-in-one interface for finding both local library items and online subscription and open access resources.

Why learn about discovery services?

Most libraries use discovery services in addition to, or as a replacement for, their OPAC.  Whether you are a technical services librarian whose job it is to administer them or a reference librarian who uses them as a major research tool, it is helpful to know how discovery services work.  You need to know their features and their limitations.

The Basics

For this article, we will use Summon (with Serials Solutions) as an example.  But all discovery services share the same basic functions and features.  They consist of three major parts: the index, the link resolver, and the search interface.

Summon is a discovery service developed by ProQuest and now managed and supported by Ex Libris (since their merger).  Summon allows users to search for print and electronic resources owned or subscribed to by their library.

In my library, Summon works in conjunction with Serials Solutions 360 Link using a Summon Unified Index to provide links to ebooks (and chapters), journal articles, audio, videos, library catalog records, institutional repositories, LibGuides, and more.

Summon Unified Index
Source: http://exl-edu.com/07_Summon/Overview/Discovery/Introduction to Summon

The Summon index contains citation metadata, subject terms, abstracts, full text, and direct links (when available).  It also includes Ulrichsweb information, DOIs, altmetrics, and citation counts.

Summon Indexing

The combination of Summon and Serials Solutions employs two methods of indexing applied in this order:

  1. Summon – Index Enhanced Direct Linking (IEDL)
  2. Serials Solutions 360 Link – OpenURL Link Resolver

Summon for Researchers
Source: http://exl-edu.com/07_Summon/Overview/Discovery/Understanding the Summon Index

When you perform a search and Summon returns results, Summon first looks to see if it has a direct link to the item using its proprietary Index Enhanced Direct Linking (IEDL).  These are reliable links that point specifically to the title and are successful more than 99% of the time, according to Ex Libris.

If Summon doesn’t have an IEDL record, the metadata is passed to Serials Solutions 360 Link.  This is a link resolver that relies on OpenURL technology.  What this means is that 360 Link must build a URL containing item search metadata in a format that the database website can use.  Most broken search results links occur when either the URL is badly formed or the metadata on the database’s side is incorrect.  Fortunately, Summon gives you ways to work around the problem by providing DOIs, other database choices, or links to the ebook or journal level so that you can browse to the chapter or article.

One great feature is Summon’s ability to do a full-text search for library print book holdings by indexing electronic versions of the same title, even if they aren’t owned by your library.

Relevancy Ranking and Filters

Summon then performs relevancy ranking on the results using a proprietary method that is a combination of dynamic and static ranking. Dynamic ranking includes search term frequency, field (title, author, abstract) weighting, term manipulation (synonyms, stemming, etc.), and other functions.  Static ranking includes item attributes such as content type, date published, peer-review status, and citation counts.  Library collection items are given a higher ranking than subscription database items.

After relevancy ranking, Summon filters search results for those items that your library has access to by default.  Of course, users can see all of the relevant results by checking the “Add results beyond your library’s collection” box.

Finally, Summon offers several filters to limit results to full text, peer-review, library catalog, content type, discipline, publication date, and language.

Library Resources Management

Now that we know how Summon indexes and creates search results links, we need to know how Summon selects results that are only contained in your library’s collection.  In this sense, your “collection” means both your print holdings as well as all of the online resources you own or subscribe to.

eBook and e-Journal Holdings

In order for Summon to link to your online ebook and e-journal holdings, you must tell Summon what you own or subscribe to.  You do this by activating or “tracking” your holdings in Serials Solutions 360 Core, the back-end of 360 Link and interface for the ProQuest Knowledgebase.  You can track entire databases, publisher collections, journal titles (with specific date ranges), and individual ebooks.

Serials Solutions e-Catalog List

For each database you subscribe to, you might subscribe to all of the titles (if offered as a complete package) or you might subscribe to only some of the titles (if purchased individually).

If titles within a database are purchased individually, you must track those individual titles.  In addition, you also might need to set custom dates if you don’t subscribe to the entire run of the journal title.  If you subscribe to an entire database, new titles get added automatically when they become available.  Ebook Central can also be set up to add your newly purchased ebook titles (perpetual and DDA) automatically.  Otherwise, library staff must add new ebook and e-journal titles manually.

Serials Solutions e-Catalog Titles

Because libraries often have unique access requirements, Serials Solutions gives you a way to customize your access URL and other details.  For databases which do not have article- or ebook-level linking, you can choose to link at the database level instead.  You can choose to include your proxy URL for subscribed titles or omit it for open access resources. You can include custom journal subscription date ranges.  Finally, you can add custom public notes (such as login information).

Serials Solutions e-Catalog Details

Library Catalogs and Institutional Repositories

Discovery services can include records from your library catalog and institutional repository.  There are several different methods for getting the local records into the Summon index.

At my library, to get our library catalog indexed, we export bibliographic and holdings records from our ILS on a periodic basis and upload them via FTP to Ex Libris.  Staff at Ex Libris then add them to the Summon index (a process that can take several weeks).

For many institutional repositories, you provided Summon with access to your institutional repository metadata using Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH).  Institutional repository content is then harvested automatically on a periodic basis (weekly for Summon).

Resources

Here are some great online resources to learn about discovery services:

Articles and books about discovery services:

The Future of Library Resource Discovery by Marshall Breeding

The Future of Library Resource Discovery“A white paper commissioned by the NISO Discovery to Delivery (D2D) Topic Committee” gives an overview of the current state of library discovery services and looks into how they might adapt to the future.  Published in 2015.

Access the full-text article (PDF).

E-Discovery Tools and Applications in Modern Libraries edited by Egbert de Smet and Sangeeta Dhamdhere

 My library My History Books on Google Play E-Discovery Tools and Applications in Modern LibrariesPart of the “Advances in Library and Information Science” (ALIS) series.  This book is a collection of papers covering discovery UX, e-metrics, open source, digital libraries, and library usage studies. Published in 2016.

View details and find a place to buy or borrow at Google Books.

Implementing Web-Scale Discovery Services: A Practical Guide for Librarians by JoLinda Thompson

Implementing Web-Scale Discovery Services: A Practical Guide for LibrariansNo. 9 in the “Practical Guides for Librarians” series.  From the publisher: this book is a “one-stop source for librarians seeking to evaluate, purchase, and implement a web-scale discovery service.”  Published in 2014.

View details and find a place to buy or borrow at Google Books.

Web Scale Discovery Services by Jason Vaughan

Web Scale Discovery ServicesThis title is actually an issue of Library Technology Reports from ALA Tech Source.  The report covers the content, interface, and functionality of discovery services from the major vendors to help with evaluation.  Possibly a bit dated now. Published in 2011.

View details and find a place to buy or borrow at Google Books.

OAI-PMH: Basics and Resources

Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH) is a set of specifications for making structured open repository metadata accessible to other service providers issuing requests.

Why learn about OAI-PMH?

Taking advantage of repositories (data providers) and services (service providers) that offer metadata using OAI-PMH will allow your resources better visibility and access.  For example, many discovery services (the “harvester”) use OAI-PMH metadata for indexing open access institutional repository articles.

The Basics

Open Archives InitiativeOpen Archives Initiative Protocol for Metadata Harvesting (OAI-PMH) specifies how metadata is structured and presented for ingestion by external services, usually on the Internet.  OAI-PMH metadata is encoded in extensible markup language (XML) format.  OAI-PMH records are harvested using HTTP requests.

OAI-PMH is a project of the Open Archives Initiative.

Continue reading “OAI-PMH: Basics and Resources”