...

Different Types of Digitization

Reference Versus Research Images

At one level of digitization of print materials is the quickly derived Reference image, which can be created using automated scanners (flatbed or orbital) or smartphones and is not intended for long-term preservation. These devices require no specialized skill in lighting, composition, or focusing, as they determine settings automatically. A Reference image can be very useful and is often made from text-based material slated for optical character recognition (OCR), but it is not optimized for deep zoom or detailed online research of materials. Files created by these devices (PDF, JPEG, PNG) are not intended for optimized enhancement and are often low-resolution: ideal for reference, speedy transfer, and portability, but insufficient for quality reproduction.

At another level of digitization of cultural heritage materials is the skillfully derived Research image, created to documented preservation specifications that are informed by best practices and that meet or exceed FADGI (Federal Agencies Digitization Guidelines Initiative) standards. This type of digitization requires the skilled use of high-resolution photographic equipment (See PUL Imaging). The photographer will use lighting designed for cultural heritage imaging and must use professional judgment to properly set exposure, illuminate, and compose each photograph. In addition, the camera, lighting, and display monitor must be calibrated regularly. File formats created using this equipment are lossless (RAW, TIFF), allowing for optimized enhancement, and files are captured at equipment-capable resolutions suitable for high-quality reproduction.

Both types of images may be ingested into the PUL repository; when creating Research images, it is desirable to follow best-practice recommendations. Staff will need to consider what type of image is appropriate to the material being digitized and its intended use. Additional considerations are the availability of equipment, timeframe, quality of the source material, and storage costs.

File/Directory naming

All image files should use an 8.3 naming convention: eight numeric, sequential digits padded with leading zeros, followed by a lowercase, three-character file extension, e.g. 00000013.tif. This ensures a consistent and relevant image order. Directory names and structure should reflect the collection.
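As an illustration of the 8.3 convention described above, the following minimal Python sketch generates and checks file names. The function names (sequential_name, is_valid_name) are hypothetical and not part of any PUL tooling. The sketch assumes that extensions may contain lowercase letters or digits and that any sequence number from 0 to 99999999 is acceptable; the guidelines do not state either point.

```python
import re

# Eight digits, a dot, then a lowercase three-character extension.
# Allowing digits in the extension is an assumption of this sketch.
NAME_PATTERN = re.compile(r"\d{8}\.[a-z0-9]{3}")


def sequential_name(index: int, extension: str = "tif") -> str:
    """Build an 8.3 file name for the given sequence number."""
    if not 0 <= index <= 99_999_999:
        raise ValueError("sequence number must fit in eight digits")
    ext = extension.lower().lstrip(".")
    if len(ext) != 3:
        raise ValueError("extension must be exactly three characters")
    # Pad the sequence number with leading zeros to eight digits.
    return f"{index:08d}.{ext}"


def is_valid_name(filename: str) -> bool:
    """Return True if the file name follows the 8.3 convention."""
    return NAME_PATTERN.fullmatch(filename) is not None
```

For example, sequential_name(13) yields 00000013.tif, the name used as the example above, while is_valid_name("13.tif") is False because the number is not zero-padded.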

...

Some materials, such as audiovisual materials or ephemera, may be named corresponding to a barcode or other unique identifier assigned to each physical asset. This may also include indicators about the side (for bilateral media) and derivative status appended at the end. For example: 32101047381338_1_pm.wav (where 32101047381338 = barcode, 1 = side 1, pm = preservation master, and .wav = file extension).
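The identifier-based pattern in the example above can be split into its parts programmatically. The sketch below is a hedged illustration, not an established PUL utility: the names ASSET_NAME, AssetName, and parse_asset_name are hypothetical, and it assumes that the identifier is alphanumeric with no underscores, that the side indicator is numeric, and that the derivative status is a short lowercase code such as pm.

```python
import re
from typing import NamedTuple, Optional

# identifier, optional _side, optional _status, then .extension
# The character classes used here are assumptions of this sketch.
ASSET_NAME = re.compile(
    r"(?P<identifier>[A-Za-z0-9]+)"
    r"(?:_(?P<side>\d+))?"
    r"(?:_(?P<status>[a-z]+))?"
    r"\.(?P<extension>[a-z0-9]+)"
)


class AssetName(NamedTuple):
    identifier: str
    side: Optional[int]
    status: Optional[str]
    extension: str


def parse_asset_name(filename: str) -> AssetName:
    """Split an identifier-based file name into its components."""
    match = ASSET_NAME.fullmatch(filename)
    if match is None:
        raise ValueError(f"unrecognized asset file name: {filename!r}")
    side = match.group("side")
    return AssetName(
        identifier=match.group("identifier"),
        side=int(side) if side is not None else None,
        status=match.group("status"),
        extension=match.group("extension"),
    )
```

Calling parse_asset_name("32101047381338_1_pm.wav") returns the identifier 32101047381338, side 1, status pm, and extension wav. The side and status fields are None when a name omits them.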

Metadata needs

The most efficient workflow for PUL is for descriptive metadata to be created prior to digitization. At minimum, there must be a unique identifier, such as a metadata management system ID or a Finding Aids component ID, connecting digitized content to a metadata record.

...