AMI Digital Asset Specifications
This document outlines the technical specifications and requirements for the preservation-focused digitization of audio and moving image collections, along with guidelines for the packaging of digital deliverable files.
Last updated: 2025-05-19. For previous versions, see changelog.md
Table Of Contents
- Specifications for Digital Assets
Specifications for Digital Assets
Specifications may be modified over time to reflect changes in best practices or NYPL’s digital infrastructure, or to address previously unspecified media or conditions.
The following sections are broken into format groups to define the file deliverables for different media types and format variations.
Note: While this document outlines general specifications, the most precise and up-to-date technical criteria—particularly for file-level validation—can be found in our MediaConch policy files.
Digital Asset Packaging: All digital deliverables must be organized according to NYPL’s digital asset packaging guidelines, which follow the BagIt specification (v1.0). Each bag must:
- Represent only one inventoried collection object;
- Follow NYPL’s prescribed directory structure (
PreservationMasters
,EditMasters
,ServiceCopies
,Images
, etc.); - Include metadata JSON for each media file (image files excluded);
- Contain an
md5
manifest for all files in thedata
directory.
Bag directories must be named using the Primary ID of the collection object and must not be compressed.
Filename Conventions: File naming rules are referenced throughout this document, but a full breakdown of filename components and role suffixes can be found in the NYPL file naming convention guide.
Each filename is built from a standardized “root” that includes:
- a three-letter division prefix,
- the object’s Primary ID,
- one or more structural components (e.g.,
v01
,f01
,r01
,s01
,p01
,t01
), - and a two-letter suffix denoting the file’s role (
pm
,em
,sc
,mz
).
Examples:
myd_123456_v01_pm.mkv
→ Video Preservation Mastermyh_987654_v01f01r01_pm.flac
→ Audio Preservation Master (Face 1, Region 1)
For details on volume, face, region, stream, and part identifiers—as well as role suffixes—refer to the file naming convention chart.
Film Media
Deliverables
For each original recording, the following should be produced:
- One preservation master file*
- One mezzanine file*
- One service copy file*
- One JSON metadata file per media file (see JSON Metadata Deliverables)
- Optional image files as described in Image Deliverables
If the object has audio content (i.e. composite sound print), audio must be synchronized and embedded in all final deliverables.
Capture tools
Film must be digitized and captured either (1) as DPX, then transcoded along with synchronous Broadcast Wave files to FFV1/FLAC/Matroska using RAWcooked, or (2) directly to FFV1/FLAC/Matroska using a Lasergraphics scanner, provided that all NYPL specifications are met.
RAWcooked Parameters
When using RAWcooked to transcode from DPX to FFV1/Matroska, the following parameters must be used:
--output-name
: Set to match the NYPL filenaming convention--framerate
: Set to match the source material’s native frame rate--check
: Enabled to verify successful encoding--hash
: hash algorithm must be used/reversibility data created--info
: Full encoding details must be preserved
Film Groups 1 and 2: Motion Picture Film, Silent / Sound
Preservation Master File Specifications: Film Groups 1 & 2: Motion Picture Film, Silent / Sound
Source format | 35mm | 16mm | 8mm / Super 8mm / Double 8mm |
---|---|---|---|
Video bit depth | 16 bit | 10 bit | 10 bit |
Resolution | 4K optical side overscan | 2K optical side overscan | 2K optical side overscan |
Video codec | FFv1 version 3 | FFv1 version 3 | FFv1 version 3 |
File wrapper | Matroska (.mkv) | Matroska (.mkv) | Matroska (.mkv) |
Frame size | 4096x3112 | 2048x1556 | 2048x1556 |
Frame rate | (Same as source. If not described, please determine on viewing and describe in metadata) | (Same as source. If not described, please determine on viewing and describe in metadata) | (Same as source. If not described, please determine on viewing and describe in metadata) |
Pixel aspect ratio | 1.000 | 1.000 | 1.000 |
Audio bit depth* | 24 bit | 24 bit | 24 bit |
Audio sampling rate* | 96000 Hz | 96000 Hz | 96000 Hz |
Audio codec* | FLAC | FLAC | FLAC |
Audio channels* | Same as source | Same as source | Same as source |
Color space | Linear RGB (4:4:4) | Linear RGB (4:4:4) | Linear RGB (4:4:4) |
Color primaries | BT.709 | BT.709 | BT.709 |
Transfer characteristics | Printing Density | Printing Density | Printing Density |
Notes | Files may be transcoded from DPX using RAWcooked or captured directly to FFV1/MKV via an approved scanner. | Files may be transcoded from DPX using RAWcooked or captured directly to FFV1/MKV via an approved scanner. | Files may be transcoded from DPX using RAWcooked or captured directly to FFV1/MKV via an approved scanner. |
* Where audio is applicable.
Mezzanine File Specifications: Film Group 1 & 2 (Motion Picture Film, Silent / Sound)
Source format | 35mm | 16mm | 8mm / Super 8mm / Double 8mm |
---|---|---|---|
Bit depth | 10 bit | 10 bit | 10 bit |
Resolution** | 1920 x 1080 | 1920 x 1080 | 1920 x 1080 |
Display aspect ratio** | 16:9 pillarboxed / letter boxed as needed | 16:9 pillarboxed / letter boxed as needed | 16:9 pillarboxed / letter boxed as needed |
Video codec | ProResHQ | ProResHQ | ProResHQ |
File wrapper | QuickTime (.mov) | QuickTime (.mov) | QuickTime (.mov) |
Frame size | 1920 x 1080 | 1920 x 1080 | 1920 x 1080 |
Color space | YUV | YUV | YUV |
Chroma subsampling | 4:2:2 | 4:2:2 | 4:2:2 |
Frame rate | (Same as preservation master) | (Same as preservation master) | (Same as preservation master) |
Scan type | Progressive | Progressive | Progressive |
Pixel aspect ratio | 1.000 | 1.000 | 1.000 |
Audio codec* | PCM | PCM | PCM |
Audio bit rate* | 2304 kbps | 2304 kbps | 2304 kbps |
Audio bit depth* | 24 bit | 24 bit | 24 bit |
Audio sampling rate* | 48000 Hz | 48000 Hz | 48000 Hz |
Audio channels* | same as Preservation Master* | same as Preservation Master* | same as Preservation Master* |
Image corrections | Color corrected for dye fading, cropped to picture - no frame-lines or sound track visible, non-anamorphic | Color corrected for dye fading, cropped to picture - no frame-lines or sound track visible, non-anamorphic | Color corrected for dye fading, cropped to picture - no frame-lines or sound track visible, non-anamorphic |
* Where audio is applicable.
Service Copy File Specifications: Film Group 1 & 2 (Motion Picture Film, Silent / Sound)
Source format | 35mm | 16mm | 8mm / Super 8mm / Double 8mm |
---|---|---|---|
Bit depth | 8 bit | 8 bit | 8 bit |
Resolution** | 1920 x 1080 | 1920 x 1080 | 1920 x 1080 |
Display aspect ratio** | 16:9 pillarboxed / letter boxed as needed | 16:9 pillarboxed / letter boxed as needed | 16:9 pillarboxed / letter boxed as needed |
Video codec | H.264 | H.264 | H.264 |
File wrapper | MPEG-4 (.mp4) | MPEG-4 (.mp4) | MPEG-4 (.mp4) |
Color space | YUV | YUV | YUV |
Chroma subsampling | 4:2:0 | 4:2:0 | 4:2:0 |
Frame size | 1920 x 1080 | 1920 x 1080 | 1920 x 1080 |
Frame rate | (Same as preservation master) | (Same as preservation master) | (Same as preservation master) |
Scan type | Progressive | Progressive | Progressive |
Pixel aspect ratio | 1.000 | 1.000 | 1.000 |
Video bit rate | CRF 21 (variable bitrate) for in-house workflows; vendors may use constant bitrate (CBR) encoding at 8 Mbps to approximate equivalent visual quality | CRF 21 (variable bitrate) for in-house workflows; vendors may use constant bitrate (CBR) encoding at 8 Mbps to approximate equivalent visual quality | CRF 21 (variable bitrate) for in-house workflows; vendors may use constant bitrate (CBR) encoding at 8 Mbps to approximate equivalent visual quality |
Audio codec* | AAC | AAC | AAC |
Audio bit rate* | 320 kbps (CBR) | 320 kbps (CBR) | 320 kbps (CBR) |
Audio sampling rate* | 48000 Hz | 48000 Hz | 48000 Hz |
Audio channels* | same as Mezzanine* | same as Mezzanine* | same as Mezzanine* |
Image corrections | Color corrected for dye fading, cropped to picture - no frame-lines or sound track visible, non-anamorphic | Color corrected for dye fading, cropped to picture - no frame-lines or sound track visible, non-anamorphic | Color corrected for dye fading, cropped to picture - no frame-lines or sound track visible, non-anamorphic |
* Where audio is applicable.
Film Group 3: Audio Film
Preservation Master File Specifications: Film Group 3 (Audio Film)
Source format | 35mm | 16mm | 8mm / Super 8mm / Double 8mm |
---|---|---|---|
Audio codec | FLAC | FLAC | FLAC |
Wrapper | FLAC (.flac) | FLAC (.flac) | FLAC (.flac) |
Bit depth | 24 bit | 24 bit | 24 bit |
Sampling rate | 96000 Hz | 96000 Hz | 96000 Hz |
Number of audio channels | (same as source) | (same as source) | (same as source) |
Other characteristics | If there are tones / sync marks present, they must be captured or resolved and described in metadata signal notes. | If there are tones / sync marks present, they must be captured or resolved and described in metadata signal notes. | If there are tones / sync marks present, they must be captured or resolved and described in metadata signal notes. |
Edit Master File Specifications: Film Group 3 (Audio Film)
Source format | 35mm | 16mm | 8mm / Super 8mm / Double 8mm |
---|---|---|---|
Audio codec | FLAC | FLAC | FLAC |
Wrapper | FLAC | FLAC | FLAC |
Bit depth | equal to preservation master | equal to preservation master | equal to preservation master |
Sampling rate | equal to preservation master | equal to preservation master | equal to preservation master |
Number of audio channels | equal to preservation master | equal to preservation master | equal to preservation master |
Other characteristics | If there are tones / sync marks present, they must be captured and described in metadata signal notes | If there are tones / sync marks present, they must be captured and described in metadata signal notes | If there are tones / sync marks present, they must be captured and described in metadata signal notes |
Note: Unlike Film Groups 1 & 2, Film Group 3 (Audio film) uses Edit Masters rather than Mezzanine files. Service copies are not currently required for this group. The Edit Master serves as the primary access file.
Video Media
Deliverables
For each original recording, the following should be produced:
- One preservation master file
- If EIA-608 (line 21) captions are present in source, one closed captions sidecar file
- One service copy file
- One JSON metadata file per media file (see JSON Metadata Deliverables)
- Optional image files as described in Image Deliverables
Capture tools Preservation master video files must be generated by professional-grade capture devices and software, with either direct capture to FFV1/FLAC/MKV, or transcoding from V210/PCM/MOV. Specific FFmpeg transcoding recipes will be provided by NYPL to ensure consistency.
Guidelines: Preservation Master Files, All Groups
- Characteristics intrinsic to the broadcast standard of the source material, including frame rate, pixel aspect ratio, field dominance, resolution, and recording standard (NTSC, PAL, SECAM, etc.) should be preserved.
- Signal extraction must be optimal, and carried out using the equipment and accessories that are appropriate for the original format characteristics.
- The most direct and clean signal path must be used at all times from source to destination. There may be no devices inserted in the signal path that are not being used. If there are multiple destination formats being used in the transfer the signal path must be routed in parallel. No daisy-chaining of devices may occur.
- The highest quality signal format (composite, S-Video, Component, SDI, etc.) available for the source media type must be used throughout the entirety of the signal path from source through destination. Exceptions to this must be explained and requested prior to performing the transfer.
- Luminance, black, and color levels should be adjusted to existing color bars if they are present on tape and look accurate. If color bars are not present or are clearly inaccurate, preview each tape in order to adjust levels according to the content of the tape using known references (such as blue sky, known blacks and whites, flesh tone, etc.). Luma should be adjusted to fall within broadcast range (100 IRE max) and must not exceed 110 IRE.
- The transfer should capture all content recorded on the original object, including any bars and tone, slates, or other material coming before the start of the recorded program.
- The recording should run until the end of the recorded content (picture and sound). If this endpoint cannot be unambiguously determined, the recording should run until the end of the original object.
- If present on the source tape, EIA-608 (line 21) closed captions must be captured.
Silent Audio Channels
-
If a channel is present but silent, it should be captured and noted in the JSON file, indicating that the tape and the channels in the preservation master file are silent. For example:
- When soundField is set as “ch.1:mono, ch.2: none” instead of just “mono”, it implies that the second channel exists, and source.audioRecording.numberOfAudioChannels should be set to 2. The preservation master file delivered will contain two audio channels, one of which is silent.
-
In cases where two silent channels are detected as actual channels (i.e., recorded with no sound rather than not recorded at all), both channels should be captured and included in the preservation master. Additionally, a signalNote should be added to indicate that the tape is silent.
Timecode
- Two forms of legacy/source timecode should be retained: LTC Timecode, recorded on an audio channel, should be captured as an audio stream in the resulting preservation master file; VITC timecode, if present, should be captured through the use of appropriate playback devices and a carefully routed SDI signal chain.
Video Preservation Master Sidecar Files
EIA-608 (line 21) Closed Captions (.scc)
- If EIA-608 (line 21) captions are present in the source object, a sidecar Scenarist Closed Caption File (.scc) must be created.
- The file should follow the naming convention:
division_PrimaryID_v01_pm.scc
- Place the file alongside the video preservation master in the
PreservationMasters
directory.
QCTools Report
- Each video preservation master file may also receive a corresponding QCTools report as a gzipped XML file.
- The file should follow the naming convention:
division_PrimaryID_v01_pm.mkv.qctools.xml.gz
- Place the file alongside the preservation master file in the
PreservationMasters
directory.
Example directory structure:
PrimaryID
├── data
│ ├── PreservationMasters
│ │ ├── division_PrimaryID_v01_pm.mkv
│ │ ├── division_PrimaryID_v01_pm.scc
│ │ └── division_PrimaryID_v01_pm.mkv.qctools.xml.gz
│ └── ServiceCopies
│ └── division_PrimaryID_v01_sc.mp4
Video Group 1: Analog and Digital Cassettes, Analog Open Reel
Preservation Master File Specifications
Attribute | Specification |
---|---|
Video codec | FFv1 version 3 |
Data compression | Lossless, Intra-frame (GOP-1) only |
Color space | YUV |
Chroma subsampling | 4:2:2 |
Bit depth | 10-bit |
File wrapper | Matroska (.mkv) |
Frame rate | SD: 29.97 (NTSC) or 25 (PAL); HD: same as source |
Frame size | SD: 720x486 (NTSC) or 720x576 (PAL); HD: 1920x1080 |
Broadcast standard | (Same as original media) |
Pixel aspect ratio | SD: 0.909 (NTSC) or 1.091 (PAL); HD: 1.000 |
Slices | 24 |
Slicecrc | 1 |
Audio format | FLAC |
Audio bit depth | 24 bit |
Audio sampling rate | 48 kHz |
Audio channels | (Same as original media, see guidelines for silent channels) |
Video Group 2: DV
NYPL prefers native capture of DV content over FireWire whenever possible. All natively captured DV files must be processed using dvpackager, part of the DVRescue project, and rewrapped as Matroska (.mkv).
By default, dvpackager will segment files when there are changes to key signal parameters (such as aspect ratio or broadcast standard). NYPL prefers a minimal number of segments, but splitting is acceptable when necessary due to such changes. Segments are initially labeled as part1, part2, etc., but must be renamed to follow NYPL’s filenaming convention using region identifiers (e.g., _v01f01r01, _v01f01r02, etc.).
When native capture is not possible due to object condition or playback issues, DV or HDV tapes may be captured via SDI as V210/MOV and subsequently transcoded and rewrapped as .mkv.
Preservation Master File Specifications: DV Cassettes
Attribute | Specification | Notes |
---|---|---|
Video codec | DV | Native DV stream extracted from source |
File wrapper | Matroska (.mkv) | DV files must be processed using dvpackager and rewrapped as .mkv |
Other characteristics | (Same as source) | Splitting allowed when aspect ratio or broadcast standard changes mid-recording. Filenames must follow NYPL’s regional naming convention. |
Alternate method | FFv1 in Matroska (.mkv) | If native capture fails, SDI capture to V210/MOV may be transcoded and rewrapped. |
SDI Capture Fallback Specifications (when native DV capture is not possible)
Attribute | Specification |
---|---|
Initial capture codec | V210 (10-bit uncompressed) |
Initial capture wrapper | QuickTime (.mov) |
Transcode codec | FFv1 version 3 |
Final wrapper | Matroska (.mkv) |
Audio | FLAC (24 bit, 48kHz) |
Video Group 3: HDV
NYPL prefers native capture of HDV content via FireWire, preserving the original MPEG-2 transport stream. Files should be saved as .m2t
and subsequently rewrapped as Matroska (.mkv
) using a lossless method that retains all original stream characteristics.
Native capture ensures the preservation of embedded metadata, timecode, and recording characteristics. NYPL prefers that a single .m2t
stream is retained wherever possible, though segmentation is acceptable if necessitated by signal changes (e.g., aspect ratio shifts, standard transitions). Segmented files must be renamed according to NYPL’s filenaming convention for regions (e.g., _v01f01r01
, _v01f01r02
, etc.).
When native capture is not possible due to object condition or playback issues, HDV tapes may be captured via SDI as V210/MOV and then transcoded to FFV1/FLAC in Matroska (.mkv
).
Preservation Master File Specifications: HDV Cassettes
Attribute | Specification | Notes |
---|---|---|
Video codec | MPEG-2 | Native HDV stream extracted from source |
File wrapper | Matroska (.mkv) | Original .m2t must be rewrapped as .mkv , preserving stream characteristics |
Audio codec | MPEG-1 Layer II or AC-3 (as in source) | Retain original audio streams; do not convert |
Other characteristics | (Same as source) | Segment splitting allowed if required by stream changes; use NYPL filenaming convention for regions |
Alternate method | FFV1/FLAC in Matroska (.mkv) | If native capture fails, SDI capture to V210/MOV may be transcoded and rewrapped using the same parameters as the DV fallback method |
Video Group 4: DVD-Video
Preservation Master File Specifications: DVD-Video
Attribute | Specification |
---|---|
File system | Preserve as-is from source disc; may include UDF, ISO 9660, or a hybrid structure |
File wrapper | ISO (.iso) |
Other characteristics | Full disc image must retain original structure, including VIDEO_TS and AUDIO_TS directories where present |
Video Group 5: Video CDs (VCD)
Preservation Master File Specifications: Video CDs (VCD)
Attribute | Specification |
---|---|
File wrapper | BIN/CUE pair |
.bin file | Full raw-sector copy of the disc (2352-byte sectors, Mode 2 Form 2) |
.cue file | Track layout, indexing, mode and sector metadata for the .bin |
File system | File system: Create full raw sector image that preserves the complete disc structure, including the CD-ROM XA file system and all AUDIO/VIDEO DAT files |
Other | Use the same naming convention as other PM files |
Service Copy Specifications for Video
Service Copy File Specifications: Video Groups 1 and 2
Attribute | Specification |
---|---|
Video codec | H.264/MPEG-4 AVC |
Video bit rate | CRF 21 (variable bitrate) for in-house workflows; vendors may use constant bitrate (CBR) encoding at 3.5 Mbps to approximate equivalent visual quality |
Color space | 4:2:0 YUV |
Bit depth | 8 bit |
File wrapper | MPEG-4 (.mp4) |
Frame rate | 59.94 (NTSC) or 50 (PAL) |
Frame size | 720x480 (NTSC) or 720x576 (PAL) |
Broadcast standard | (Same as original media) |
Pixel aspect ratio | 0.889 (NTSC 4:3) or 1.091 (PAL 4:3); 1.212 (NTSC DV Widescreen); 1.454 (PAL DV Widescreen) |
Audio codec | AAC |
Audio bit rate | 320 kbps (CBR) |
Audio sampling rate | 48 kHz |
Audio channels | 2 (see examples) |
Service Copy File Specifications: Video Group 4 (DVD-Video)
Due to the variety of encoding structures present on source DVDs, NYPL allows a broader range of specifications for service copies derived from DVD (video group 4). Files must comply with the following general characteristics, with resolution, frame rate, display aspect ratio, and pixel aspect ratio varying according to the source.
Attribute | Specification |
---|---|
Video codec | H.264 / MPEG-4 AVC |
Video bit rate | CRF 21 (variable bitrate) for in-house workflows; vendors may use constant bitrate (CBR) encoding at 3.5 Mbps to approximate equivalent visual quality |
Color space | YUV |
Chroma subsampling | 4:2:0 |
Bit depth | 8bit |
File wrapper | MPEG-4 (.mp4) |
Frame rate | NTSC: 29.970 / 59.940 / 23.976 / 47.952; PAL: 25.000 / 50.000 |
Frame size | Based on source DVD: 720x480, 704x480, 352x480, 352x240 (NTSC); 720x576, 352x576, 352x288 (PAL) |
Broadcast standard | (Same as original media) |
Display aspect ratio | 4:3 or 16:9 (1.333, 1.777, 1.778) |
Pixel aspect ratio | Based on source; ranges from 0.889 to 2.909 |
Audio codec | AAC |
Audio channels | 2 |
Audio sampling rate | 48000 Hz |
Audio bit rate | 320 kbps (CBR) |
Note: While these specs permit a variety of technical values to accommodate the source material, all DVD-derived service copies must validate against the NYPL MediaConch DVD SC policy.
Note for VCDs: Apply the same service-copy settings as for DVDs (see Service copy specifications: video group 4), transcoding the MPEG-1 DAT streams into H.264/MP4 per spec.
Additional Video Specifications
Anamorphic Video
- Service copies created from anamorphic preservation masters must be reformatted to true 16:9 (widescreen). Do not letterbox or pillarbox.
Complex Audio Configurations
- In general, when multiple distinct language tracks are present as separate tracks, these should be retained in the service copy to preserve access to all content.
- If distinct languages appear on individual channels of a single track and the result is difficult to understand (e.g., overlapping language audio), consult with NYPL. In most cases, the service copy should retain the English channel and omit the others to ensure accessibility.
- If there is only audible content in a single channel of any number of channels in a source tape and preservation master, the channel containing audible content should be duplicated to produce a 2-channel service copy for improved user experience.
- If a channel contains audible LTC (Longitudinal Timecode), it must be removed from the service copy. The remaining “normal” audio content should be duplicated across both channels.
Trimming of Heads and Tails
- Color bars must not be trimmed.
- Long tails of black / “snow” / unrecorded content may be trimmed after confirmation that there is no visible or audible recorded content.
- Trimming must not result in an abrupt end of visible or audible content.
Audio Media
Deliverables
For each collection object, the following should be produced:
- One or more preservation master file(s)
- One edit master file per preservation master
- One JSON metadata file per media file (see JSON Metadata Deliverables)
- For optical audio only, one CUE file per preservation master file
- Optional image files as described in Image Deliverables
Capture tools
Preservation master and edit master files must be captured/encoded as Broadcast Wave Format (BWF). Files that exceed the Broadcast Wave Format 4GB file size limitation should be captured as RF64. Post-capture, all Broadcast Wave and RF64 files must be transcoded to the FLAC codec and container using the FLAC Utility, retaining embedded metadata and original modification times. Original capture as Wave64 (.w64) is not acceptable.
Guidelines: Preservation Master Files, Audio
- Preservation master files must comply with technical recommendations outlined by the International Association of Sound and Audiovisual Archives (IASA-TC 03, version 3).
- Signal extraction must be complete, including lead-in and play-out.
- Analog-to-Digital conversion must use converters meeting FADGI’s Audio ADC Performance Specifications.
- No signal processing may be applied during ADC transfer (e.g., no EQ, gain adjustment, dither, or noise reduction).
Signal Extraction
- Extraction must use equipment and accessories appropriate for the original format.
-
Flat (unmodified) transfers are required, with limited exceptions:
- de-emphasis of stated pre-emphasis (EQ)
- decoding of stated noise-reduction encoding
- Both intended and unintended signal content must be captured.
- One- or two-channel interleaved preservation masters may be created depending on the original recording.
- Levels may be adjusted only if distortion or clipping occurs; this must be documented in metadata.
- Sync tones (e.g. Pilottone) must be captured or resolved.
- All grooved media must include needle-drop and needle-lift.
Special circumstances and object-file relationships
- Faces: Each side of a cassette or disc is a discrete Face with a separate preservation master.
-
Regions: New Region required for changes in speed or sampling rate.
- Speed changes: 5-second overlap required between PM files.
- Sampling rate changes: Overlap not required; include timestamp note in metadata.
Multi-track Preservation Masters
When dealing with multi-track recordings (e.g., 8-, 16-, 24-track open-reel tapes, multi-channel DAT or N-track digital formats), follow these guidelines:
-
Parallel, synchronized streams Each track must be recorded as its own file, with precisely the same duration and time-alignment. This preserves phase relationships for later mixdown or analysis.
- Filenaming for stream identifiers Append a stream index identifier (
s01
,s02
, …) between the face/region code and role suffix.- Format:
division_PrimaryID_vXXfYYsZZ_pm.flac
sZZ
= stream number, zero-padded to two digits.
- Format:
-
No trimming or processing Preservation masters must be “flat” transfers: no edits, fades, trims, or level adjustments. All overlap and dead-air at splice points remains intact.
- Edit masters No trimming or alteration is allowed for Edit Masters derived from these streams.
Example: 24-track, two-inch open-reel tape (Face 1, Region 1):
myd_123456_v01f01s01_pm.flac
myd_123456_v01f01s02_pm.flac
myd_123456_v01f01s03_pm.flac
…
myd_123456_v01f01s24_pm.flac
Audio Group 1: Analog Magnetic
Preservation Master File Specifications
Attribute | Specification |
---|---|
Audio codec | FLAC |
File wrapper | FLAC (.flac) |
Bit depth | 24 |
Sampling rate | 96000 Hz |
Number of channels | (same as source) |
Audio Group 2: Digital Magnetic
Whenever possible, digital magnetic audio formats should be transferred using original playback equipment that supports native digital extraction. This approach—similar to our preferred method for DV video—ensures bit-perfect preservation of the original digital signal. If native digital migration is not possible due to equipment availability or format degradation, analog playback and re-digitization may be used as a fallback, pending approval from NYPL. Vendors must confirm and document the transfer method used for each asset.
Preservation Master File Specifications
Attribute | Specification |
---|---|
Audio codec | FLAC |
File wrapper | FLAC (.flac) |
Bit depth | (same as source, commonly 16 bit) |
Sampling rate | (same as source, commonly 48000 Hz, 44100 Hz, 32000 Hz (DAT), 44056 Hz (PCM-1610/30, F1)) |
Number of channels | (same as source) |
Audio Group 3: CD-DA
NYPL’s preference is for all audio content present on the disc—including multiple sessions, if applicable—to be extracted and preserved. Tracks should be concatenated into a single WAV file according to the original track order defined in the CUE sheet. This ensures accurate representation of the disc’s structure and content while simplifying preservation workflows.
Preservation Master File Specifications
Attribute | Specification |
---|---|
Audio codec | FLAC |
File wrapper | FLAC (.flac) |
Bit depth | 16 bit |
Sampling rate | 44100 Hz |
Number of channels | 2 (left + right discrete) |
Other characteristics | CD tracks should be concatenated into a single file according to Cue sheet |
CUE sheet files
- Generate a .cue file during BWF capture before FLAC transcode.
- Name to match the WAV file, e.g.,
myh_123456_v01f01_pm.cue
- Place alongside the WAV in the
PreservationMasters
directory only:
PrimaryID
├── data
│ ├── PreservationMasters
│ │ ├── division_PrimaryID_v01f01_pm.flac
│ │ └── division_PrimaryID_v01f01_pm.cue
│ └── EditMasters
│ └── division_PrimaryID_v01f01_em.flac
Audio Group 4: MiniDisc
For MiniDiscs, NYPL’s preference is to capture each track as a discrete file, preserving the structure and segmentation of the original disc. This approach accommodates the non-contiguous nature of audio data on MD and reflects the limitations and capabilities of current extraction tools.
Preservation masters are captured in ATRAC format (ATRAC1, ATRAC3, ATRAC3plus, or PCM for Hi-MD) and stored with the .aea
file extension. Alongside each set of preservation masters, a CSV file is included that documents the track layout and metadata as exported by the transfer software.
Preservation Master File Specifications
Attribute | Specification |
---|---|
Audio codec | ATRAC1 / ATRAC3 / ATRAC3plus / PCM (as present on source) |
File wrapper | AEA (.aea) |
Bit depth | As present on source (typically 16-bit for PCM) |
Sampling rate | As present on source (commonly 44100 Hz) |
Number of channels | 2 (left + right discrete) |
Other characteristics | One .aea file per track; no concatenation. |
Additional file | One CSV track metadata file (UTF-8, comma-separated) |
Placement | CSV must be placed alongside .aea files in the PreservationMasters directory |
CSV Track Metadata File
The accompanying CSV must retain its original structure, as exported by the MiniDisc extraction software (e.g.,Web MiniDisc Pro. It includes fields such as:
INDEX,GROUP RANGE,GROUP NAME,NAME,FULL WIDTH NAME,ALBUM,ARTIST,DURATION,ENCODING,BITRATE
The CSV filename should match the root of the preservation master files and be placed in the PreservationMasters
directory alongside the .aea
files—for example:
PrimaryID
├── data
│ ├── PreservationMasters
│ │ ├── division_PrimaryID_v01f01r01_pm.aea
│ │ ├── division_PrimaryID_v01f01r02_pm.aea
│ │ └── division_PrimaryID_v01f01_pm.csv
│ └── EditMasters
│ ├── division_PrimaryID_v01f01r01_em.flac
│ └── division_PrimaryID_v01f01r02_em.flac
Audio Group 5: Grooved Disc
Preservation Master File Specifications
Attribute | Specification |
---|---|
Audio codec | FLAC |
File wrapper | FLAC (.flac) |
Bit depth | 24 |
Sampling rate | 96000 Hz |
Number of channels | 2 (left + right discrete) |
Reproduction Details
-
Playback EQ must be applied and documented in metadata.
- Default for transcription discs: 400Hz turnover, -12dB @ 10kHz
-
Equipment requirements:
- Time-Step or KAB Souvenir phono EQs (Owl EQs not acceptable)
- Cartridge set to “lateral” unless otherwise specified
- Full stylus kit on-hand; best stylus choice noted in metadata
- Arm must be long enough and vertically adjustable
- Arm must remain parallel during playback
-
For multiple Regions due to speed/stylus change, create separate PMs.
- Reattempt retracking if skips occur; document outcomes
Audio Group 6: Grooved Cylinder
Preservation Master File Specifications
Attribute | Specification |
---|---|
Audio codec | FLAC |
File wrapper | FLAC (.flac) |
Bit depth | 24 |
Sampling rate | 96000 Hz |
Number of channels | Mono (1) |
Reproduction Details
- Transfer must be flat (no EQ), cartridge set to “vertical”
- Appropriate stylus kit required; best choice noted in metadata
- Groove pitch (TPI), speed, and cylinder material must be assessed and noted prior to capture
Edit Master File Specifications: All Audio Groups
Attribute | Specification |
---|---|
Audio codec | FLAC |
File wrapper | FLAC (.flac) |
Bit depth | Equal to preservation master |
Sampling rate | Equal to preservation master |
Number of channels | Mono (1) or Stereo (2) * |
* For mono recordings, channel count should reflect true mono configuration.
Guidelines for Edit Masters
- Needle-drop/lift, unrecorded portions, and equipment noise must be trimmed
- Do not trim multi-track masters; silence is intentional
- Eliminate 5-second overlaps where used in Preservation Masters
- Level adjustments up to -2dB allowed when needed
- Mono recordings must be rendered as true mono
Data Media
Deliverables
For each collection object, the following should be produced:
- One or more preservation master file(s)
- One JSON metadata file per media file (see JSON Metadata Deliverables)
- Optional image files as described in Image Deliverables
Filenaming and metadata protocol for misidentified discs:
- If a disc described as Audio or Video is actually a data disc, omit any Face metadata.
- Use Volume-level naming only, e.g.,
abc_123456_v01_pm.iso
- Update format/media type per Data Optical Disc JSON schema
Capture tools
- Use sector-by-sector imaging tools
- Output as ISO file
- Retain all file system structures as-is (UDF, ISO 9660, or hybrid)
Preservation Master File Specifications: Data Optical Disc
Attribute | Specification |
---|---|
File system | Preserve as-is; UDF, ISO 9660, or hybrid allowed |
File wrapper | ISO (.iso) |
Other characteristics | Same as source |
Image Deliverables
Images
Photographic documentation is encouraged for all magnetic and optical media and their enclosures, especially when significant bibliographic annotations, labeling, or physical features are present. Where appropriate, photographic documentation of film media may also be included at NYPL’s discretion.
Image file naming convention and controlled vocabulary
Image files should follow the same pattern as the media files, and be placed in the Images
directory, at the same hierarchical level as the PreservationMasters
and EditMasters
directories. The following controlled vocabulary must be appended as the final suffix in each image file name:
- assetfront
- assetback
- assetside
- assetrightside (if applicable)
- assetleftside (if applicable)
- assettop (if applicable)
- assetbottom (if applicable)
- boxfront
- boxback
- boxside
- boxrightside (if applicable)
- boxleftside (if applicable)
- boxtop (if applicable)
- boxbottom (if applicable)
- ephemera
Example structure:
PrimaryID
├── data
│ ├── PreservationMasters
│ ├── EditMasters
│ └── Images
│ ├── div_PrimaryID_v01f01.jpg
│ ├── div_PrimaryID_v01f02.jpg
│ ├── div_PrimaryID_v01_assetfront.jpg
│ ├── div_PrimaryID_v01_assetback.jpg
│ └── div_PrimaryID_v01_assetside.jpg
Image Specifications for Photographs of Media Objects
Attribute | Specification |
---|---|
Source format | All magnetic and optical media |
Bit depth | 8 bits per RGB channel |
Colorspace (ICC) | sRGB IEC61966-2.1 |
File format & extension | JPEG (.jpg) |
Resolution | Minimum 400 true DPI, without interpolation |
Surfaces | All enclosures: front, back, top, bottom, left side, right side |
Notes | Images with text/content should be oriented correctly. If a surface lacks content, it may be skipped. If the media object has extensive labeling, media and enclosure may be photographed together. |
JSON Metadata Deliverables
NYPL metadata deliverables must adhere to the customized fields and controlled vocabulary defined in NYPL’s ami-metadata JSON Schema repository.
-
Per-file packaging Each audio, video, or film media file must have exactly one JSON sidecar. Images do not require JSON metadata.
-
Schema validation JSON must validate against the appropriate schema. Any validation errors (including BOM/encoding issues) will prompt a full redelivery of the Bagged asset.
- Naming Use the same filename “root” as the media file, with a
.json
extension only.- ✅
division_PrimaryID_v01_pm.json
- ❌
division_PrimaryID_v01_pm.mov.json
- ✅
-
Encoding JSON files must be encoded as UTF-8 without a Byte Order Mark (BOM). Including a BOM often breaks downstream validation; please ensure your editor or export tool does not insert one.
- Where to find samples & instructions See the
samples/
folder in the schema repo for sample JSON, plus README-based validation steps.