Known Errors for JHOVE 1.0, 2005-05-26
- The HTML module will sometimes report incorrect line numbers for
syntax errors if the file contains blank lines.
- The TIFF module incorrectly looks for the GeoTIFF ModelTransformationTag
with a value of 34254, not its correct value of 34264.
- The TIFF module may incorrectly report a count mismatch for the
ISOSpeedRatings tag of a TIFF/EP file.
- The TIFF module will report valid files as being only well-formed with an
error message of the form:
"Invalid DateTime separator: 2005-03-09T00:00:00".
This message may be incorrect due to an error in handling the date string,
which is being inappropriately converted to ISO 8601-form prior to
being tested for format validity.
- The HTML may be caught in an endless loop if invoked against some non-HTML
text files containing the "<" character.
- The AIFF module reports an incorrect value for the first sample offset
in files with a non-zero offset field in the SSND chunk.
- The AIFF module reports bitrate reduction data for non-standardly encoded
PCM audio.
- XML-encoded AES audio metadata is not valid relative to the schema
because of badly formatted sample rates.
- The PDF module may report files with malformed date strings as being
valid, and files with well-formed date strings with trailing whitespace as
being invalid.
- The PDF module reports files with non-existent, indirect destination
references as "non-well-formed", rather than "well-formed, but invalid".
- The JPEG module does not report the X and Y sampling frequency for
files meeting the JFIF profile, nor does it report the pixel aspect ratio
for JFIF profile files for which it is defined.
- When displaying text strings reported by various modules, the XML handler
will include any control characters that appear in those strings. However,
control characters other than TAB (0x09), CARRIAGE RETURN (0x0a), and LINE
FEED (0x0d) are not allowed in well-formed XML.
- The "subMessage" attribute is not defined by the jhove.xsd
schema.
- The PDF module throws a Class Cast Exception if the Info key of the
trailer dictionary is not an indirect reference.
- The TIFF parser loses synchronization when reading a pyramidal TIFF
file using the SubIFDs tag (330) with a type of IFD (13) rather than LONG (4).
- The TIFF module only reads the first sub-IFD of a pyramidal TIFF file.
- The XML module may terminate abnormally if a schemaLocation attribute
does not properly define two whitespace-separated URIs, depending upon
the SAX parser being used.
- The JPEG 2000 module may incorrectly identify files produced by the
LuraWave codec as having unrecognized QCC marker segments.
- The PDF module reports the "IncrementalUpdate" property as the total
number of file trailers, not the number of incrementally-updated trailers,
which is one less than the total.
- The TIFF module does not properly parse the EXIF SubsecTimeOriginal
(37521) or SubsecTimeDigitized (37522) properties. This bug also occurs
for JPEG files containing EXIF metadata.
For more information: jhove-support@hulmail.harvard.edu
Copyright 2005-2007 by JSTOR and the President and Fellows of Harvard College
Last updated 2007-02-13