SHNTOOL
Section: local (1)
Updated: March 2009
Page Index
NAME
shntool - a multi-purpose WAVE data processing and reporting utility
SYNOPSIS
shntool
mode ...
shntool
[
CORE OPTION]
DESCRIPTION
shntool
is a command-line utility to view and/or modify WAVE data and properties.
It runs in several different operating modes, and supports various lossless audio formats.
shntool
is comprised of three parts - its core,
mode
modules, and
format
modules.
This helps to make the code easier to maintain, as well as aid other programmers in developing new functionality.
The distribution archive contains a file named 'modules.howto' that describes how to create a new mode
or format module, for those so inclined.
Mode modules
shntool
performs various functions on WAVE data through the use of mode modules.
The core of
shntool
is simply a wrapper around the mode modules. In fact, when
shntool
is run with a valid mode as its first argument,
it essentially runs the main procedure for the specified mode, and quits.
shntool
comes with several built-in modes, described below:
-
- len
-
Displays length, size and properties of PCM WAVE data
- fix
-
Fixes sector-boundary problems with CD-quality PCM WAVE data
- hash
-
Computes the MD5 or SHA1 fingerprint of PCM WAVE data
- pad
-
Pads CD(hyquality files not aligned on sector boundaries with silence
- join
-
Joins PCM WAVE data from multiple files into one
- split
-
Splits PCM WAVE data from one file into multiple files
- cat
-
Writes PCM WAVE data from one or more files to the terminal
- cmp
-
Compares PCM WAVE data in two files
- cue
-
Generates a CUE sheet or split points from a set of files
- conv
-
Converts files from one format to another
- info
-
Displays detailed information about PCM WAVE data
- strip
-
Strips extra RIFF chunks and/or writes canonical headers
- gen
-
Generates CD-quality PCM WAVE data files containing silence
- trim
-
Trims PCM WAVE silence from the ends of files
For more information on the meaning of the various command-line options for each mode, see the
MODE-SPECIFIC OPTIONS
section below.
For convenience, each mode can specify an alternate name or alias that will invoke it
(this feature is currently only available on systems that support symbolic or hard linking).
In particular, each mode is aliased to 'shn<mode>'.
For instance, running
shnlen
is equivalent to running
shntool
len
- thus saving a few keystrokes.
Format modules
File formats are abstracted from
shntool
through the use of format modules. They provide a means for
shntool
to tranparently read and/or write different file formats.
This abstraction allows
shntool
to concentrate on its job without worrying about the
details of each file format.
The following formats are currently supported:
-
- wav
-
RIFF WAVE file format
- aiff
-
Audio Interchange File Format (AIFF and uncompressed/sowt AIFF-C only) (via 'sox'):
<http://sox.sourceforge.net/>
- shn
-
Shorten low complexity waveform coder (via 'shorten'):
<http://www.softsound.com/Shorten.html>
<http://www.etree.org/shnutils/shorten/>
- flac
-
Free Lossless Audio Codec (via 'flac'):
<http://flac.sourceforge.net/>
- ape
-
Monkey's Audio Compressor (via 'mac'):
<http://www.monkeysaudio.com/>
<http://supermmx.org/linux/mac/>
- alac
-
Apple Lossless Audio Codec (via 'alac'):
<http://craz.net/programs/itunes/alac.html>
- tak
-
(T)om's lossless (A)udio (K)ompressor (via 'takc'):
<http://www.thbeck.de/Tak/Tak.html>
- ofr
-
OptimFROG Lossless WAVE Audio Coder (via 'ofr'):
<http://www.losslessaudio.org/>
- tta
-
TTA Lossless Audio Codec (via 'ttaenc'):
<http://tta.sourceforge.net/>
- als
-
MPEG-4 Audio Lossless Coding (via 'mp4als'):
<http://www.nue.tu-berlin.de/forschung/projekte/ -
lossless/mp4als.html>
- wv
-
WavPack Hybrid Lossless Audio Compression (via 'wavpack' and 'wvunpack'):
<http://www.wavpack.com/>
- lpac
-
Lossless Predictive Audio Compression (via 'lpac'):
<http://www.nue.tu-berlin.de/wer/liebchen/lpac.html>
- la
-
Lossless Audio (via 'la'):
<http://www.lossless-audio.com/>
- bonk
-
Bonk lossy/lossless audio compressor (via 'bonk'):
<http://www.logarithmic.net/pfh/bonk>
- kxs
-
Kexis lossless WAV file compressor (via 'kexis'):
<http://www.sourceforge.net/projects/kexis/>
- mkw
-
MKW Audio Compression format (via 'mkwcon'):
<http://www.etree.org/shnutils/mkwcon/>
- cust
-
Custom output format module (output only, useful for encoding to a format that
shntool
does not yet support)
- term
-
sends output to the terminal
- null
-
sends output to /dev/null (output only, useful for dry-runs in several modes, such as
fix
mode or
strip
mode)
When reading files for input,
shntool
automatically discovers which, if any, format module handles each file.
In modes where files are created as output, you can specify what the output format should be - otherwise,
shntool
decides for you by selecting the first format module it finds that supports output (in a default installation, this will be the
wav
format).
CORE OPTIONS
Modeless
When run without a mode,
shntool
takes these options:
- -m
-
Show detailed mode module information
- -f
-
Show detailed format module information
- -a
-
Show default format module arguments
- -v
-
Show version information
- -h
-
Show a help screen
GLOBAL OPTIONS
All modes
All modes support the following options:
- -D
-
Print debugging information
- -F file
-
Specify a file containing a list of filenames to process. This overrides any files specified on the command line or on the terminal.
NOTE: Most modes will accept input filenames from a single source, according to the following order of precedence:
file specified by the
-F
option, otherwise filenames on the command line, otherwise filenames read from the terminal.
- -H
-
Print times in h:mm:ss.{ff,nnn} format, instead of m:ss.{ff,nnn}
- -P type
-
Specify progress indicator type.
type
is one of:
{pct, dot, spin, face, none}.
pct
shows the completion percentage of each operation.
dot
shows the progress of each operation by displaying a '.' after each 10% step toward completion.
spin
shows a spinning progress indicator.
face
shows the progress of each operation by displaying six emoticons that become increasingly happy as the operation nears completion.
none
prevents any progress completion information from being displayed.
The default is
pct.
- -h
-
Show the help screen for this mode
- -i fmt
-
Specify input file format decoder and/or arguments.
The format is: "fmt decoder [arg1 ... argN]", and must be surrounded by quotes.
If arguments are given, then one of them must contain "%f", which will be replaced with the input filename.
Examples:
-
-i
'shn shorten-2.3b'
(use official shorten-2.3b instead of later versions; leave default arguments untouched)
-i
'shn shorten -x -d 2048 %f -'
(force shorten to skip the first 2048 bytes of each file)
- -q
-
Suppress non-critical output (quiet mode).
Output that normally goes to stderr will not be displayed, other than errors or debugging information (if specified).
- -r val
-
Reorder input files?
val
is one of:
{ask, ascii, natural, none}.
The default is
natural.
- -v
-
Show version information
- -w
-
Suppress warnings
- --
-
Indicates that everything following it is a filename
Output modes
Additionally, any mode that creates output files supports the the following options:
- -O val
-
Overwrite existing files?
val
is one of:
{ask, always, never}.
The default is
ask.
- -a str
-
Prefix
str
to base part of output filenames
- -d dir
-
Specify output directory
- -o str
-
Specify output file format extension, encoder and/or arguments.
Format is: "fmt [ext=abc] [encoder [arg1 ... argN (%f = filename)]]", and must be surrounded by quotes.
If arguments are given, then one of them must contain "%f", which will be replaced with the output filename.
Examples:
-
-o
'shn shorten -v2 - %f'
(create shorten files without seek tables)
-o
'flac flake - %f'
(use alternate flac encoder)
-o
'aiff ext=aif'
(override default aiff extension of 'aiff' with 'aif')
-o
'cust ext=mp3 lame --quiet - %f'
(create mp3 files using lame)
- -z str
-
Postfix
str
to base part of output filenames
MODE-SPECIFIC OPTIONS
len mode options
- -U unit
-
Specifies the unit in which the totals will be printed.
unit
is one of:
{b, kb, mb, gb, tb}.
The default is
b.
- -c
-
Do not show column names
- -t
-
Do not show totals line
- -u unit
-
Specifies the unit in which each file will be printed.
unit
is one of:
{b, kb, mb, gb, tb}.
The default is
b.
- len mode output
-
-
The output of len mode may seem cryptic at first, because it attempts to convey a lot of
information in just a little bit of space. But it is quite easy to read once you know what
the columns represent; and in certain columns, what each character in the column means.
Each column is explained below.
- length
-
Shows the length of the WAVE data, in m:ss.nnn (millisecond) format. If the data is CD-quality, then m:ss.ff
is shown instead, where ff is a number from 00 to 74 that best approximates the number of
frames (2352-byte blocks) remaining after m:ss. If all files are CD-quality, the total
length will be shown in m:ss.ff format; otherwise it will be in m:ss.nnn format.
NOTE: CD-quality files are rounded to the nearest frame; all other files are rounded to the
nearest millisecond.
- expanded size
-
Shows the total size of all WAVE chunks within the file (header, data and any extra RIFF chunks).
Essentially this is the size that the file would be if it were converted to .wav format, e.g.
with
shntool
conv.
NOTE: Do not rely on this field for audio size! If you simply want to know how many bytes of
audio are in a file, run it through
info
mode, and look at the "data size" field in its output.
- cdr
-
Shows properties related to CD-quality files.
A 'c' in the first slot indicates that the WAVE data is not [C]D-quality.
A 'b' in the second slot indicates that the CD-quality WAVE data is not cut on a sector [b]oundary.
An 's' in the third slot indicates that the CD-quality WAVE data is too [s]hort to be burned.
A '-' in any of these slots indicates that the particular property is OK or normal. An 'x' in any
of these slots indicates that the particular property does not apply to this file, or cannot
be determined.
- WAVE
-
Shows properties of the WAVE data.
An 'h' in the first slot indicates that the WAVE [h]eader is not canonical.
An 'e' in the second slot indicates that the WAVE file contains [e]xtra RIFF chunks.
A '-' in any of these slots indicates that the particular property is OK or normal. An 'x' in any
of these slots indicates that the particular property does not apply to this file, or cannot
be determined.
- problems
-
Shows problems detected with the WAVE header, WAVE data, or the file itself.
A '3' in the first slot indicates that the file contains an ID[3]v2 header.
An 'a' in the second slot indicates that the audio data is not block-[a]ligned.
An 'i' in the third slot indicates that the WAVE header is [i]nconsistent about data size and/or file size.
A 't' in the fourth slot indicates that the WAVE file seems to be [t]runcated.
A 'j' in the fifth slot indicates that the WAVE file seems to have [j]unk appended to it.
A '-' in any of these slots indicates that the particular problem was not detected. An 'x' in any
of these slots indicates that the particular problem does not apply to this file, or cannot
be determined.
- fmt
-
Shows which file format handled this file.
- ratio
-
Shows the compression ratio for this file.
- filename
-
Shows the name of the file that's being inspected.
fix mode options
NOTE: file names for files created in
fix
mode will be based on the input file name with the string '-fixed' appended to it, and the extension
will be the default extension of the output file format. For example, with an output file format of
shn
the file 'foo.wav' would become 'foo-fixed.shn'.
This can be overridden with the
-a
and/or
-z
global options described above.
- -b
-
Shift track breaks backward to the previous sector boundary. This is the default.
- -c
-
Check whether fixing is needed, without actually fixing anything.
shntool
will exit with status 0 if fixing is needed, and status 1 otherwise.
This can be useful in shell scripts, e.g.:
"if shntool fix -c *; then shntool fix *; else ...; fi"
- -f
-
Shift track breaks forward to the next sector boundary.
- -k
-
Specifies that all files should be processed, even if the first several of them wouldn't be altered, aside from a possible file format change.
The default is to skip the first N files that wouldn't be changed from a WAVE data perspective in order to avoid unnecessary work.
- -n
-
Specifies that the last file created should not be padded with silence to make its WAVE data size a multiple
of 2352 bytes. The default is to pad the last file.
- -u
-
Round track breaks to the nearest sector boundary.
hash mode options
- -c
-
Specifies that the composite fingerprint for all input files should be generated, instead of the default of one fingerprint per file.
The composite fingerprint is simply the fingerprint of the WAVE data from all input files taken as a whole in the order given,
and is identical to the one that would be generated from the joined file if the same files were joined into one large file, with no padding added.
This option can be used to fingerprint file sets, or to identify file sets in which track breaks have been moved around, but no audio has been modified
in any way (e.g. no padding added, no resampling done, etc.).
- -m
-
Generate MD5 fingerprints. This is the default.
- -s
-
Generate SHA1 fingerprints.
pad mode options
NOTE: file names for files created in
pad
mode will be based on the input file name with the string '-prepadded' or '-postpadded' appended to it, and the extension
will be the default extension of the output file format. For example, with an output file format of
shn
and pre-padding specified on the command line, the file 'foo.wav' would become 'foo-prepadded.shn'.
This can be overridden with the
-a
and/or
-z
global options described above.
Be aware that some output format encoders (e.g. flac, ape) automatically
strip headers and/or extra RIFF chunks.
- -b
-
Specifies that the file created should be padded at the beginning with silence to make its WAVE data size a multiple
of 2352 bytes.
- -e
-
Specifies that the file created should be padded at the end with silence to make its WAVE data size a multiple
of 2352 bytes. This is the default action.
join mode options
NOTE: file names for files created in
join
mode will be prefixed with 'joined.', and the extension
will be the default extension of the output file format. For example, with an output file format of
wav
the files 'files*.wav' would become 'joined.wav'.
This can be overridden with the
-a
and/or
-z
global options described above.
- -b
-
Specifies that the file created should be padded at the beginning with silence to make its WAVE data size a multiple
of 2352 bytes. Note that this option does not apply if the input files
are not CD-quality, since padding is undefined in that case.
- -e
-
Specifies that the file created should be padded at the end with silence to make its WAVE data size a multiple
of 2352 bytes. This is the default action. Note that this option does not apply if the input files
are not CD-quality, since padding is undefined in that case.
- -n
-
Specifies that the file created should not be padded with silence to make its WAVE data size a multiple
of 2352 bytes. Note that this option does not apply if the input files
are not CD-quality, since padding is undefined in that case.
split mode options
NOTE: file names for files created in
split
mode are of the form prefixNNN.ext, where NNN is the output file number, and 'ext' is the default extension of the output file format.
If an output file format of 'wav' is used, and the prefix is not altered via the
-n
switch described below, then the output file names will be "split-track01.wav", "split-track02.wav", etc.
This can be overridden with the
-a
and/or
-z
global options described above.
For information on specifying split points, see the
Specifying split points
section below.
- -c num
-
Specifies the number to start counting from when naming output files. The default is 1.
- -e len
-
Prefix each track with
len
amount of lead-in taken from the previous track.
len
must be given in bytes, m:ss, m:ss.ff or m:ss.nnn format.
- -f file
-
Specifies a file from which to read split point data. If not given, then split points are read from the terminal.
- -l len
-
Specifies that the input file should be split into smaller files based on multiples of the
len
time interval.
len
must be given in bytes, m:ss, m:ss.ff or m:ss.nnn format.
- -m str
-
Specifies a character manipulation string for filenames generated from CUE sheets.
These characters, taken one-by-one, represent from/to character translation.
They must always be in pairs. Some examples:
-
- :-
-
Translate all instances of ':' to '-'
- :-/-
-
Translate both ':' and '/' to '-'
- :-/_*x
-
Translate ':' to '-', '/' to '_', and '*' to 'x'
- -n fmt
-
Specifies the file count output format. The default is %02d, which gives two-digit zero-padded numbers (01, 02, 03, ...).
- -t fmt
-
Name output files in user-specified format based on CUE sheet fields.
The following formatting strings are recognized:
-
- %p
-
Performer
- %a
-
Album
- %t
-
Track title
- %n
-
Track number
- -u len
-
Postfix each track with
len
amount of lead-out taken from the next track.
len
must be given in bytes, m:ss, m:ss.ff or m:ss.nnn format.
- -x list
-
Only extract tracks in
list
(comma separated, may contain ranges).
Examples include:
-
- 7
-
Only extract track 7
- 3-5
-
Only extract tracks 3 through 5
- 2-6,9,11-13
-
Only extract tracks 2 through 6, 9, and 11 through 13
- Specifying split points
-
-
Split points simply mark places within the WAVE data of the input file where tracks will be split.
They can be specified in any combination of the following formats:
- bytes
-
where bytes is a specific byte offset
- m:ss
-
where m = minutes and ss = seconds
- m:ss.ff
-
where m = minutes, ss = seconds and ff = frames (75 per second, so ff ranges from 00 to 74)
- m:ss.nnn
-
where m = minutes, ss = seconds and nnn = milliseconds (will be rounded to closest sector boundary,
or the first sector boundary if the closest one happens to be the beginning of the file)
- CUE sheet
-
- a simple CUE sheet, in which each "INDEX 01 m:ss:ff" line is converted to a m:ss.ff split point
Split points must be given in increasing order, and must appear one per line.
If the byte offset calculated from the final split point equals the input file's WAVE data size, then it is ignored.
Since split points specify locations within the input file where tracks will be split, N split points will create N+1 output files.
All m:ss formats will create splits on sector boundaries whenever the input file is CD-quality;
to force non-sector-aligned splits, use the exact byte format.
cat mode options
- -c
-
Specifies that extra RIFF chunks should be suppressed from the output. The default is to write the extra RIFF chunks.
- -d
-
Specifies that the WAVE data should be suppressed from the output. The default is to write the data.
- -e
-
Specifies that the WAVE header should be suppressed from the output. The default is to write the header.
- -n
-
Specifies that the NULL pad byte at end of odd-sized data chunks should be suppressed from the output, if present. The default is to write the NULL pad byte.
This option only applies when WAVE data is also written, otherwise it is ignored.
cmp mode options
- -c secs
-
Sets the number of seconds of audio to use for the byte-shift comparison buffer.
This option only makes sense with the
-s
option. The default is 3 seconds.
- -f fuzz
-
Sets the "fuzz factor" for determining whether byte-shifted data is identical.
fuzz
is a positive integer that represents the maximum number of allowable byte mismatches between the two files
in the area searched by the
-s
option.
This allows one to check for differing bytes between to files that (a) are
byte-shifted and (b) contain at least one error in the area searched by the
-s
option. The higher the fuzz factor, the longer the search takes, so set it
low to begin with (8 or so), and increase it in small steps if needed.
NOTE: this switch can only be used with the
-s
switch.
- -l
-
List offsets and values of all differing bytes. Output is similar to 'cmp -l'; in particular, offsets are 1-based.
Can be used with the
-s
switch.
- -s
-
Check to see whether the WAVE data contained in the input files are identical modulo a byte-shift.
Currently, this will only detect differences up to the first 529200 bytes (equal to 3 seconds of CD-quality data).
This can be used to compare WAVE data within a pre-burned file to WAVE data in the corresponding track
ripped from the burned CD, which is useful if the ripped track came from a CD burned TAO, and thus might have a
2-second gap of silence at the beginning. This option can also help identify a CD burner/CD reader combined read/write offset.
cue mode options
- -c
-
Specifies that a simple CUE sheet should be output. This is the default action.
NOTE: all input files must be CD-quality for CUE sheets to be valid.
- -s
-
Specifies that split points in explicit byte-offset format should be output.
conv mode options
NOTE: file names for files created in
conv
mode will be named based on the input file name. Specifically, if the input
file name ends with the default file extension for that file's format, then the
default extension for the desired output format will replace it; otherwise, it
will be appended to it. For example, for an output format of
shn
and a
wav
input file named 'file.wav', the converted file will be named 'file.shn', since '.wav'
is the default extension for the
wav
format. On the other hand,
given the same situation above, but with an input file named 'file.wave', the
converted file will be named 'file.wave.shn', since '.wave' does not match '.wav'.
This can be overridden with the
-a
and/or
-z
global options described above.
Be aware that some output format encoders (e.g. flac, ape) automatically
strip headers and/or extra RIFF chunks, while others (e.g. sox) might adjust
WAVE data sizes in rare instances in order to align the audio on a block boundary.
- -t
-
Read WAVE data from the terminal.
info mode options
This mode doesn't support any additional options.
strip mode options
NOTE: file names for files created in
strip
mode will be based on the input file name with the string '-stripped' appended to it, and the extension
will be the default extension of the output file format. For example, with an output file format of
wav
the file 'bar.shn' would become 'bar-stripped.wav'.
This can be overridden with the
-a
and/or
-z
global options described above.
Be aware that some output format encoders (e.g. flac, ape) automatically
strip headers and/or extra RIFF chunks, while others (e.g. sox) might adjust
WAVE data sizes in rare instances in order to align the audio on a block boundary.
- -c
-
Specifies that extra RIFF chunks should not be stripped. The default is to remove everything that appears after the first data chunk.
- -e
-
Specifies that WAVE headers should not be made canonical. The default is to canonicalize headers.
gen mode options
NOTE: file names for files created in
gen
mode will be prefixed with 'silence.', and the extension
will be the default extension of the output file format. For example, with an output file format of
wav
the generated file would become 'silence.wav'.
This can be overridden with the
-a
and/or
-z
global options described above.
- -l len
-
Generate files containing
len
amount of silence.
len
must be given in bytes, m:ss, m:ss.ff or m:ss.nnn format.
trim mode options
NOTE: file names for files created in
trim
mode will be based on the input file name with the string '-trimmed' appended to it, and the extension
will be the default extension of the output file format. For example, with an output file format of
shn
the file 'foo.wav' would become 'foo-trimmed.shn'.
This can be overridden with the
-a
and/or
-z
global options described above.
- -b
-
Only trim silence from the beginning of files
- -e
-
Only trim silence from the end of files
ENVIRONMENT VARIABLES
- ST_DEBUG
-
If set, shntool will print debugging information. This is analogous to the
-D
global option, with the exception that debugging is enabled immediately, instead of
when the command-line is parsed.
- ST_<FORMAT>_DEC
-
Specify input file format decoder and/or arguments.
Replace
<FORMAT>
with the format you wish to modify, e.g.
ST_SHN_DEC.
The format of this variable is analagous to the
-i
global option, except that the initial format is not included.
Examples:
-
ST_SHN_DEC='shorten-2.3b'
ST_SHN_DEC='shorten -x -d 2048 %f -'
- ST_<FORMAT>_ENC
-
Specify output file format extension, encoder and/or arguments.
Replace
<FORMAT>
with the format you wish to modify, e.g.
ST_SHN_ENC.
The format of this variable is analagous to the
-o
global option, except that the initial format is not included.
Examples:
-
ST_SHN_ENC='shorten -v2 - %f'
ST_FLAC_ENC='flake - %f'
ST_AIFF_ENC='ext=aif'
ST_CUST_ENC='ext=mp3 lame --quiet - %f'
Note that command-line options take precedence over any of these environment variables.
EXIT STATUS
Generally speaking,
shntool
will exit with status 0 upon success, and status 1 if it encounters an error.
The only exception is when the 'quit' option is selected from within the
interactive file reordering menu, in which case the exist status will be 255.
NOTES
shntool
is a misnomer, since it
processes WAVE data, not shorten data.
The name is a holdover from its early days as 'shnlen',
a program created specifically to extract information
about WAVE data stored within .shn files.
Aliases for
shntool
are prefixed with 'shn' instead of 'wav' to avoid possible collisions with existing programs.
AUTHOR
Jason Jordan <shnutils at freeshell dot org>
Please send all bug reports to the above address.
The latest version of
shntool
can always be found at <http://www.etree.org/shnutils/> or <http://shnutils.freeshell.org/>.
COPYRIGHT
Copyright (C) 2000-2009 Jason Jordan
This is free software. You may redistribute copies of it under the terms of the GNU General Public License
<http://www.gnu.org/licenses/gpl.html>. There is NO WARRANTY, to the extent permitted by law.
REVISION
$Id: shntool.1,v 1.140 2009/03/30 05:59:25 jason Exp $