PVC [Image] INDEX * INTRODUCTION * INSTALLATION UNPACKING SETTING THE DESTINATION DIRECTORY COMPILING THE BINARIES * UNIX COMMAND INFORMATION PAGE SETTING FLAG VALUES CONTROLLING PARAMETERS WITH FUNCTIONS RUNNING COMMANDS WITH SHELL SCRIPTS * ROUTINES: SHORT DESCRIPTIONS BASIC ROUTINES PLAINPV TWARP AMPLITUDE WARPING NOISEFILTER COMPANDER SPECTWARPER ADDITIVE SYNTHESIS HARMONIZER INHARMONATOR CHORDMAPPER SUBTRACTIVE SYNTHESIS FILTER FREQRESPONSE CHORDRESPONSEMAKER FILTRESPONSEMAKER PVANALYSIS TVFILTER CONVOLVER RESONANCE/REVERB RING RINGFILTER RINGTVFILTER NONLINEAR FREQUENCY DEVIATION FILTDEVIATOR TVFILTDEVIATOR FEATURE EXTRACTION ENVELOPE CENTROID FLUXOID PITCHTRACKER CONTROL FUNCTION PROCESSING SHAPE ---------------------------------------------------------------------------- * TERMS AND COMMON FEATURES OVERLAP/ADD METHOD VS. OSCILLATOR BANK METHOD AND RESYNTHESIS THRESHOLDS SOURCE MULTIPLE CHANNELS PLAYBACK DURING PROCESSING INPUT SOUND FILE OUTPUT SOUND FILE FLOATING-POINT AMPLITUDE RESCALING OUTPUT STATISTICS FREQIUENCY RESPONSE TERMINAL OUTPUT ANALYSIS FILES DECIBELS LOW/HI SHELF EQUALIZATION WARP INDEX PITCH TRANSPOSITION FREQUENCY SHIFT ENVELOPE RESPONSE TIME RING DECAY TIME FFT SIZE WINDOW SIZE WINDOW TYPE FRAMES PER SECOND TIME EXPANSION/CONTRACTION BEGIN/END TIMES GAIN FILTERING: SOURCE SIGNAL LEVEL TRANSPOSITION/SHIFT APPLICATION FLAG FILTER TYPES: PASS OR REJECT RESPONSE FUNCTION SMOOTHING ANALYSIS DATA ACCESS MODE CONVOLVER PANPOT FREQUENCY RESPONSE ACCUMULATION METHOD RING ROUTINES: FILTER PLACEMENT COMPRESSION AND EXPANSION UTILITIES FILE CONVERSION: aiffs, aiffd, nexts, nextd, nextfloats FUNCTION VIEWING: showme, showspect USING THE SHELL SCRIPTS SAMPLE SCRIPT: S.PLAINPV SHELL SCRIPT OUTPUT SOUND FILE CONVERSION GEN FUNCTION CONTROL OF PARAMETERS SAMPLE OUTPUT: S.PLAINPV ---------------------------------------------------------------------------- * INTRODUCTION PVC is a collection of phase vocoder signal processing routines and accompanying shell scripts for use in the transformation and manipulation of sounds. It is written in C and designed to be used in a UNIX environment. It has come about as a result of my path of education and research into phase vocoder technology. It follows in the spirit of the work by Eric Lyon (out of which PVC is built) and Chris Penrose whose particular dsp research springs from the coding and tutorial work of F.R. Moore and Marl Dolson. Moore's book, Elements of Computer Music, published by Prentice Hall, is therefore a great resource for making sense of the phase vocoder engine which I am unable to go into here. Curtis Road's book, The Computer Music Tutorial, published by MIT Press, has sections on the phase vocoder as well; these may better introduce the beginner to the practical concerns of this technology. Short of the explanations these sources provide, I have attempted to offer below some explanations, particularly as needed for control of the parameters in these routines. A manual and tutorial would be great to have; unfortunately time has not yet made it so. These routines reflect my need for tools which can perform different spectral resynthesis tasks; both simple and experimental. Their refinement has advanced with my growing skills and curiosity, which I expect will continue as long as I have questions about sound. Most of these routines can be viewed in terms of traditional additive or subtractive synthesis tasks, coming about as they did from the desire for greater finesse and control of these two basic types of synthesis. While the speculative nature of some give them an idiosyncractic character, most should, with practice, reveal the transparency of their names if not the role they can play in the shaping of sound. All require a good ear tuned towards sound and idea as none of these routines are automatic, although many hold great potential for the diligent. This 3.0 release contains only those routines which I think are stable, useful and moderately transparent. Some earlier versions have been omitted, replaced or consolidated into newer routines. For example, compander remains, but the ideas behind bandamp have ripened into spectwarper, a remarkable "super companding" tool for windowing amplitude, and balancing the resonance/noise-residues of a sound. The harmonic tone reorganizer, chordmapper, has continued to grow in its controls (however arcane), offering increasingly subtle ways to reorganize harmonic spectra. The noisefilter routine is now very good, having become a PVC first encounter routine for many whose noisy lives cross my path. Tvfiltdeviator now joins the arcane but novel filtdeviator routine. In addition, I have added a set of feature analysis routines (pitchtracker, centroid, envelope, fluxoid); which should be useful in generating function files to control different synthesis strategies. There are other, more experimental routines (some actually appeared in 2.0) which are still proving themselves; in time they will appear or reappear. As with 2.0, floating-point files (combined with a rescale feature) continue to be readable and writable. Someday I will deal with AIFF headers (although they do not offer floating-point values), but not for now. Paul Koonce koonce@music.princeton.edu RETURN TO INDEX ---------------------------------------------------------------------------- INSTALLATION The PVC package contains both my PVC routines, and the CMUSIC gen functions written by F.R. Moore (each in a separate directory). I have included some SGI shell script utilities as well, for easily changing sound file formats. Moore's standalone gen functions are useful tools for creating function files that can then be used in the time-varying control of parameters. The gen functions included are cspline, gen0, gen1, gen2, gen3, gen4, gen5, gen6, and genraw; a one-line summary can be obtained by running the command without any arguments. A detailed explanation of each can be found in the appendix of Moore's Elements of Computer Music. You can compile and install the PVC and gen function routines separately or together following unpacking and setting of the destination directory. 1) Unpack: First move PVC.tar.gz to the directory of your choice. Unzip it with gunzip. gunzip PVC.tar.gz Then, unarchive it with tar. tar xvf PVC.tar This will produce a PVC directory in which you will find several other directories. 2) Set Destination Directory: Next you will need to set the destination directory in the Makefile located in the /PVC directory. This is the master makefile for both the PVC and gen routines. Change the directory specified by: DESTDIR = /You_must_set_the_desination_directory to the directory in which you want the routines installed. 3) Compile: To compile and install the PVC and gen function routines both, type: make which if successful should be followed with: make install To compile and install only the PVC routines, type the following. make PVC make install And to compile and install only the gen routines, type the following. make GEN make install In all cases the make install moves the compiled routines from the /PVC/bin directory to your specified destination directory. (The handful of utilities in the UTILITIES directory are copied as well.) If the destination directory is in your .cshrc path, and you have sourced your shell file, as in: source .cshrc you should be able to type any of the routines and see their flag information page. Try typing: plainpv for example. RETURN TO INDEX ---------------------------------------------------------------------------- UNIX COMMAND-LINE FORMAT The routines are UNIX, command-line routines in the form of: routine [flags] input_soundfile output_soundfile At present the only soundfile formats excepted (both input and output) are NEXT/SUN formant files in either 16-bit short samples, or 32-bit floating-point samples. In PVC, the float form has a required rescale function, which is the whole reason for using floats in this case. All processing is done in floats. Control of each routine's parameters is done through flags, as in: plainpv -N1024 -p12 input.snd output.snd In most cases, the parameter flag inputs allow you to specify either a constant (i.e. -p12 ) or a function file (i.e. -p/tmp/pitch_change ); function files give you time-varying control over a parameter (see below). In most cases, parameters are initialized to default values. RETURN TO INDEX INFORMATION PAGE Information about any routine can be seen by typing the name of the routine without any arguments/files. Typing: plainpv produces the following information about plainpv. plainpv: generic phase vocoder with dynamic controls plainpv [flags] [input file (16-bit shorts)] [output file (optional)] (values in brackets denote defaults) N: FFT length (must be a power of 2) [1024] M: window size in samples (must be a power of 2) [2*FFT] (0 will automatically set window to 2*FFT size or larger) w: window type: 0 = hamming, 1 = rectangular 2 = Blackman, 3 = Bartlett triangular [0.] 4-12 = Kaiser windows for alpha = 4-12, respectively (representative sidelobe levels for alpha: 4 = -30dB, 8 = -58 dB, 12 = -90 dB) D: analysis frames per second [200] I: time expansion/contraction factor [1.] (duration = duration * factor, 1. = original time) P: pitch transposition in semitones (func) [0] a: frequency shift factor (bin frequency adder, before -P )(func) [0.] b: begin time in seconds [0.] e: end time in seconds ( 0. = end of file) [0.] C: resynthesis channel (1 -> ?) (0 = all) [0] SHELF EQ:(post transpose/shift) H: SHELF EQ: Low shelf gain in dB (func) [0.] X: SHELF EQ: High shelf gain in dB (func) [0.] m: SHELF EQ: Low shelf frequency in Hz (func) [200.] R: SHELF EQ: High shelf frequency in Hz (func) [2000.] W: warp index for reshaping magnitude response (func) [0.] Values > 0 expand the dynamic range, values < 0 compress the dynamic range. A: gain in decibels (func) [0.] l: envelope attack time (func) [0.] L: envelope release time (func) [0.] T: BRICKWALL FILTER TYPE: 0 = bandpass, not 0 = band reject [0] f: frequency window: low boundary (before -P and -a) (in Hz) [0.] F: frequency window: high boundary (before -P and -a)(in Hz) [Nyquist frequency] p: amplitude reports print mode: 0 = off, 1 = on [0] i: time interval between amplitude reports [.25] _: OUTPUT FORMAT: 0 = taken from input file 1 = 16-bit integer, 2 = 32-bit floats [0] =: PEAK RESCALE LEVEL (float output only) 0 to -96 dB Set to 1 to rescale to level of input file. [ 1 ] TERMINAL DISPLAY AND GRAPH FILE OUTPUT n: number of frames [0] u: low bin frequency [-1] U: high bin frequency (-1 = nyquist) [Nyquist frequency] S: TERMINAL DISPLAY: display option [0] (0 = off, 1 = phase data, 2 = amp data, 3 = both) c: GRAPH FILE: WRITE ascii to FILE 0 = off, 1 = freq, 2 = decibels [0] 3 = decibels - waterfall plot (When on, this flag writes ascii point pairs (with time frame on x axis) for plotting with gnuplot.) d: TERMINAL DISPLAY FILE NAME for -c [./ascii.out] t: oscillator resynthesis threshold in decibels [ -96 ] RETURN TO INDEX SETTING FLAG VALUES If no output file is specifed, the name pv.out.snd will be used in the local directory. The bracketed values at the end of each parameter represent the default value; it can be changed by specifying the flag letter preceeded by a minus sign and followed by the new value, with no spaces on either side. For example, the following: plainpv -N2048 inputfile outputfile would change the FFT size to 2048. Some flags require files rather than constants. For these, simply supply the full pathname of the needed file as in: twarp -F/here/there/everywhere/analysis_file which supplies twarp with the necessary analysis file. RETURN TO INDEX CONTROLLING PARAMETERS WITH FUNCTIONS: Parameters which have the word (func) on the info page just before the default as in: W: warp index for reshaping magnitude response (func) [0.] can be controlled dynamically. This is done by providing a full pathname file in place of the constant. The file is assumed to be a headerless series of values representing how the parameter will evolve as a function of time. The values may be either 32-bit floating-point values, or ASCII numbers, arranged one-per-line (the routine deciphers which it is). The function file can have any number of values as the series is fitted to the specified duration, linearly interpolated to produce the values inbetween. Function files in 32-bit floating-point form can be created with the CMUSIC gen routines provided with this package. (See INSTALLATION.) RETURN TO INDEX RUNNING THE COMMANDS WITH SHELL SCRIPTS: While all routines can be run at the commandline, they are most easily run using the shell scripts found in the SCRIPTS directory. These scripts are useful for saving and managing the parameters; in many ways they are a poor-man's GUI. All scripts contain a top section for setting variables, and a bottom section where those variables are placed into the commandline flag structure and run. Some scripts perform two routines such as a short analysis routine followed by the main synthesis routine, while others run just one routine. The variables for the routines in a shell are set in the top section. Take note that shell script variable assignments do not allow for spaces. The numerous parameters, which in some routines run as high as 53, make these scripts a necessity. They will be your friend if you take care to leave the bottom part alone, and don't corrupt your variable names. Someday I will make a better way to interface with the routines; for now this is the way it is. To run the scripts, simply type the name of the file (or appropriate pathname when running from outside the directory in which it resides). For example: S.plainpv (If this does not work, check to make sure the script is executable, and that the first line contains #!/bin/sh). If things are working correctly, the resynthesis should begin which you will know from the output streaming to the terminal. (See the explanation below about using shell scripts. ) RETURN TO INDEX ---------------------------------------------------------------------------- ROUTINES: SHORT DESCRIPTIONS Below is a listing of the routines contained in this release along with a short description of what each does. BASIC ROUTINES PLAINPV Plainpv is a basic phase vocoder with control of pitch transposition, frequency shift, time scale, amplitude warp and low/high shelf equalization. It also has some nice controls for looking at the data produced by the phase vocoder. Run this routine with S.plainpv. If you are interested in looking and/or graphing segments of the data, run the S.plainpv_with_printout_and_graph_files script instead and use the showspect utility. You will need to have gnuplot installed. RETURN TO INDEX TWARP Twarp is like plainpv except that it works from an analysis file rather than a soundfile. This allows you to move forwards/backwards through time according to a time function file. Use pvanalysis through the script S.pvanalysis to make the analysis file; then run the script S.twarp. RETURN TO INDEX NOISEFILTER Noisefilter filters out the noise in a sound by subtracting out a frequency response. The frequency response is analyzed from a short segment in the file where noise alone is found. For sounds that do not have segments of isolated noise, there is a threshold mode. Run with S.noisefilter. RETURN TO INDEX AMPLITUDE WARPING COMPANDER Compander is a classic compressor/expander. What is different here is the use of a peaks response file. The peaks response file is a frequency response, analyzed from a segment of the sound, that is taken to represent the peak bin amplitudes for the sound. Each frequency bin of the peaks frequency response functions as the 0 dB reference point for that frequency bin. The amplitude of the frequency bin is companded relative to this reference. The entire analysis/companding process (including the analysis segment using freqresponse) can be run using the script S.compander. RETURN TO INDEX SPECTWARPER Spectwarper uses an expanded compansion scheme to highlight either a sound's stronger, resonant components or its weaker noise/residual components. Spectwarper is fairly similiar to compander; however, unlike compander which compands bins against the constant peak of an input response file, spectwarper compands bins using a peak drawn (in the current frame) from a narrow frequency band centered around the value being processed. This causes the compansion or "warping' of the amplitudes to accentuate(expansion) or mask(compression) formants located within the frequency bands; the result being the noise/pitch highlighting mentioned earlier. Part of this comes from the treatment of compression in Spectwaper. Unlike compander which only reduces the amplitude above the threshold when compressing, spectwarper reduces the amplitude of the entire range, becoming, in effect, an expander of the strongest amplitudes that expands them (when the compression level is severe) out of the picture. Spectwarper is one of my favorite routines of late simply because it provides such a simple and powerful control over the noise and pitch characteristics of a sound. I love it, and use it often. Run this routine with S.spectwarper. RETURN TO INDEX ADDITIVE SYNTHESIS -- HARMONIZER, CHORDMAPPER, AND INHARMONATOR: These routines all allow for a kind additive synthesis based on the remapping of phase vocoder data according to some model. Each requires an ascii data file specifying how phase vocoder information will be replicated or mapped. This mapping is constant for the run of the routine. HARMONIZER Harmonizer works much like a commercial harmonizer in that it allows you to create harmony against the source by adding a transposed copy of it. Here the concept is extended by allowing for multiple harmonizations, each taken from a different band of frequencies, output with seperate gain. Run this using the script S.harmonizer. CHORDMAPPER Chordmapper lets you specify how harmonically related groups of partials will be replicated or mapped to produce chords. An input data file organizes the remapping into tone groups, and includes ways to tune or neutralize the frequency deviations of partials. Time-varying control of these features is available as well. You can use this routine to build up thick chords from single tones, or to delicately reorganize a harmonic spectrum. Run this using the script S.chordmapper. INHARMONATOR Inharmonator lets you specify how the partials of one fundamental will be remapped or deviated. While the more recent and developed routine chordmapper is probably better for this task, I have decided to leave this routine in for now. (Think chordmapper.) RETURN TO INDEX SUBTRACTIVE SYNTHESIS FILTER Filter is a very useful routine for filtering a sound by a frequency response. Filtering is achieved by first creating the frequency response through either synthesis or analysis, followed by filtering with filter. Synthestic responses are created using either chordresponsemaker (which synthesizes a spectrum as a collection of harmonic tones), or filtresponsemaker (which synthesizes a frequency response using lines and breakpoints). Analyzed responses can be made with freqresponse (which analyzes a sound file segment and constructs a response representing the peak or average amplitudes). Once made, the magnitudes of the FFT response are multiplied against the time varying magnitudes of the input sound's FFT. Filter allows time-varying control of the response shape (warp), transposition/shift, compansion, smoothing, and source/filter mix, making this a very useful tool for quickly manipulating the spectral characteristics of a sound according to your synthetic or analytic goals. The synthetic forms can be run with the scripts S.filter_with_chord_synthesis or S.filter_with_breakpoint_synthesis; the analysis-based form with S.filter_with_analysis. The analytic form is a powerful tool for bringing the color of one sound into the realm of another. RETURN TO INDEX FREQRESPONSE Freqresponse is a routine used by several others to prepare a spectrum for use with routines that filter, compress or limit. The response can be normalized or not depending on the needs of the routine which will use the response. CHORDRESPONSEMAKER Chordresponsemaker is a routine that uses a collection of harmonic tones, variable in size, to create a synthetic frequency response. It is found in various filtering scripts. FILTRESPONSEMAKER Filtresponsemaker is a routine that uses breakpoints and straight lines to create a synthetic frequency response. It is found in various filtering scripts. PVANALYSIS Pvanalysis is the time varying form of freqresponse that creates a phase vocoder analysis for use by other routines. The routines which require pvanalysis files are twarp, convolver, tvfilter, ringtvfilter, and tvfiltdeviator. Run this using the script S.pvanalysis. TVFILTER Tvfilter is the time-varying (tv) form of filter. Tvfilter uses a pvanalysis file to change the magnitudes of the input sound file. As it is with filter, tvfilter multiplies the magnitudes of the analysis FFT against the magnitudes of the input sound's FFT, while preserving the frequency/phase characteristics of the input sound. Preserving the phase of the input sound file results in a cross-synthesis which sounds like the input sound file covered or suppressed by the shadow of the analysis file. Like filter, tvfilter offers a variety of controls for manipulating the filter characteristic. The use of a phase vocoder analysis to represent the filter characteristic also makes possible the temporal control of the filter file (i.e. backwards/forwards control) as found with twarp. Run this using the script S.tvfilter. RETURN TO INDEX CONVOLVER In its setup and controll, convolver is the same as tvfilter. It's processing, however, is different. In tvfilter filtering is produced by multiplying the magnitudes from the polar form of the two analyses; leaving the phases (or frequencies) of the source intact while modifying the amplitudes of those frequencies. Convolver goes a bit further by multiplying the two analyses in their Cartesian forms. This produces an intersection of the two spectra. Unlike tvfilter which produces a shadowlike intersection, shadowing the analysis file characteristic onto the input sound file, convolver creates a true spectral intersection, allowing only that which is common to both sounds to be heard. The effect is a sound which is somewhat garbled as it outputs the more intermittently common spectral components of the two. The form of the multiplication in convolver does not allow some of the filter transposition controls associated with tvfilter. There is however a convolution panpot which offers control of the mix between the convolution and source sounds. Run this using the script S.convolver. RETURN TO INDEX RESONANCE/REVERB RING Ring uses the phase vocoder to create an all-pass resonator. It works by structuring the FFT resynthesis as a bank of feedback filters that feed back the sinusoid of each bin in a strength proportional to the amplitude of that bin (after adjustment by global feedback controls). This allows the sound to "ring" in a way something like reverb or comb filter resonance. The difference from comb filtering is that with ring spectral resonance is created not through a collection of comb filters selected for their ability to resonate various pulse wave spectra, but rather, through an array of feedback filters (sized by the FFT) that resonate a sine wave spectrum while dynamically tuning their feedback frequencies to the frequencies of the input sound. In short, it creates a kind of "self resonance". Ring is a nice way of increasing the resonant pitch characteristics of a sound, although it has its weaknesses. Ring works best with larger FFT sizes as it is attempting to synthesize or accentuate the more pitched/harmonic characteristics of the sound; this is something larger FFTs, with their increased frequency resolution, handle better. Use of the Kaiser window, with its low sidelobe amplitudes, helps as well. In adition, there is a threshold for preventing the noise features of a sound from being resonated, plus an EQ which can be positioned to filter either the source input to the feedback loop, or the feedback return. Run this using the script S.ring. RINGFILTER Ringfilter marries filter with ring by allowing a frequency response to be imposed on the resonance created with ring. Ringfilter begins to look more like multiple-delay, comb filter resonance since the static frequency response selects which frequencies will feed back. What is unique here is that the frequency response can come from an analysis, allowing the input sound to be resonated by the average spectral characteristic of another sound. A synthesized frequency response can be used as well. Like the EQ in ring, the filter in ringfilter can be positioned to either filter the source input to the feedback loop, or the feedback return where it will have the effect of introducing the filter characteristic more slowly through the resulting variable rates of decay. Run ringfilter with S.ringfilter_with_chord_synthesis to create a synthetic frequency repsonse, and with S.ringfilter_with_analysis for an analyzed frequency response. RINGTVFILTER Ringtvfilter is to ringfilter what tvfilter is to filter; that is, it makes the filter in ringfilter time-varying. This is a sophisticated idea, that is, time-varying filtering of the resonance of a time-varying sound. The best characterization would be to say that Ringtvfilter imprints the shadow of one sound onto the reverb of another. Ringtvfilter requires some thought and finese in order to separate and articulate the evolutions of the source, resonance, and filter. The best results are created using dynamic, high-profiled source sounds, rich with transient noise; and more constant, pitch/harmonic sounds for the time-varying filter. Like tvfilter, ringtvfilter requires an analysis file. Run this routine using S.ringtvfilter. RETURN TO INDEX NONLINEAR FREQUENCY DEVIATION FILTDEVIATOR The idea behind filtdeviator is to use a frequency response function to not only filter a sound (as with filter), but to to create a topology of frequency deviation working in correlation with the filter. Consequently, filtdeviator is filter with added parameters for specifying how the filter frequency response function will be mapped into the deviation of frequency. The added parameters set the base and peak deviation for how the response will be mapped into both pitch transposition and frequency shift, and how the function will be warped within the range set by these limits. Their is also a master (0-1) deviation control for globally controlling the deviation. All the controls of filtdeviator allow you to dynamically vary the presence and effect of amplitude filtering and frequency deviation, making filtdeviator an interesting routine for exploring the way filters can be used to impede/transform the resonant signature of a sound. Using small amounts of frequency deviation, with no amplitude filtering, and a sweeping transposition of the filter will produce an effect something akin to the commercial guitar phase shifter; larger amounts of deviation take it into another place entirely. Adding the correlated amplitude filtering conceals the deviation more (positioning it more at the edges of formants), producing a sound something like the floppy resonant behavior of slide whistles. The scripts to run filtdeviator -- S.filtdeviator_with_ chord_synthesis and S.filtdeviator_with_analysis -- are designed with frequency response synthesis/analysis sections like those for filter and ringfilter. Run this routine using either S.filtdeviator_with_analysis or S.filtdeviator_with_chord_synthesis. TVFILTDEVIATOR Tvfiltdeviator is to filtdeviator what tvfilter is to filter; i.e. it uses a time-varying filter response in place of the constant one. This routine blows the lid off of what was unusual about tvfiltdeviator. It's great for making wacky sounds out of ones with nice, fixed harmonies. The best use is to use it to deviate itself. Try taking something like a harpsichord or guitar (pitched stuff with decay) and do an analysis of the sound with pvanalysis. Then use the analysis to deviate the same sound. What happens is the strength of each of the sound's components becomes a control over the frequency deviation of that component, one that causes the sound to go "sproing" whenever it has any amplitude. Makes tonal music sound really broken. Run this routine with tvfiltdeviator. RETURN TO INDEX FEATURE EXTRACTION ENVELOPE Envelope is a routine for tracking the amplitude envelope of a sound. Output can be ASCII, floats or a NeXT soundfile. Selecting floats or ASCII will produce a file suitable for use in the control of a parameter. Run this routine with S.envelope. CENTROID Centroid is a routine for tracking the centroid of a sound. The centroid is the average of all the frequencies weighted by their amplitudes. It essentially gives you a kind of center frequency value for your spectrum. The analysis can be restricted to a band of frequencies, allowing the centroid to track a particular frequency component (although pitchtracker can do this as well). Selecting floats or ASCII will produce a file suitable for use in the control of a parameter. Run this routine with S.centroid. FLUXOID Fluxoid is a routine for tracking the average frequency change of a sound. The average can be weighted (best) or not by the amplitudes. Selecting floats or ASCII will produce a file suitable for use in the control of a parameter. Run this routine with S.fluxoid. PITCHTRACKER Pitchtracker is a routine for tracking the fundamental pitch trajectory of a sound. It is an experimental routine that works, I believe, but forever has its quirks. Three detection methods are available for following the 1) fundamental of the harmonic collection, 2) the strongest formant, or 3) a band-limited centroid. Different output formats let you see, hear and eventually use the fruits of your pitch tracking. Run this routine with S.pitchtracker. RETURN TO INDEX CONTROL FUNCTION PROCESSING RESHAPE Reshape is a routine for transforming function streams to meet the needs of different parameters. It takes a headerless float or ASCII function file as input and outputs a headerless stream of float or ASCII values. With the appropriate flags, it can be used to limit, resample, translate, warp, expand, shrink, invert, quantize, and lowpass filter the input values. The output can be translated into different amp or pitch units depending on your needs. Run reshape at the command line. RETURN TO INDEX ---------------------------------------------------------------------------- TERMS AND COMMON FEATURES Below are various terms, parameters, or ways of doing things which are common to many of the routines. OVERLAP/ADD VS. OSCILLATOR BANK METHODS AND RESYNTHESIS THRESHOLDS: The phase vocoder resynthesizes the signal using one of two methods, depending on the type of changes made to the FFT. If the changes are only to the magnitudes (amplitudes), then the faster overlap/add method is used. If however changes in frequency are made, then the FFT integrity is compromised, necessitating use of the oscillator bank method in which each bin is synthesized as a sine wave changing in frequency and amplitude. This method is slower, although a resynthesis threshold is available which can be used to increase the computation speed by turning off bins whose amplitude falls below the threshold. A threshold of -60dB is appropriate, although safety warrants using a lower threshold if the spectrum is thin and its decays exposed; use your ear. SOURCE The source sound is the original input sound. Some routines allow for the mix of the processed sound with the original source sound. MULTIPLE CHANNELS All routines allow both monophonic and multi-channel input files to be processed. With multi-channelled files, you can either select one channel and produce a monophonic output file, or process all the channels. Channels are numbered beginning with 1. Processing of multi-channelled files is done one channel at a time beginning with channel 1, with zeros written to channels which have yet to be processed. Prcessing one channel at a time requires less memory and allows you to audition the output sooner than if you did all channels at once. INPUT SOUND FILE The input sound file must be a NeXT/Sun format sound file in either 16-bit short or 32-bit floating point format. It may have one or more channels. OUTPUT SOUND FILE The output sound file is written as a NeXT/Sun format sound file in either 16-bit short or 32-bit floating point format, of one or more channels. The channels are processed one at a time beginning with the first channel. The first pass writes zeros in the channels yet to be processed, replacing them when processing proceeds to those channels. RETURN TO INDEX FLOATING-POINT AMPLITUDE RESCALING Selection of the floating-point, output-file format invokes an amplitude rescaling feature. Once processing is complete, a second pass through the sound file is made to rescale the values to the decibel level specified. A dB rescale level of 1 causes rescaling to the level of the original input file. PLAYBACK DURING PROCESSING The header of the output soundfile is updated often, so if your peak amplitude has not exceeded the 16-bit limit of the converters, you may play the float or integer output file before processing has completed. OUTPUT STATISTICS Two flags are provided for controlling the output amplitude statistics; one turns the statistics on or off, and the other sets how often they will be reported. The statistics provide the peak output level in amplitude and decibels. Wth integer format ouput files, ouput values exceeding the normalized peak amplitude of 1. (0 dB) are clipped to a value of 1.0, and the statistics placed in clip mode; in clip mode reports are made only for frames where clipping occurs. The peak amplitude, its time, and the number of clipped samples are reported at the end of processing. With floating-point format output files, ouput values exceeding the normalized peak amplitude of 1. are not clipped since they will be rescaled in the second pass; output statistics proceed normally throughout. The levels before and after rescaling are reported at the end of processing. RETURN TO INDEX FREQUENCY RESPONSE TERMINAL OUTPUT In many filtering or companding routines, a crude terminal print of the frequency response is a available. A flag sets the high cutoff frequency for this output; a value of 0 (0 Hz) turns printing off. ANALYSIS FILES Analysis files are binary, 32-bit floating-point files written by pvanalysis, containing frames of FFT analysis data for one or more channels. Analysis file data is preceeded by a header containing information about the analysis. Analysis files are much larger than the sound files they represent, and increase in proportion to the FFT size used. As such, files can become very large, so it is advisable to only make them when needed unless you have disk space to spare. RETURN TO INDEX DECIBELS Amplitude is always handled in decibel units. The greatest magnitude of the 16-bit short integer is equated with an amplitude of 1.0 or 0 dB. 0 dB functions as unity gain, and the peak amplitude in issues of compression, expansion, and amplitude windowing. A change of +/- 6 dB represents a doubling or halving of the amplitude. Increments of 10 dB are loosely associated with one change in dynamic level. 16-bit shorts allow for a 96 dB dynamic range. Take care not to loose signal level as a consequence of processing since quantization noise will emerge when you attempt to regain your signal level by amplifying the integer sound file. LOW/HI SHELF EQUALIZATION Equalization has been provided at various points in routines to allow for the needed adjustment of spectra. The EQ consists of low and hi shelf segments, whose width is adjusted through control of the shelf breakpoint frequency. The region between the shelf segments is represented by a linear decibel gradient between the decibel levels of the two shelves. Some routines implement the EQ before pitch changes, others after. EQ placed before pitch changes (pre-transpose/shift) will cause the EQ to be transposed with the pitch changes, whereas afterwards (post-transpose/shift) will keep them fixed as shifts and transpositions occur. WARP INDEX Many of the routines employ the principle of warping in which a distribution of values is transformed by an identity function. In these places an exponential function is employed to remap a 0-1 range of values into a new orientation that preserves the minima (0) and maxima (1) while bringing the distribution closer to either extreme as a result of the curvature of the exponential function selected. The curvature of the exponential function is selected through a warp index. Specifically, warp index w will reorient the input x through the function below (^ = exponentiation). y = (1. - (e^(x * w))) / (1. - (e^w)) In this function, the warp index of 0 produces a linear function and an untransformed output. Positive warp index values of increasing magnitude produce curves of increasing concavity (increasing slope) that draw values towards the 0-valued minima, and reduce the function integral. Negative values do the opposite, drawing values towards the maxima of 1, increasing the integral. The practical use of this mechanism is found in various places. One such place is the reshaping of the frequency response distribution characteristics. In this, positive warp indeces cause the peaks of the response to be accentuated while the weaker frequencies are expanded out (i.e. pushed towards 0). Negative values have the opposite effect as they compress the dynamic range of the response and raise the relative level of the weaker noise components. Another place where warp applies is in the remapping of FFT amplitudes through the spectrum warpshape. In this, the sucessive FFT frames have their amplitudes remapped by the identity function, similiarly expanding or compressing the dynamic range depending upon the warp specified; 0 (linear warp function) leaves the amplitudes unchanged. RETURN TO INDEX PITCH TRANSPOSITION With the pitch transposition control, a constant or function value is multiplied against all bin frequncies. This is classic transposition, here specified in semitones of transposition (12 semitones equals an octave). Conversion is made to produce the appropriate frequency multiplier. FREQUENCY SHIFT With the frequency shift control, a constant or function value is added to all the bin frequencies to produce a nonlinear pitch domain translation of the spectrum. Frequency shift is related to things like ring modulation and their similarly nonlinear shifts of pitch characteristics. Use this to create small distortions of the harmonic integrity of a sound. RETURN TO INDEX ENVELOPE RESPONSE TIME The rate at which amplitude changes are allowed to occur effects how smooth spectral evolutions will be. To control this, many routines contain attack and decay response times controls: once translated these controls manipulate the coefficients of the following filter. y(n) = (1. - A) * x(n) + A * y(n) The filter is a lowpass designed to increasingly smooth the sudden changes in a signal as the value of the coefficient, A, is increased. Its control is through the response time parameter which is the time in seconds it takes a signal, shifting from one state to another, to decay to -60 dB of its former state. Response times are transformed to create the necessary coefficients for the selected frame rate. The response time is separated into attack and decay; this allows seperate control of the smoothing of the signal depending upon whether it is increasing or decreasing in amplitude. Short attack/decay response times can be used in places where dynamic processing induces garble or even pops. You can use longer response times to generally smooth or blur the onset/offset of sound components, particularly if the response controls are being applied to a time-varying filter. When applied to amplitudes, longer decay respsonse-times do not sound good, for in their delay of the decay, they end up amplifying the residual noise of a sound. RING DECAY TIME Decay time is an issue in the feedback of the ring routines. Like response time, it is the time it takes the signal to decay to -60dB of its former state, or better, the time it takes the reverb to decay to -60dB. RETURN TO INDEX FFT SIZE: The FFT size must be a power of 2. Larger FFT sizes resolve frequencies better but transient behavior more poorly. Choose your FFT size according to the sound you are working with. A size of 1024 or 2048 works well in most cases. WINDOW SIZE The window size is a less opaque parameter; like the FFT, it must be a power of 2. Windows which are twice the size of the FFT work well. Larger window sizes may resolve frequencies better. Specifying 0 for the window size will automatically set the window to twice the FFT size, a feature I have always used. WINDOW TYPE The FFT and inverse FFT are computed using a window. Like the FFT size, the shape of the window used can effect the quality of the analysis and resynthesis. (See F.R.Moore, Stieglitz, or Roads for further explanation.) A variety of windows are available including: Hamming, Rectangular, Blackman, Triangular, and Kaiser (in 8 different forms as related to 8 different alpha values). Blackman (-w2) or Kaiser (-w8) are reccomended for most applications. In some unusual cases where transient behavior is being lost, consider using other windows such as the Rectangular, although take care to assure that it is not producing pops or a buzzy sound. RETURN TO INDEX FRAMES PER SECOND This controls how often the phase vocoder will perform an analysis on the signal. It is a translation of the classic decimation control which specifies how many samples to skip between analysis frames. More frames increases the resolution of time but decrease speed. 200 frames per second is a good reference point. If you expand time you should increase this proportionately to maintain about 200 or more frames per second. TIME EXPANSION/CONTRACTION Once the spectral modifications are made to the FFT analysis, an inverse FFT is invoked to produce the samples of a time-domain signal. The classic phase vocoder paradigm controls the number of samples through the interpolation value and its relation to the decimation. The arcane relationship of decimation and interpolation is here translated into the parameter of time expansion/contraction, allowing for the direct scaling of time. Use values greater than 1 to expand time, less than 1 contract it. RETURN TO INDEX BEGIN/END TIMES Processing may be performed on an entire file or a segment of it by specifying begin and end times. End times less than or equal to 0 default to the end of the input file. GAIN: The output and other components can be gained. 0 dB represents unity gain, no change. See decibels. FILTERING: SOURCE SIGNAL LEVEL The mix of source and filtered sounds in the filter routines can be controlled by the source decibels floor. This value, taken from the -96 to 0 dB range, specifies the level of the source signal. The filtered signal level is equal to (1 - source amplitude floor). Consequently, the source level functions as a floor over which lies the filtered signal. A source floor of 0 dB would neutralize filtering since there would be no filter range above the floor, a floor of -96 dB would produce the full effect of the filter. RETURN TO INDEX TRANSPOSITION/SHIFT APPLICATION FLAG Filter routines which allow for transposition and frequency shifting of both filter and source have a flag which specifies whether transposition/shift should be applied before or after filtering. If it is applied before, the pitch transposition trajectory will evolve independent of the filter's trajectory of transposition. If it is applied after, then the pitch transposition trajectory will be added to the filter transposition trajectory, causing the filter to move in parallel with the pitch transposition movements plus any movements the filter transposition parameter adds. FILTER TYPES: PASS OR REJECT Filters can be toggled to use frequency responses in pass or rejection mode. In pass mode, the response's stronger magnitudes are used to pass source through the filter; in rejection mode, to impede or reject components. In rejection mode, the response is created by inversion in the decibel range, not amplitude. In time-varying filtering (tvfilter), rejection can be in mode 1 in which the response is inverted against a constant 0 dB peak, or in mode 2 in which the response is inverted against the current analysis frame's peak amp. Spectral warping is always applied after the response has been transformed by rejection. RETURN TO INDEX RESPONSE FUNCTION SMOOTHING Many routines which use frequency response files to filter or warp amplitudes have a control which allows the response to be smoothed. The smoothing is produced by replacing the magnitude of a frequency bin with an average taken from a band centered around that bin. The degree of smoothing is controlled through manipulation of a bandwidth value, specified in octave units. Larger bandwidths produce greater degrees of smoothness, 0 turns smoothing off. ANALYSIS DATA: ACCESS MODES Routines which use analysis data made with pvanalysis -- twarp, convolver, tvfilter, ringtvfilter, and tvfiltdeviator) -- access data the same; using the time-point, rate, and data window boundary parameters, set to function in either rate or explicit mode. In rate mode, the rate determines the speed of movement through a data file; the time-point sets the starting position. The rate may be positive (forward in time) or negative (backwards in time), and vary according to a function. Explicit mode uses the time point parameter to specify exactly where the analysis data should come from (units here are in the time of the analyzed sound). (Explicit mode does not use the rate control, and makes sense only if the time-point is controlled with a function.) Both rate and explicit modes abide by the upper and lower data window boundaries which delimit the data range. When the time-pointer moves beyond the specified upper and lower time boundaries, it re-enters the window from the other end, making the window into a circular/modular structure. The boundaries can be controlled with functions as well, giving this mode an expressive dimension far surpassing the time expansion/contraction parameter. There is also an auto-stop feature that, when turned on, causes processing to stop when it reaches the end of the analysis. RETURN TO INDEX CONVOLVER PANPOT The convolver routine has a unique panpot mechanism for controlling the mix of input sounds (A and B) with their convolution. The panpot is a crossfade mechanism that uses a -1 to 1 control range to accentaute either sound A, B or their convolution. A value of -1 produces an output consisting entirely of sound A, a value of 1, sound B. The 0 between these extremes produces the convolution of A and B. Values between these points produce a crossfade mix of either A or B and the convolution. For example, a trajectory from -1 to 1 would crossfade from sound A into the convolution, and on to sound B. Separate gain controls for A, B and the convolution make it possible to tune the continuity of this trajectory. In addition, the presence or spread of the convolution into the crossfade range can be tuned with the domain warp controls. The domain warp reshapes the movement through the crossfade range, allowing you to create a more gradual approach from A or B into the convolution center. This is achieved through a simple nonlinearizing of the crossfade domain in warp index style. Increasingly positive domain warp values (specified independently for each side) transform the linear trajectory towards the convolution into a decellerating one, causing the subtle mix area around 0 to be expanded. Therefore, if you want to hear more convolution in your crossfade, increase the panpot domain warps. FREQUENCY RESPONSE ACCUMULATION METHOD Several of the response-producing routines have the option of accumulating the response by either peak or average means. Whereas peak responses represent the record of a sound's thresholds (or synthesis specification's highest values), average responses represent the most common characteristics. If the sound you are analyzing has intermittent moments of sound whose peak characteristics you wish fully represented in the response, use the peak mode; otherwise use the average. RETURN TO INDEX RING ROUTINES: FILTER PLACEMENT Ringfilter and ringtvfilter use frequency response functions to filter the reverb. Two filtering modes are available in which either the source input to the feedback if filtered, or the feedback. When the response is used to filter the source input, it filters the signal before it enters the feedback mechanism, imposing its characteristic, from the start, on the feedback. However, when positioned to filter the feedback component, the appearance of the respsonse's spectral characteristic, in the reverb, appears gradually as the signal decays. In this mode, the time it takes the signal to decay into the response characteristic is controlled by an additional decay time associated with the filter. COMPRESSION AND EXPANSION Spectral compression and expansion play a role in many routines. Its implementation here is according to the traditional model that uses thresholds and magnitudes of compression/expansion to reduce or enlarge the dynamic range of a signal. With spectral compression, amplitudes that exceed the specified compression threshold are reduced by an amount determined by the decibels of compression (a multiplier of the bin's amplitude lying above the threshold). Expansion works in a similar fashion, except that it changes the amplitudes below, rather than above, the expansion threshold; this results in an expansion of the dynamic range as the bins falling below the threshold are made to cover a wider range. The term companding or compander is a merging of the two names, useful in situations where they are both available. While compander is the most obvious example of a routine using companding, traditional compression can be found in several other routines that involve filtering. It is not uncommon, in those routines, to reduce the dynamic range of an analyzed frequency response, particularly if it is time-varying, since the goal in filtering is more about color than dynamic range. In all routines that use some form of companding, the dynamic range of the unprocessed signal/response is assumed to lie between 0 and -96 dB; thresholds are chosen from within this range. The degree of compression or expansion, expressed in decibels, represents how much the signal lying beyond the threshold will be reduced. A value of -6 dB would halve the dynamic range above the threshold in compression, or double the range below the threshold in expansion. Compander applies compression for each frequency bin separately rather than as a macro gain change. It does this by using a frequency response file, created with freqresponse, to establish a unique, 0 dB point of reference for every bin; using its unique point of reference, every bin is compressed or expanded. RETURN TO INDEX ---------------------------------------------------------------------------- UTILITIES FILE CONVERSION: aiffs, aiffd, nexts, nextd, nextfloats The sound file conversion scripts: aiffs, aiffd, nexts, nextd, and nextfloats are shell scripts available for converting sound files back and forth between aiff and next formats, or from next to floats. They are all effectively SGI scripts since they use the SGI sound file format conversion utility, sfconvert. Aiffs and aiffd take next integer files and write new aiff files, nexts and nextd the opposite; in addition aiffs and aiffd can be used to write new aiff integer files converted from next float files. Nextfloats writes a new float file from a next integer file.The s or d following the aiff or next in the name stands for the action taken on the original file once the new file is made; the s saves the original file (i.e. does not delete it), the d causes it to be deleted. Multiple files may be converted with the same run of the command. Running the command without any input files will produce a description of the routine. FUNCTION VIEWING: showme, showspect Two graphing scripts are available for viewing functions and spectral data. You must have gnuplot installed on your computer to use them (Type gnuplot <CR> to see if you do). Showme is a simple script for viewing function files. Run without an input file for a description. Showme takes headerless floating-point or ASCII (give -a flag) function files and plots them. Showspect plots the file of FFT amplitude or frequency data produced by the plainpv script, S.plainpv_with_printout_and_graph_files. Showspect is useful for seeing a graphic representation of a very particular part of an analysis, it is not a substitute for a standard spectrogram application. RETURN TO INDEX ---------------------------------------------------------------------------- USING THE SHELL SCRIPTS SAMPLE SCRIPT: S.plainpv Below is a copy of the complete script for running plainpv which you can examine here to understand the basic structure of this shell script mechanism. In it you find a top section for variables and a bottom section for execution of the plainpv command. Set the variables of the top section with the appropriate files and constants; do not use spaces. To run the script, simply type it as a command as in the following. S.plainpv (See Running Commands with Shell Scripts) When the shell runs the script, it copies the value of the assigned variables into the respective flag positions and runs the command as if you had typed it at the prompt in a shell window. Each shell script is set up to print the command to the terminal just before running it. A sample of the output follows the script below. SHELL SCRIPT OUTPUT SOUND FILE CONVERSION The conversion routine, aiffd, has been added at the end of every script that writes an output sound file, as seen below. # aiffd $output_file ; To prevent it from unexpectedly converting the output of your sound file, it has been commented out with the # sign. If you would like your next integer or floating-point output files to be automatically converted to aiff sound file format, simply remove the # sign. (See file conversion.) GEN FUNCTION CONTROL OF PARAMETERS Any parameter whose flag on the routine's information page has the word (func) after it can be controlled by a function file. An easy way of doing this is to generate the file when the script is run. To do this, simply place the call to the gen function anywhere in the top section of the script; the script will then run the command and make the file needed by the shell variable. By the time the PVC routine is run in the bottom section, the file will be ready. The best place to locate your gen function calls is immediately following the variable assignment, as in the example below which sets the variable controlling the pitch transposition. pitch_transposition_in_semitones=/tmp/ptrans gen4 -L1000 0 -3 0 1 3 > /tmp/ptrans ; In this example, the variable, pitch_transposition_in_semitones, is set with the file name, /tmp/ptrans, which has in it the 1000 values output by the gen4 command. While the output of the gen4 command is in 32-bit floating point values, PVC allows them to be ASCII as well. So, it would be possible to replace the segment of text above with the following. pitch_transposition_in_semitones=/tmp/ptrans echo 0 1 2 2.5 5 10 > /tmp/ptrans ; Here the gen4 call has been replaced with a simple call to echo which places the ASCII values, 0 1 2 2.5 5 10, into the file /tmp/ptrans. When PVC looks at the file, it figures out that it is text rather floats. While the values in both cases are linearly interpolated by PVC, to create the continuous function needed by the routine, the gen4 case would be smoother since it has more values. Lines in shells can be continued onto new lines with the backslash, which comes in handy with gen functions. The above, for example, could be entered as: gen4 -L1000 \ \ 0 -3 0 \ \ 1 3 \ \ > /tmp/ptrans ; which would simplify our parsing of it. RETURN TO INDEX #!/bin/sh #****************************************************** #.................... PLAINPV ......................... #****************************************************** #******************** OUTPUT ************************** output_file=/S1/cm.mix.snd #...................................................... output_data_format=1 #( 0: Same as input file ) #( 1: integers ) #( 2: rescaled floats ) #................ RESCALE ............................. rescale_level_in_decibels=1 #(-96 to 0 dB. Set to 1 to rescale to peak of input file.) #******************** INPUT *************************** input_file=/S1/t.snd #........ BEGIN/END TIMES ............................. begintime=0 endtime=0 # (End time of 0 or less defaults to end of file.) #====================================================== #*** ANALYSIS PARAMETERS ****************************** FFT_length=1024 window_type=2 windowsize=0 # (0 sets windowsize to 2 * FFT (or larger)) frames_per_second=200 #====================================================== #*** RESYNTHESIS PARAMETERS *************************** #........... OUTPUT CHANNEL(S) ....................... output_channel=0 # (channels are numbered from 1 to the maximum.) # (0 = all channels) #.............OSCIL THRESHOLD ........................ oscillator_resynthesis_threshold_in_dB=-96 #( Try -60 to -70 unless dropouts become audible. ) #****************** MODIFICATIONS ********************* #.................. TIME .............................. time_expansion_contraction_factor=1 # (Adjust frames_per_second in proportion to keep a # constant rate.) #.................. DECIBELS .......................... gain_in_decibels=0 #.................. PITCH ............................. frequency_shift_in_Hz=-0 pitch_transposition_in_semitones=0 #............ AMPLITUDE RESPONSE ...................... release_time_in_seconds=0 attack_time_in_seconds=0 #............ SPECTRUM WARPSHAPE ...................... spectrum_warpshape_index=0 #............ BRICKWALL FILTER ........................ FILTER_TYPE=0 #( 0 = bandpass ) #( 1 = bandreject ) #...................................................... BRICKWALL_FILTER_window_low_frequency=-1 BRICKWALL_FILTER_window_high_frequency=-1 # (-1 selects respective lowest or highest frequency) #====================================================== #*************** LOW/HIGH SHELF EQ ********************* LOW_SHELF_EQ_gain_in_decibels=0 LOW_SHELF_EQ_frequency=200 HIGH_SHELF_EQ_gain_in_decibels=0 HIGH_SHELF_EQ_frequency=2000 #********** AMPLITUDE STATISTICS ********************** print_amplitude_statistics_0_no__1_yes=1 amplitude_statistics_time_interval=.25 #====================================================== #========= SCRATCH SPACE ============================== #====================================================== #==================================================== # COMMAND LINE SETUP -- OFFICE USE ONLY # (DO NOT WRITE BELOW THIS LINE) #==================================================== pvroutine=plainpv PVFLAGS="\ \ -N$FFT_length \ -M$windowsize \ -w$window_type \ -D$frames_per_second \ -I$time_expansion_contraction_factor \ \ -a$frequency_shift_in_Hz \ -P$pitch_transposition_in_semitones \ -A$gain_in_decibels \ \ -C$output_channel \ -t$oscillator_resynthesis_threshold_in_dB \ \ -b$begintime \ -e$endtime \ \ -H$LOW_SHELF_EQ_gain_in_decibels \ -m$LOW_SHELF_EQ_frequency \ \ -X$HIGH_SHELF_EQ_gain_in_decibels \ -R$HIGH_SHELF_EQ_frequency \ \ -L$release_time_in_seconds \ -l$attack_time_in_seconds \ \ -W$spectrum_warpshape_index \ \ -T$FILTER_TYPE \ -f$BRICKWALL_FILTER_window_low_frequency \ -F$BRICKWALL_FILTER_window_high_frequency \ \ -_$output_data_format \ -=$rescale_level_in_decibels \ \ -p$print_amplitude_statistics_0_no__1_yes \ -i$amplitude_statistics_time_interval \ " echo "\n\n$pvroutine $PVFLAGS $input_file $output_file " $pvroutine $PVFLAGS $input_file $output_file RETURN TO INDEX ---------------------------------------------------------------------------- SAMPLE OF OUTPUT FROM S.PLAINPV Below is a sample of the output from S.plainpv. plainpv -N1024 -M0 -w2 -D400 -I2 -a-0 -P2 -A0 -C0 -t-96 -b0 -e0 -H0 -m200 -X0 -R2000 -L0 -l0 -W0 -T0 -f-1 -F-1 -_1 -=1 -p1 -i.25 /S1/t.snd /S1/cm.mix.snd ///////////////////////////////////////////////////////////////////// --------------------------------------------------------------------- ============================== PLAINPV ============================== --------------------------------------------------------------------- ========================== INPUT SOUNDFILE ========================== INPUT FILE: FILENAME = /S1/t.snd INPUT FILE: SAMPLE RATE = 44100 INPUT FILE: NUMBER OF CHANNELS = 2 INPUT FILE: DURATION = 2.770386 INPUT FILE: BEGIN TIME = 0.000000 INPUT FILE: END TIME = 2.770386 INPUT FILE FORMAT: 16-BIT INTEGER ========================== OUTPUT SOUNDFILE ========================= OUTPUT FILE: FILENAME = /S1/cm.mix.snd OUTPUT FILE: SAMPLE RATE = 44100 OUTPUT FILE: NUMBER OF CHANNELS = 2 OUTPUT FILE FORMAT: 16-BIT INTEGER OUTPUT FILE: DURATION = 5.540771 ======================== ANALYSIS PARAMETERS ======================== FFT SIZE = 1024 * FUNDAMENTAL ANALYSIS FREQUENCY = 43.066406 * WINDOW SIZE = 2048 FRAMES/SECOND = 400 DECIMATION SAMPLES (samples between analysis frames) = 110 ======================= RESYNTHESIS PARAMETERS ====================== TIME EXPANSION/CONTRACTION FACTOR = 2 * INTERPOLATION SAMPLES (samples between resynthesis frames) = 220 * OSCILLATOR RESYNTHESIS THRESHOLD (in dB) = -96.000000 * GAIN (in dB) = 0.000 PITCH TRANSPOSITION (in semitones) = 2.000 FREQUENCY SHIFT (in Hz) = 0.000 * ENVELOPE ATTACK TIME (in seconds) = 0.000 ENVELOPE RELEASE TIME (in seconds) = 0.000 * SPECTRUM WARPSHAPE INDEX = 0.000 * FREQUENCY WINDOW: LOW BOUNDARY = 0.000000 FREQUENCY WINDOW: HIGH BOUNDARY = 22050.000000 * *............. LOW/HIGH SHELF EQ............* LOW SHELF FREQUENCY = 200.000 .......... LOW SHELF DECIBELS = 0.000 HIGH SHELF FREQUENCY = 2000.000 .......... HIGH SHELF DECIBELS = 0.000 *...........................................* * ===================================================================== ANALYSIS: CHANNEL = 1 ..............USING BLACKMAN WINDOW .....USING OSCILLATOR BANK RESYNTHESIS ********************************************************************* ** PEAK AMPLITUDE STATISTICS ** ********************************************************************* TIME PEAKAMP DECIBELS (LAST DECIBELS PEAK) ********************************************************************* ( 0.00 - 0.25) 0.0005 -66.295 -66.295 ( 0.25 - 0.50) 0.2052 -13.754 -13.754 ( 0.50 - 0.75) 0.3285 -9.668 -9.668 ( 0.75 - 1.00) 0.3066 -10.269 ( 1.00 - 1.25) 0.3176 -9.962 ( 1.25 - 1.50) 0.2731 -11.275 ( 1.50 - 1.75) 0.2655 -11.518 ( 1.75 - 2.00) 0.2416 -12.337 ( 2.00 - 2.25) 0.2930 -10.661 ( 2.25 - 2.50) 0.2915 -10.707 ( 2.50 - 2.75) 0.3067 -10.267 ( 2.75 - 3.00) 0.4094 -7.757 -7.757 ( 3.00 - 3.25) 0.3076 -10.241 ( 3.25 - 3.50) 0.2841 -10.930 ( 3.50 - 3.75) 0.2843 -10.924 ( 3.75 - 4.00) 0.3241 -9.786 ( 4.00 - 4.25) 0.3340 -9.524 ( 4.25 - 4.50) 0.3612 -8.845 ( 4.50 - 4.75) 0.3113 -10.136 ( 4.75 - 5.00) 0.3094 -10.189 ( 5.00 - 5.25) 0.3141 -10.058 ( 5.25 - 5.50) 0.1142 -18.846 ============= PEAK AMPLITUDE ======================================== CHANNEL TIME PEAKAMP DECIBELS (CLIPPED SAMPLES) ..................................................................... 1 2.898 0.4094 -7.757 ********************************************************************* ===================================================================== ANALYSIS: CHANNEL = 2 ..............USING BLACKMAN WINDOW ********************************************************************* ** PEAK AMPLITUDE STATISTICS ** ********************************************************************* TIME PEAKAMP DECIBELS (LAST DECIBELS PEAK) ********************************************************************* ( 0.00 - 0.25) 0.0004 -67.948 -67.948 ( 0.25 - 0.50) 0.2301 -12.763 -12.763 ( 0.50 - 0.75) 0.2477 -12.122 -12.122 ( 0.75 - 1.00) 0.1969 -14.115 ( 1.00 - 1.25) 0.2631 -11.599 -11.599 ( 1.25 - 1.50) 0.2086 -13.613 ( 1.50 - 1.75) 0.2559 -11.840 ( 1.75 - 2.00) 0.2671 -11.465 -11.465 ( 2.00 - 2.25) 0.2768 -11.157 -11.157 ( 2.25 - 2.50) 0.1762 -15.082 ( 2.50 - 2.75) 0.2113 -13.502 ( 2.75 - 3.00) 0.2549 -11.872 ( 3.00 - 3.25) 0.2673 -11.460 ( 3.25 - 3.50) 0.2869 -10.847 -10.847 ( 3.50 - 3.75) 0.2841 -10.931 ( 3.75 - 4.00) 0.1991 -14.019 ( 4.00 - 4.25) 0.2131 -13.427 ( 4.25 - 4.50) 0.2540 -11.904 ( 4.50 - 4.75) 0.2235 -13.014 ( 4.75 - 5.00) 0.2407 -12.369 ( 5.00 - 5.25) 0.2941 -10.629 -10.629 ( 5.25 - 5.50) 0.1166 -18.667 ============= PEAK AMPLITUDE ======================================== CHANNEL TIME PEAKAMP DECIBELS (CLIPPED SAMPLES) ..................................................................... 2 5.103 0.2941 -10.629 ********************************************************************* ===================================================================== PEAK AMPLITUDES: ALL CHANNELS --------------------------------------------------------------------- CHANNEL TIME PEAKAMP DECIBELS (CLIPPED SAMPLES) ..................................................................... 1 2.898 0.4094 -7.757 2 5.103 0.2941 -10.629 ===================================================================== PLAINPV: RESYNTHESIS COMPLETED