I've kind of touched on this topic in various threads, but I haven't really presented a clear overall picture, and I think most people have no idea what I'm going on about. I also noticed a considerable amount of confusion in
Won't computers produce a convincing analog sound someday? (which I didn't see earlier because I avoid such threads), so I think it's worth elaborating a little bit more.
The basic idea is this: certain early digital synthesizers played back low resolution waveforms at jitter-free variable sample rates (derived from a high frequency clock by dividing it by an integer) with inadequate signal reconstruction. The result is that there are audible image frequencies (copies of the harmonic series transposed to higher frequencies). Technically they're not supposed to be there, but the result is a nice "exciter" effect. This is distinct from aliasing, as these are extended high harmonics rather than inharmonic tones.
A discrete time signal is basically an amplitude modulated impulse train (mid left of image) with a periodic spectrum (lower left). The DAC holds a constant value until the next sample (mid center), which filters the periodic spectrum with a sinc function (lower center). To be theoretically proper, the extra higher frequency stuff is supposed to be removed by a reconstruction filter, resulting in a smooth signal (mid right) and correctly reproduced spectrum (lower right). But many vintage digital synths don't do this (or do a poor job of it), which instead results in a bright, gritty sound.
Modern sample playback techniques, which use phase accumulators and high quality interpolation, do a reasonably good job of producing the "correct" signal, but this doesn't have a gritty vintage sound because the image frequencies aren't there. The result is that low resolution samples just sound dull. Using drop-sample interpolation (equivalently known as truncation, nearest neighbor interpolation, etc.) retains the image frequencies, but this approach suffers from jitter/aliasing. Reasonable results can be obtained if a high sample rate is used (like oversampling in software), but it's still not as good as the jitter-free divide-by-n case.
Also, note that the divide-by-n architectures have limited frequency resolution:
For clock frequency Fclk and playback sample rate fp, the pitch resolution in cents is 1200*log(n+1/n)/log(2) for n = floor(Fclk/fp). For playback frequency f and waveform resolution m (samples per period), fp = f*m, so the pitch resolution becomes worse as the waveform resolution is increased.
So if you're playing back a sample at 50 kHz on an Emulator II, the pitch resolution is 8.63 cents, or the worst case error is 4.32 cents. That's not very good.
Good performance for high resolution waveforms at high pitches would require extremely high clock frequencies. This is why some systems (PPG Wave 2 and Synclavier FM) automatically downsample the waveforms for higher octaves (by incrementing the waveform address by 2, 4, 8, etc. samples). This distorts the waveform but ensures that pitch resolution is the same for each octave. Samplers similarly have limited transposition ranges so that several samples must be used to span the keyboard.
Here's a basic overview of the different classes. The examples included all use a zero order hold DAC or drop-sample interpolation:
High frequency VCO (no jitter/aliasing, theoretically sort of the ideal case, but too complicated for polyphonic implementation)
RMI Harmonic Synthesizer (uses Walsh functions; 32 samples per period)
Early digital drum machines (Linn LM-1, Linndrum, Oberheim DMX, Sequential Circuits DrumTraks, Roland TR-909)
Early digital delays
Early pitch shifters (Eventide H910, H949, H969)
High frequency VCO, divide-by-n clock generation (no jitter/aliasing, VCO only used for fine tuning, vibrato and pitch bend, thus pitch control is smooth but limited)
Most divide-down TOS organs and DCO synthesizers (clock frequency typically 500 kHz to 2 MHz)
PPG Wave 2 (128 samples per period; waveforms downsampled by powers of two for higher octaves, so the highest notes should just be square waves)
Gleeman Pentaphonic (32 samples per period)
Emu Emulator I (11 MHz master clocks, independent VCOs for upper and lower keyboard split)
High frequency fixed clock, divide-by-n clock generation (no jitter/aliasing, limited frequency resolution, most examples can be implemented in software with Fs=Fclk and constrained playback frequencies)
Atari 10444D TIA (30 kHz; spectacularly poor frequency resolution)
Atari CO12294 POKEY (64 kHz in the most commonly used mode)
Nintendo 2A03 (1.79 MHz; triangle wave is 32 samples per period)
Wersi MK1, EX20, DX10, EX10R (3 MHz; probably 256 samples per period, multisamples, waveforms downsampled for higher octaves)
Hudson HuC6280 (3.58 MHz; 32 samples per period)
Konami SCC (3.58 MHz; 32 samples per period)
Commodore Amiga (3.58 MHz)
Roland Juno 106 (4 MHz; obviously I'm referring to clock pulse generation only)
OSC OSCar (4 MHz with phase locked loop frequency multiplier; 256 samples per period)
Sequential Circuits Prophet 2000 (6 MHz)
Wersi DX400/DX500 (8 MHz; probably 256 samples per period)
Wersi CD600/CD700/CD800/CD900 (8 MHz; probably 256 samples per period plus sampled attack transients)
Akai S900/S950 (8 MHz; probably also applies to S612, S700/S750 etc.)
Korg DSS-1 (8 MHz; drawn/additive waveforms are 512 samples per period, multisampled for each octave)
Emu Emulator II (10 MHz)
Kurzweil 250 (10 MHz)
Emu Emulator III (20 MHz; includes 2x oversampling)
Fixed rate phase accumulator with drop-sample interpolation (good frequency resolution, basically equivalent to a trivial software implementation) In this category the sound quality depends a great deal on the sample rate, use of multisamples and waveform resolution. ~128 samples per period results in a desirably gritty sound, but the resulting image frequencies require a high sample rate (~200 kHz minimum) to avoid severe aliasing. Consequently the DW-8000 sounds reasonably clean and not particularly gritty while the Evolver sounds gritty but has a huge amount of aliasing.
Yamaha GS-1 (23 kHz?)
Emu SP-12 (~26 kHz)
Ensoniq Mirage (31.25 kHz)
DK Synergy (32 kHz)
Con Brio ADS 200 (35.2 kHz, 4096 samples per period)
Keytek CTS-2000 (128 samples per period, multisamples)
Ensoniq ESQ-1 (41.666 kHz, uses multisamples)
Dave Smith Instruments Evolver (I think 48 kHz, 128 samples per period)
Yamaha DX7 (49.097 kHz)
Korg DW-8000 (50 kHz; uses multisamples, bandlimited/oversampled tables up to 1024 samples per period)
Allen Digital Computer Organ (MOS series)/RMI Keyboard Computer (83.333 kHz, 32 samples per period)
PPG Wave 2.3 (195.313 kHz, 128 samples per period. I haven't confirmed but this should apply to the 2.2 as well)
Dynacord ADD One (something like 250 kHz)
Sequential Circuits Prophet VS (probably 250 or 500 kHz, 128 samples per period)
Casio Consonant-Vowel and SD keyboards (600 kHz?, 16 samples per period)
Emu Emax (~1 MHz)
MOS6851 SID (0.985 or 1.023 MHz)
High frequency fixed clock, jittery variable clock generation (jitter is equivalent or worse to drop-sample interpolation at the same sample rate, pitch resolution is worse)
NED Synclavier FM (401.929 kHz; 256 samples per period, waveforms downsampled by powers of two for higher octaves)
NED Synclavier polyphonic sample playback (effective sample rate ~1.6 MHz)
Fairlight CMI (17.145865 MHz, 128 samples per period. This applies to all models, but the CMI III should have slightly less jitter)
Any corrections or additions to this list would be appreciated. I can possibly also determine sample rate and waveform resolution from submitted test recordings.