6 Early quantum theory
6.1 Introduction: Cracks in the Classical Edifice
By the end of the 19th century, physics flexed like it had the final boss down. Newtonian mechanics ruled motion, Maxwell’s equations ruled fields, thermodynamics ruled heat. Yet precision experiments began posting L’s for classical theory. A new playbook—quantum—was loading.
What broke first? Three pillars:
-
Radiation and heat. The spectrum of an ideal glowing cavity (a blackbody) refused to follow classical predictions. Equipartition and Maxwell’s waves led to infinite ultraviolet energy—an “ultraviolet catastrophe.”
-
Light and electrons. Shining light on a clean metal knocked out electrons with sharp rules that defied wave-only intuition.
-
Atoms and spectra. Gases glowed at razor-thin frequencies described by simple formulas, yet any classical electron orbiting a nucleus should radiate and spiral inward.
These cracks weren’t edge cases; they were mainstream physics behaving weirdly.
6.1.1 Characters entering the stage
Max Planck (1900). A thermodynamics purist backed into heresy. To fit blackbody data, he quantized energy exchange in chunks of . He thought it was a mathematical trick; the universe took him literally.
Albert Einstein (1905). The “patent-clerk energy.” He said the quiet part out loud: light itself comes in quanta (photons). With that, the photoelectric rules suddenly made sense, and stepped out of the blackbody and into matter.
Ernest Rutherford (1911). He yeeted Thomson’s “plum pudding” and installed a nuclear atom after alpha particles bounced at wild angles from gold foil. But a classical electron racing around a tiny nucleus should radiate energy and crash. Stable atoms? Not in classical land.
Niels Bohr (1913). He proposed that electrons live only on special orbits with quantized angular momentum and jump between them by absorbing or emitting light at specific frequencies. The hydrogen spectrum clicked into place—partial victory, but weird rules demanded deeper logic.
Louis de Broglie (1923–24). If light acts like particles, maybe matter acts like waves. He proposed , a hypothesis begging for an experiment.
6.1.2 The “classical no-go”s (with minimal math)
Blackbody catastrophe. Classical equipartition feeds every electromagnetic cavity mode an average energy . Counting modes yields a spectral energy density that rises , blowing up at high frequency. Data said “nope”—the spectrum peaks and falls.
We will show in §6.2 that introducing energy quanta leads to Planck’s distribution and cures the divergence
No ultraviolet apocalypse appears in this curve
Photoelectric rules. Experiments found: (i) no electrons below a threshold frequency, however intense the light, (ii) above threshold, the maximum electron kinetic energy is linear in frequency, not intensity, (iii) emission is effectively instantaneous. Einstein’s one-liner,
nailed it, where is the work function. Intensity controls how many photons hit, not their individual energy
Atomic spectra. Balmer and Rydberg wrote beautiful empirical formulas for hydrogen lines. Rutherford’s nuclear atom raised the stakes: a tiny, charged nucleus should make orbiting electrons radiate continuously and smear the spectrum—contradiction. Bohr’s move was to quantize angular momentum and energy levels, producing discrete lines and a frequency rule that matched Rydberg’s data. But why those orbits? Old quantum theory worked, then stalled.
6.1.3 The vibe: discrete meets wave
The early quantum story is a series of “wait, both?” moments:
- Light interferes like a wave but trades energy like a particle
- Electrons behave like particles in tracks yet diffract like waves through crystals
- Energy in atoms is not continuous real estate; it’s a floor plan of levels with allowed transitions
The common thread is discreteness sneaking into systems previously modeled as continuous. That discreteness shows up in spectra, specific heats, threshold effects, and stability.
6.1.4 Minimal mathematical promises (to be delivered in later sections)
- Planck spectrum. From counting quanta with weight we will derive the energy density above and recover the Stefan–Boltzmann law and Wien’s displacement
-
Einstein photoelectric equation. Straight line plots of vs give slope and intercept —that is how early experiments measured Planck’s constant with electrons
-
Bohr energies. For hydrogen-like atoms we will get
and the emitted photon frequency from a jump will satisfy with Rydberg constant emerging naturally
- de Broglie hypothesis. Assuming and makes group velocity match particle velocity and predicts electron diffraction, later seen in labs
6.1.5 What “early quantum theory” is (scope note)
We will live mostly in 1900–1925: the old quantum theory (Planck → Einstein → Bohr–Sommerfeld → de Broglie) and its decisive experiments (Franck–Hertz, Davisson–Germer, Stern–Gerlach). Full quantum mechanics (Heisenberg’s matrices and Schrödinger’s waves, 1925–26) is the bridge to the next chapter, but we will aim the runway here: when classical intuition finally yields and a consistent quantum calculus becomes inevitable.
6.2 Blackbody Radiation and Planck’s Hypothesis
Blackbody radiation is where classical physics face-planted and quantum theory quietly spawned. A blackbody is an ideal cavity with a tiny hole: light that enters bounces around and is essentially absorbed; light that leaves carries the equilibrium spectrum of the cavity walls. The mission in the 1890s was to predict the energy density of this radiation as a function of frequency and temperature and to explain why the measured curve rises, peaks, and then crashes instead of diverging. Spoiler: classical equipartition said “infinite UV energy,” nature said “lol, no,” and Max Planck invented quanta to get the curve right.
6.2.1 The experimental scene: precision burns theory
German labs (Lummer, Pringsheim, Rubens, Kurlbaum) built exquisitely calibrated cavities and detectors. At moderate and high frequencies, the data followed Wien’s law remarkably well; at low frequencies (long wavelengths), new measurements started to bend away from Wien’s exponential falloff, demanding a better formula. Planck followed these plots obsessively—enough that when Rubens showed him fresh long-wavelength data in October 1900, he went home and cooked up the curve that matched everything.
Two common ways to describe the spectrum are by frequency and by wavelength. They are related but not identical in shape because changes the measure:
You’ll see two standard forms. By frequency:
By wavelength:
We’ll derive first and then convert to .
6.2.2 Mode counting in a cavity: the factor
Take a cubic cavity with perfectly conducting walls. Electromagnetic standing waves fit only certain wavevectors ; each allowed mode is a harmonic oscillator. Counting modes with frequencies between and gives a density of states per unit volume
The comes from spherical shells in -space and from two polarization states. This factor is pure geometry/kinematics and will be the same in classical and quantum derivations. The physics enters when we decide how much average energy to assign to each mode at equilibrium.
6.2.3 Classical attempt: equipartition and the ultraviolet catastrophe
In classical statistical mechanics, every quadratic degree of freedom contributes to the mean energy. A harmonic oscillator (one mode) has two quadratic pieces (potential and kinetic), so the mean energy per mode would be
Multiply by the mode density to get the spectral energy density
This is the Rayleigh–Jeans law. It works at low frequencies but skyrockets as and, when integrated over all frequencies, diverges. That divergence—the ultraviolet catastrophe—is not a cute nickname; it is classical physics predicting infinite energy in any warm room. Reality declines.
6.2.4 Wien’s law: a great half-truth
Empirically, Wien proposed
with constants fit to data. This law nails the high-frequency tail (it decays exponentially) and implies the correct displacement law (the peak slides linearly with ), but it fails at low frequencies, where experiments show (Rayleigh–Jeans behavior), not . In short, Wien gets the UV, Rayleigh–Jeans gets the IR, and neither gets the full mixtape.
6.2.5 Planck’s move: discretize the energy exchange
Planck’s radical idea was to treat the cavity walls as a set of microscopic oscillators (resonators) that exchange energy with the radiation field in discrete packets of size :
Given Boltzmann weights, the partition function for one oscillator of frequency is
The mean energy per oscillator is then
Multiply by the density of modes to get the full spectrum:
This is Planck’s law, 1900. It reduces to Rayleigh–Jeans at low frequency (expand the exponential) and to Wien’s exponential at high frequency. One curve to rule them all.
6.2.6 Low- and high-frequency limits (sanity checks)
Let .
- Low frequency (): , so
which is Rayleigh–Jeans
- High frequency (): , so
which is Wien’s law with and
So Planck’s formula gracefully bridges the two regimes that classical and empirical laws captured separately.
6.2.7 Total energy and Stefan–Boltzmann law
Integrate over all frequencies to get the total energy density :
Change variables ; then and
The integral is a famous one:
So
The radiated power per unit area from a black surface follows by (kinematic factor for isotropic radiation):
This is the Stefan–Boltzmann law with its constant expressed in terms of .
6.2.8 Where is the peak? Wien’s displacement law
Maximize with respect to at fixed . Setting , the condition reduces to
This transcendental equation has a root
Hence the frequency of the peak scales as
If you instead work with the wavelength form
and maximize with respect to , you get Wien’s displacement law
with a constant fixed by the root of a slightly different transcendental equation. Important PSA: the peak in and the peak in are not at the same photon energy because the Jacobian reshapes the curve.
6.2.9 Converting between frequency and wavelength spectra
Use with and . Then
Plugging Planck’s gives the standard quoted above. The minus sign drops because decreasing increases .
6.2.10 A statistical-mechanics view (why quantization matters)
Classically, energy is continuous and equipartition dumps into each mode no matter how high its frequency. Quantum statistically, the occupation of a mode with energy spacing is Bose–Einstein with zero chemical potential:
and the mean energy per mode is . High-frequency modes are exponentially hard to populate because you must pay upfront; low-frequency modes behave classically because makes large and . The catastrophe disappears because discrete energy spacing throttles the UV.
6.2.11 Planck’s “I didn’t mean to be a revolutionary” arc
Planck was a dyed-in-the-wool thermodynamics guy who wanted a principled derivation of Wien’s empirical law. After the Rubens data contradicted Wien at low , he sought an interpolation that matched both limits and then reverse-engineered a statistical story to justify it. He later called the quantization step “an act of desperation.” The act stuck. Within five years Einstein gave those energy packets—photons—independent life in the photoelectric effect. Planck’s constant went from curve-fitting fudge to a fundamental unit of nature.
6.2.12 Practical knobs: emissivity and real surfaces
Real materials are not perfect blackbodies. If a surface has emissivity between 0 and 1, its spectral radiance is
Kirchhoff’s law of thermal radiation says emissivity equals absorptivity at each frequency, so cavities with a small aperture are effectively black even if the walls are not—rays bounce until they hit an absorptive patch. This is why laboratory blackbodies are cavities.
6.2.13 Worked mini-examples
(a) Power of a star. Approximating a star as a blackbody of radius and surface temperature , its luminosity is
(b) Peak color of a filament. Using , heating a filament so doubles halves the peak wavelength, shifting from red toward blue. Photographs feel the shift because camera sensors respond to weighting.
(c) Cavity photon number density. The photon number spectrum is , hence the total number density is
Changing variables yields with a proportionality constant involving .
6.2.14 What to remember (exam-tier TL;DR)
- Geometry gives the density of modes ; physics gives the average energy per mode
- Classical equipartition Rayleigh–Jeans ultraviolet catastrophe
- Planck: energy exchange in quanta
- Multiply them to get Planck’s law; integrate to get Stefan–Boltzmann ; maximize to get Wien’s displacement
- Quantization is not optional decoration; it is the only way to make thermodynamics and electromagnetism coexist peacefully in a cavity
6.3 Photoelectric Effect and Einstein’s Light Quanta
Shine light on a clean metal and electrons pop out. Classical waves could explain “more light → more energy,” but experiments in the early 1900s said “frequency first, intensity second.” That plot twist pushed Einstein to a bold take: light energy arrives in discrete packets of size . The photoelectric effect is where photons stop being poetry and start being accounting.
6.3.1 The experimental rules (what needed explaining)
Laboratories found a crisp set of facts:
- There exists a threshold frequency below which no electrons are emitted, however intense the light
- Above threshold, the maximum kinetic energy of emitted electrons depends linearly on frequency, not intensity
- The emission is prompt on the scale of the light cycle; no measurable delay even at low intensities
- The photoelectric current (number of electrons per second) scales with light intensity for fixed frequency, up to a saturation set by the apparatus
These rules are weird in a pure wave picture where energy is spread out over the wavefront and can accumulate continuously.
6.3.2 Einstein’s one-liner
Einstein’s 1905 hypothesis: light of frequency behaves as a stream of quanta (photons), each carrying energy . An electron in the solid that absorbs one photon spends part of that energy to escape the material—this is the work function —and keeps the rest as kinetic energy. Thus
The threshold frequency is therefore
Intensity controls how many photons arrive per second, hence how many electrons can be emitted per second, but not the maximum energy of each electron. Frequency sets the budget per electron.
6.3.3 Measuring and with stopping potentials
Experimenters measure the maximum kinetic energy by applying a retarding voltage to stop the most energetic electrons from reaching the collector. The definition is
Combining with Einstein’s relation gives a straight-line plot
So a graph of versus has slope and intercept . That is how early experiments extracted Planck’s constant and the work function directly from electron kinetics.
6.3.4 Why the classical wave picture face-plants
A classical wave deposits energy continuously at a rate proportional to intensity. If that were the whole story, then at low intensities electrons should need a long time to accumulate enough energy to escape (a measurable delay), and for sufficiently intense low-frequency light the effect should happen anyway—no threshold. Neither occurs. The data scream “energy arrives in lumps,” with each electron grabbing at most one lump.
6.3.5 Millikan’s reluctant confirmation
Robert Millikan spent a decade trying to break Einstein’s interpretation by doing gold-standard measurements. He refined vacuum systems, surface preparation, and electronics. The result was the cleanest straight lines in physics: versus with slope and a frequency intercept at . Millikan didn’t like the photon idea philosophically, but his data left no wiggle room. If your enemy measures your theory and it wins, that’s a W.
6.3.6 From intensity to flux: how many electrons?
Hold fixed above threshold. Doubling intensity doubles the photon flux at the surface. If the surface is clean and space-charge effects are controlled, the saturation current scales linearly with intensity because each absorbed photon can liberate at most one electron. The fraction that actually escape is the quantum efficiency ; thus
where is the optical power absorbed on the active area. Real values of depend on the material, surface state, and whether photoelectrons are backscattered or trapped before reaching the collector.
6.3.7 Energy distribution and the role of the solid
is the upper edge. The distribution of kinetic energies depends on the band structure and scattering within the solid. In simple metals one can sketch an “initial state” between the Fermi level and a few below it; an electron that absorbs a photon emerges with energy roughly minus any losses to inelastic scattering on the way out. That is why measured spectra are broad, not delta functions, even though the top edge follows cleanly.
6.3.8 Time-of-flight and “instantaneous” emission
Modern fast electronics confirm the old claim: emission is prompt within picoseconds or faster, consistent with a single-photon process instead of a long integration. In photon language, a single absorption event transfers in one go; there is no need to “wait” for energy to build up.
6.3.9 Surface physics and work function engineering
The work function is a surface property: it depends on the material, crystal face, cleanliness, and adsorbates. Alkali metals with low respond to red light; clean noble metals require blue/UV. Coatings and cesiation can lower , tuning downward. This is why phototubes and photocathodes care deeply about surface preparation.
6.3.10 Typical lab analysis workflow
- Illuminate a freshly prepared metal surface with monochromatic light at several frequencies
- For each , sweep the collector voltage negative until the photocurrent drops to zero to find
- Plot versus ; fit a line to extract slope and intercept
- Independently, measure saturation current versus intensity at fixed to confirm linear scaling of electron count with photon flux
The two experiments probe different pieces of Einstein’s model: the energy per photon and the counting of photons.
6.3.11 What about intensity at fixed frequency?
At fixed , increasing intensity raises the number of emitted electrons (current) but leaves the maximum energy unchanged because is unchanged. Conversely, at fixed intensity, increasing frequency raises linearly. On the – plot, intensity just changes how quickly you reach saturation, not the slope or intercept.
6.3.12 Photons, momentum, and consistency checks
Photons also carry momentum . In photoemission from a flat surface, most of the photon momentum is small compared with electron Fermi momenta, so is less central than for escape energetics, but radiation pressure and light-driven forces make sense in the same bookkeeping. Later phenomena—Compton scattering of X-rays—extend the particle-like picture by showing wavelength shifts that require both photon energy and photon momentum conservation. The photoelectric effect is the gentle on-ramp; Compton is the highway.
6.3.13 Why “one photon → one electron” has caveats
At high intensities and short pulses, multi-photon photoemission can occur: two (or more) photons combine to eject an electron even when is below . The signature is a nonlinear dependence of current on intensity. The original continuous-wave experiments lived in a regime where the single-photon process dominates, which is why Einstein’s linear law described them. Quantum mechanics contains both regimes—you just move along the intensity axis.
6.3.14 Quick derivations you can do on a napkin
Threshold. Set in Einstein’s law to get . This is the cleanest way to define experimentally with light.
Slope extraction. From , the slope of vs is . Measure it and you have a value of independent of blackbody fits or spectroscopy.
Work function units. If is reported in electron-volts, then the threshold in hertz is
and in wavelength
It is common to quote for a photocathode because engineers think in wavelengths.
6.3.15 Photoelectric devices: from bench to tech
- Vacuum photodiodes and photomultipliers. Use photoemission from a low- cathode to seed electrons, then multiply via dynodes. The method turns into gain calibration and noise analysis
- Solar cells (not the same effect). Photovoltaic junctions convert light to electricity via electron–hole pairs and built-in fields, not vacuum photoemission, but the threshold idea still sets which photons can be absorbed
- Ultrafast electron sources. Femtosecond lasers drive single- or few-photon emission to produce short electron bunches for diffraction and microscopy
6.3.16 Conceptual audit
- Particle-like energy transfer. per photon, one photon per primary electron escape event
- Material gatekeeper. The work function sets the doorway; is the key frequency
- Linear edge. tracks regardless of intensity; stopping potential nails it as
- Counting vs energy. Intensity changes how many electrons, frequency changes how energetic they can be
Quantum takes the wheel precisely where classical physics insists on the wrong dependencies.
6.4 Atomic Spectra and Bohr’s Model
Spectral lines were the OG QR codes of atoms—thin, repeatable, and screaming “there’s structure in here.” Long before quantum mechanics became a full theory, careful spectroscopy mapped those lines into simple integer patterns. The old quantum theory’s biggest W was Bohr’s 1913 model: a semi-classical atom with quantized orbits that nailed hydrogen’s spectrum and hinted at a deeper wave picture.
6.4.1 Empirical prelude: Balmer–Rydberg patterns
Hydrogen’s visible lines fit Balmer’s 1885 formula; Rydberg soon generalized it to a wavenumber series
Here is the Rydberg constant and are positive integers labeling initial and final “levels.” Famous series:
- Lyman (ultraviolet)
- Balmer (visible; Hα is at nm)
- Paschen (infrared)
These were pure numerology—but spookily consistent across gases once you inserted effective constants.
6.4.2 Rutherford’s atom and the classical fail
Rutherford’s 1911 scattering showed a tiny, charged nucleus. Classical electrons orbiting such a nucleus are accelerated charges; Maxwell says they radiate and spiral in. That predicts continuous spectra and no stable atoms. Yet nature flexed discrete lines and stability. Enter Bohr.
6.4.3 Bohr’s postulates
Bohr’s model stapled three rules onto classical mechanics:
- Stationary states. Certain orbits are allowed and do not radiate despite acceleration
- Quantum of action. Angular momentum is quantized
- Frequency condition. Radiation occurs only when jumping between stationary states, with photon energy
These hacks were audacious—and shockingly predictive for one-electron atoms.
6.4.4 Coulomb plus quantization: solving for radii, speeds, energies
For a nucleus of charge and an electron in a circular orbit of radius and speed , centripetal force equals Coulomb attraction
Combine with the quantization to eliminate and solve for :
The bracket is the Bohr radius
so . The orbital speed follows as
where
is the fine-structure constant. The total energy is kinetic plus potential
which evaluates to
For hydrogen (), the ground state is
and levels scale like .
6.4.5 From Bohr levels to the Rydberg formula
Bohr’s frequency condition with and gives a photon wavenumber
Thus the Rydberg constant in the infinite nuclear mass limit is
For real hydrogen, the electron orbits the center of mass; replace by the reduced mass
to get
This tiny correction produces isotope shifts between H and D.
6.4.6 Why felt less random over time
A decade later, de Broglie proposed matter waves with . Demanding an integer number of wavelengths around a circular orbit,
implies
i.e., Bohr’s rule. The standing wave picture explains why only certain radii are stable: destructive interference kicks out the rest. This wave logic is what Schrödinger will formalize in Chapter 7.
6.4.7 Correspondence principle and large
Bohr insisted that for large quantum numbers, quantum predictions must approach classical results. For , the orbital frequency of the electron matches the emission frequency between neighboring levels, and the pattern of allowed transitions mimics the classical Fourier components of a Coulomb orbit. This “correspondence principle” guided the old quantum theory when full mechanics was still loading.
6.4.8 Selection rules and intensities (why some lines are strong)
Bohr’s model locates levels but doesn’t compute line intensities. Empirically, strong lines obey angular-momentum-like selection rules such as that later emerge from dipole transition matrix elements in wave mechanics. The old theory guessed at them via classical radiation patterns; quantum mechanics derives them cleanly.
6.4.9 Worked mini-derivations
(a) Lyman-α wavelength. From ,
so
Numerically this lands near nm once is included.
(b) Balmer Hα line. For ,
so
which evaluates to about nm.
(c) Ionization energy from level . Send ; the needed energy is
for hydrogen.
6.4.10 Sommerfeld’s ellipses and early fine structure
To push further, Sommerfeld allowed elliptical orbits and quantized both the radial and angular actions. Adding a relativistic mass–velocity correction produced a partial account of fine structure (small level splittings) and the anomalous spacing of closely lying lines. It was clever but messy, and ran out of road on multi-electron atoms and precision details.
6.4.11 Where Bohr wins, where it breaks
Wins
- Predicts and the exact hydrogen and hydrogenic spectra when reduced mass is included
- Explains series limits and ionization energies
- Anticipates de Broglie and provides a clean bridge to wave mechanics
Breaks
- Cannot explain line intensities and polarizations from first principles
- Fumbles fine structure fully and the Zeeman effect anomalies
- Faceplants on helium and other multi-electron atoms
- Treats non-radiating “accelerated” electrons by fiat rather than mechanism
These failures are not bugs—they are breadcrumbs pointing to a wave equation and spin.
6.4.12 Practical constants and forms you will use
Equivalent and handy expressions:
For a hydrogenic ion with nuclear charge , replace by and by .
6.4.13 Concept check: Bohr vs wave mechanics
Bohr’s radii correspond to maxima in the radial probability density of the exact hydrogen wavefunctions. The quantum number remains, but and split levels and determine shapes; stationary states become standing waves in three dimensions. The photon frequency condition survives as energy eigenvalue differences, while selection rules arise from matrix elements of the dipole operator.
6.4.14 Minimal problem kit
- Derive and from force balance plus and check numerically
- Starting from , recover the Rydberg formula and identify in terms of
- Include the reduced mass and estimate the H–D isotope shift of Hα
- Use to check nonrelativistic consistency ( for H)
6.5 Rutherford Scattering and the Nuclear Atom
Sometimes one experiment deletes an entire worldview. The Geiger–Marsden gold-foil experiment did exactly that. They fired alpha particles (helium nuclei) at an ultra-thin gold foil and counted scintillations on a screen. Most alphas zipped through with tiny deflections, but a surprising few bounced at large angles, even backward. If positive charge were smeared throughout the atom (Thomson’s “plum pudding”), big-angle kicks would be essentially impossible. The data demanded a concentrated, tiny, positively charged nucleus.
6.5.1 What was measured and why it shocked people
- Beam: alpha particles around a few MeV
- Target: sub-micron gold foil, atomic number
- Detector: zinc sulfide screen counting single scintillations
- Surprise: an exceedingly small fraction scattered through tens of degrees or more, some near
Classically, if positive charge were diffuse, only small, cumulative nudges add up. Large-angle events practically require a short-range, strong encounter with a compact center—i.e., a nucleus.
6.5.2 Coulomb repulsion and hyperbolic orbits
Treat the interaction as repulsive Coulomb between an alpha of charge (with ) and a nuclear charge :
It is convenient to define
Nonrelativistic trajectories are hyperbolae. The relation between scattering angle and impact parameter for a projectile of kinetic energy is
which inverts to
Here with reduced mass ; for a very heavy target, and lab CM.
6.5.3 The Rutherford differential cross section
The measurable is the angular distribution, i.e. the differential cross section:
Using above and gives the famed Rutherford formula
Key scalings:
- , so heavier nuclei punch harder
- , so higher beam energy suppresses scattering
- Diverges at small angles like
That falloff matches the data over a wide range—until other physics (screening, nuclear forces, relativity) kicks in.
6.5.4 Distance of closest approach and nuclear scale
For head-on approach (, ) the kinetic energy converts into potential energy at the turning point :
so
Numerically, the handy constant is . For alphas on gold (), . With ,
That is tens of femtometers—orders of magnitude smaller than the atomic size (). Pushing to higher energies drives down toward a few femtometers, the scale of nuclear radii, consistent with a tiny dense core.
6.5.5 From counts to cross sections in a foil
For a thin target with atomic number density and thickness , the expected rate into solid angle at angle is
“Thin” means single scattering dominates and attenuation is negligible. At larger or for softer beams, multiple scattering blurs the pure Rutherford shape at small angles.
6.5.6 Where the classical picture bends
-
Atomic screening (small ). At large the projectile sees a nearly neutral atom; bound electrons screen nuclear charge. Effective drops, taming the divergence. A crude cutoff angle corresponds to impact parameters beyond the screening length
-
Nuclear size and strong force (large , high ). When approaches a few fm, the finite nuclear radius and the strong interaction spoil the pure Coulomb potential. Deviations at the largest angles or highest energies carry nuclear-structure info
-
Relativity and spin. For faster beams and electrons as projectiles, one needs Mott scattering, i.e., Dirac theory corrections and spin–orbit effects. The qualitative survives as a backbone but with spin-dependent factors
-
Quantum mechanics. The quantum calculation (partial waves or Born approximation) reproduces the Rutherford formula for Coulomb potentials—remarkable agreement between classical and quantum for this case
6.5.7 What the experiment proved
- Positive charge and most mass live in a tiny volume compared to the atom
- Electrons are peripheral; the atom is mostly empty space
- Long-range deflection is Coulombic and centered, not a diffuse push
Those facts killed the plum-pudding model and launched the nuclear atom, setting the stage for Bohr’s quantized orbits (§6.4) and, later, quantum mechanics proper.
6.5.8 Worked mini-derivations
(a) Differential cross section
Start from
(b) Integrated yield beyond a cut angle
Using and ,
Handy for estimating count rates when your detector subtends large angles
(c) Closest approach at nonzero
Conserve energy and angular momentum ; the turning point satisfies
Solve the quadratic for to link geometry to how near the alpha gets
6.5.9 Problem kit
- Use to predict the ratio of counts at and for the same detector solid angle
- For alphas on Au, compute at and compare to an estimate of the nuclear radius with
- Include screening by imposing a maximum impact parameter equal to a screening length and show how it regularizes the small-angle divergence
- Derive the lab–CM relation for and show why using lab is an excellent approximation for targets
In summary: The gold-foil experiment showed that atoms are mostly vacuum with a tiny, massive, positively charged nucleus at the center. The Coulomb scattering math predicts a sharp, parameter-free angular law,
and the observed large-angle events made the nuclear atom unavoidable. Once you accept a compact nucleus, Bohr’s quantized orbits and discrete spectra stop being magic tricks and start being architecture.
6.6 Matter Waves: de Broglie to Davisson–Germer
If light can act like particles, maybe particles can act like waves. That was Louis de Broglie’s audacious 1923–24 symmetry flex. He guessed that every material particle with momentum carries a wavelength . Wild? Sure. But the prediction stuck landing after a few years: electrons diffracted from crystals exactly like X-rays, with the measured angles matching . Matter waves were not a metaphor; they were lab reality.
6.6.1 The hypothesis
de Broglie’s one-liner is
Together with photon and , he suggested the wave–particle dictionary
The two equalities say: if something carries energy–momentum , there is a wave with angular frequency and wavevector marching in step.
6.6.2 Phase and group velocities
For a free particle with dispersion relation , define
Then
- Relativistic and give
Phase can be superluminal with no drama (it carries no information); group velocity equals the particle speed
- Nonrelativistic gives
This is why wave packets can track particles
6.6.3 Wave packets and uncertainty (sneak preview)
A localized particle corresponds to a superposition of plane waves—a wave packet. If the momentum spread is (i.e., spread in ), then the spatial width obeys
The packet moves at but disperses in time for nonrelativistic , because different components travel at slightly different speeds. This is the seed of the uncertainty principle and of Schrödinger dynamics.
6.6.4 Electrons from a potential: handy formulas
Accelerate an electron through a potential difference ; it gains kinetic energy . The nonrelativistic momentum is
so
Engineers love the quick constant (with in ångström, in eV):
For higher voltages, include relativity:
which reduces to the nonrelativistic result when
6.6.5 Diffraction: crystals as electron gratings
A crystal is a 3D diffraction grating. For planes of spacing , constructive interference follows Bragg’s law
Given , electrons should diffract with angles set by —just like X-rays. This was the smoking gun de Broglie asked the world to look for.
6.6.6 Davisson–Germer (and G. P. Thomson): the receipts
Davisson & Germer (1927). Electrons of tens of eV scattered from a nickel single crystal produced sharp intensity peaks at angles consistent with Bragg’s law when you plug in . Varying the accelerating voltage shifted the peak exactly as predicts.
G. P. Thomson (also 1927) sent keV electrons through thin polycrystalline foils and recorded ring patterns—the powder-diffraction version of Bragg. Two independent routes, one conclusion: matter waves are real.
Rule of thumb: gives (interatomic distances), perfect for crystal diffraction.
6.6.7 From Bohr to de Broglie: standing waves
Bohr’s angular-momentum postulate stops feeling arbitrary once you impose standing waves on a circular orbit:
which gives
Same quantization, now with wave logic. Unstable radii are those where the phase does not fit—destructive interference kicks them out. de Broglie supplied the mechanism Bohr’s model lacked.
6.6.8 Schrödinger’s motivation (why a wave equation exists)
Take the free-particle dispersion and the de Broglie dictionary , . Demand a linear differential equation whose plane-wave solutions have . You land on
the time-dependent Schrödinger equation. In other words, de Broglie’s is the dispersion law that Schrödinger’s PDE realizes in real space.
A complementary viewpoint uses Hamilton–Jacobi theory: write ; then behaves like momentum. Quantum mechanics is classical mechanics plus phase and interference.
6.6.9 Electron double-slit: the mic drop
Send single electrons through two slits; record impacts one by one. The screen builds an interference pattern over time—then vanishes when you measure which slit each electron took. The pattern wavelength matches . The message is not that electrons smear; it is that probabilities interfere with phases set by action differences.
6.6.10 Matter-wave tech: microscopes and metrology
- Electron microscopes. Resolution scales roughly like , so swapping visible light () for 100 keV electrons ( with relativistic correction) makes nanoworld imaging routine
- LEED/RHEED. Low- and reflection-energy electron diffraction fingerprint surface structures and growth in real time
- Atom interferometry. Neutral atoms also carry de Broglie waves; splitting and recombining them measures gravity, rotations, and fundamental constants with absurd precision
6.6.11 Worked mini-examples
(a) De Broglie wavelength at .
Using ,
Squarely in the range of crystal spacings—no wonder early peaks showed up near modest voltages
(b) Voltage for .
Invert the same rule:
(c) Group velocity check (nonrelativistic).
With ,
The packet moves like the particle—consistency unlocked
(d) Bragg angle estimate.
For , , first order ,
Real experiments include geometry factors (incidence vs detection), but the scaling is right on the nose
6.6.12 Subtleties and limits
- Wave packet spread. Nonrelativistic packets broaden with time; real beams are continually collimated or detected before big dispersion
- Scattering versus bound states. In crystals, elastic diffraction uses the crystal as a giant phase grating; in atoms, bound-state standing waves quantize energies
- Heavy particles. shrinks fast with mass; neutron and atom interferometers compensate by running at ultra-low velocities to get usable wavelengths
- Relativistic electrons. Use the corrected ; at 100 keV, nonrelativistic formulas understate the momentum (and overstate )
6.6.13 Problem kit
- Show that for a free relativistic particle
- For an electron beam at , compute with and without relativistic correction and compare
- Given a cubic crystal with lattice constant , list -spacings for , , and predict first-order Bragg angles for a chosen
- Build a Gaussian wave packet , evolve it with the free Schrödinger kernel, and extract the time dependence of the rms width
6.7 Old Quantum Theory: Hits and Misses
Before full quantum mechanics landed in 1925–26, physicists hacked classical mechanics with a few bold patches. They quantized certain integrals, imposed selection rules, and stitched in Bohr’s frequency condition. This old quantum theory nailed hydrogen’s spectrum and hinted at deeper structure, but it stumbled on multi-electron atoms, intensities, and magnetic quirks. This section is the status report: what worked, what broke, and what carried over into modern QM.
6.7.1 What “old quantum” meant
The playbook had three pillars:
- Stationary states exist that do not radiate while the electron whirls in a Coulomb orbit
- Bohr–Sommerfeld quantum conditions select allowed motions
- Bohr frequency condition connects level differences to light
In symbols, energy levels and jumps emit or absorb photons with
The deep math piece was the quantization of actions.
6.7.2 Bohr–Sommerfeld quantum conditions
For a system with separable coordinates and conjugate momenta , allowed orbits satisfy
The line integral runs over one full period of motion in coordinate . Define the action variables
Then the rule is simply . These are adiabatic invariants: under slow changes of parameters, they remain constant, a fact Ehrenfest elevated into the adiabatic principle. Where the dynamics is integrable and separable, the rules bite; where it is not, they flail.
6.7.3 Hydrogen, again but more general
Bohr’s circular orbits are a special case. With Sommerfeld’s ellipses, the Coulomb problem has two actions: radial and angular . Quantization gives
and the total energy depends only on :
The principal quantum number counts nodes in the old theory too, but without a wavefunction to say it out loud. Sommerfeld also added a relativistic mass–velocity correction and extracted a qualitative fine structure splitting, a hint of what Dirac would do properly.
6.7.4 Harmonic oscillator: the “almost right” spectrum
For a 1D harmonic oscillator with frequency , the action integral evaluates to
Bohr–Sommerfeld demands , hence
This is close to the truth; modern QM gives . The missing is the vacuum’s zero-point energy, one of several places where old quantum theory is off by a half.
6.7.5 Correspondence principle and selection rules
Bohr’s correspondence principle guided guesses: in the limit of large quantum numbers , quantum predictions must morph into classical ones. For bound motion one can Fourier-expand a classical trajectory and show that the strongest allowed quantum transitions tend to frequencies present in the classical spectrum. This logic nudged the empirical dipole selection rule
and explained why transitions with wildly different frequencies are feeble. The principle later reappeared as actual mathematics in the Kramers–Heisenberg dispersion formula and sum rules.
6.7.6 Einstein’s and coefficients: blackbody from jumps
In 1916–17 Einstein wrote a kinetic model of radiative transitions between two levels and with degeneracies . In a radiation field of spectral energy density , the rates are
Thermal equilibrium plus Boltzmann factors then enforce
and reproduce Planck’s law for . This was the first theoretical appearance of stimulated emission, which lasers would monetize decades later. Old quantum theory could balance the books for radiation.
6.7.7 Dispersion theory and the first sum rules
Kramers recast optical dispersion in terms of virtual jumps between levels and, using correspondence, derived frequency-dependent polarizabilities that matched classical Lorentz–Drude behavior at low frequency but respected quantum selection rules. Heisenberg and Born then took the hint and replaced classical amplitudes by arrays of transition amplitudes—Heisenberg’s bridge to matrix mechanics. Even before that leap, sum rules relating oscillator strengths to charges were early “no free lunch” constraints on spectra.
6.7.8 Thermodynamics wins: specific heats of solids
Classical equipartition predicts a constant molar heat capacity (Dulong–Petit). Data at low tank that prediction. Einstein’s model (1907) treats each atom as an independent quantum oscillator of frequency :
It captures the drop but not the exact low- power. Debye improved it by quantizing acoustic modes up to a cutoff , yielding at low and at high . While not part of orbital quantization, these successes showed that discreteness fixes thermodynamics too.
6.7.9 Where old quantum theory face-planted
-
Helium and many-electron atoms. The three-body Coulomb problem has no closed integrals of motion to quantize cleanly. Ad hoc quantum conditions gave conflicting energies and wrong term structures. Helium was the “here be dragons” sign until wave mechanics and spin arrived
-
Anomalous Zeeman effect. In weak magnetic fields, many spectral lines split into more components than classical or orbital angular momentum alone can explain. The fix needs electron spin and spin–orbit coupling, which the old theory lacked
-
Intensities and polarizations. Level positions were OK for hydrogenic systems; line strengths, polarizations, and lifetimes demanded matrix elements of operators between wavefunctions—tools old quantum did not possess
-
Radiation from accelerated charges. Old quantum declared stationary orbits non-radiating by fiat, while classical electrodynamics says accelerating charges radiate. The rule worked phenomenologically but had no mechanism until wave mechanics turned “orbit” into a stationary standing wave
-
Nonseparable systems and chaos. The action-quantization rule needs separable coordinates and closed invariant tori in phase space. Many motions (think coupled anharmonic oscillators, generic molecules) do not cooperate. Without a PDE for , the method stalls
-
Zero-point energy. The oscillator’s missing showed up in experiments sensitive to low-temperature fluctuations and later in field quantization. Old quantum had no universal way to assign it
6.7.10 What survived into modern quantum mechanics
- Actions and adiabatic invariants became the semiclassical WKB/EBK quantization rules
with Maslov indices supplying the missing halves
-
Correspondence principle remains a sanity check and underlies sum rules, selection rules, and semiclassical approximations
-
Einstein / coefficients and oscillator strengths live on in atomic physics, lasers, and astrophysical diagnostics
-
Quantum numbers are still the labels; they are now eigenvalue data of commuting operators rather than areas under curves
-
De Broglie’s standing waves turned into the full eigenfunction story, explaining stability and radiation probabilities from overlaps, not orbits
6.7.11 Worked mini-examples
(a) Action of the Kepler problem
For the Coulomb potential, the radial action can be computed from the periapsis–apoapsis bounds. One finds and
so depends only on , reproducing and degeneracy in that wave mechanics later lifts via spin–orbit coupling and Lamb shifts
(b) Oscillator’s missing half
Given , yields . WKB with turning points adds a phase at each boundary, shifting the rule to and fixing
(c) Dipole selection
Take a classical circular orbit and expand its dipole moment into Fourier modes at multiples of the orbital frequency. The strongest harmonic is the first, mirroring and nearest-neighbor transitions in the large- limit
6.7.12 Problem kit
- Starting from , derive for the harmonic oscillator and compare to the WKB-corrected
- For Coulomb motion, compute and and show depends only on
- Use Einstein’s rate equations to re-derive and extract and in terms of
- Show how Kramers’ dispersion reduces to the classical Lorentz model when and transitions crowd densely
- Identify one system where variables are not separable and explain why Bohr–Sommerfeld quantization stalls without wave mechanics
In summary: Old quantum theory was a brilliant stopgap. It guessed the right invariants to quantize, respected a correspondence to classical physics, and even anticipated stimulated emission. But without wavefunctions, operators, and spin, it could not compute intensities, multi-electron spectra, or magnetic fine print. The successes became the scaffolding for Heisenberg’s matrices and Schrödinger’s waves; the failures marked the exact spots where the full theory had to grow
6.8 Spin, Stern–Gerlach, and Exclusion
Atoms were shouting a secret in the 1910s–20s: angular momentum was coming in half-units and electron seats were limited. Two experiments and one principle sealed the deal—Stern–Gerlach split neutral beams into discrete tracks, spin explained “anomalous” Zeeman splittings, and Pauli’s exclusion principle organized the periodic table and built the stability of matter. This section introduces spin as a bona fide quantum degree of freedom, its magnetic moment, how fields act on it, and why “no two identical fermions in the same state” is the rule holding up chemistry and white dwarfs.
6.8.1 Stern–Gerlach: space quantization goes brrr
In 1922, Otto Stern and Walther Gerlach sent a beam of neutral silver atoms through a non-uniform magnetic field. Classically, the component of magnetic moment along the field gradient would vary continuously, giving a smear on the screen. Instead they saw two sharp spots. Conclusion: the -component of angular momentum is quantized.
Silver was clutch because its valence structure leaves a single unpaired electron dominating the atom’s magnetic moment while the atom remains neutral (no Lorentz deflection). The force is
and the observed two-way split screams that only takes two values. That is spin shouting “.”
6.8.2 Spin- algebra and eigenvalues
Quantum mechanically, spin components do not commute:
For an electron, the total spin and its -projection are
A convenient matrix representation uses the Pauli matrices with :
Spin states are two-component spinors. Rotations act via these matrices, and measurement along any axis picks one of two outcomes.
6.8.3 Magnetic moment, Zeeman splitting, and Larmor precession
The electron’s spin carries a magnetic moment
where from Dirac theory (with small QED corrections). Define the Bohr magneton
In a uniform magnetic field , the interaction energy is
so a spin- splits into two levels separated by
Classically flavored but quantum-exact, spins also precess about at the Larmor frequency
This underlies magnetic resonance (NMR/EPR) and shows up as beat frequencies in time-dependent spin dynamics.
6.8.4 Normal vs “anomalous” Zeeman: spin to the rescue
Old spectroscopy found that many lines in a magnetic field split into patterns beyond the normal Zeeman triplet. The fix is the total angular momentum
with a level-specific Landé factor . The Zeeman energy shift is
where . Spin explains the “anomalous” patterns immediately: comes out of vector-coupling algebra and selection rules handle line multiplicities and polarizations. No epicycles required.
6.8.5 Spin–orbit coupling and Thomas factor
In atoms, an electron moving in the nuclear field sees, in its rest frame, a magnetic field that couples and :
A careful relativistic treatment adds the Thomas precession factor , bringing predictions in line with observed fine structure. The result: terms split by even when and are fixed, e.g., the famous doublets in alkali spectra.
6.8.6 Pauli’s exclusion principle: the two-per-seat rule
In 1925 Wolfgang Pauli proposed: no two identical fermions can occupy the same quantum state. For electrons in atoms that means at most two per spatial orbital, with opposite spins. Label a one-electron state by ; exclusion forbids duplicate quadruples. Immediate wins:
- Periodic table architecture. Shells and subshells fill in the observed order (with correlation and screening tweaks), explaining chemical valence. Two per orbital gives the shell capacities
- Magnetism. Most electrons are spin-paired; unpaired spins produce paramagnetism. In metals, Pauli paramagnetism of conduction electrons is weak and temperature-independent compared to Curie’s law for localized moments
- Stability of matter. Electrons cannot all collapse into the lowest orbital; packed fermions build degeneracy pressure that counters compression, from metals’ mechanical properties to white dwarf support
Under the hood, exclusion is a theorem in relativistic quantum field theory: spin–statistics. Particles with half-integer spin (fermions) have antisymmetric many-body wavefunctions and obey Fermi–Dirac statistics; integer-spin particles (bosons) are symmetric and obey Bose–Einstein statistics.
6.8.7 Fermi–Dirac distribution and the Fermi sea
For identical fermions with single-particle energies and chemical potential ,
At , states fill up to the Fermi energy . The factor of 2 for spin- doubles the density of states per energy, shifting and controlling metallic properties. Pauli paramagnetic susceptibility in a simple metal scales with the density of states at —tiny but nonzero.
6.8.8 Sequential Stern–Gerlach (the measurement vibes)
Run a beam through three magnets: SG (keep only ), then SG (split into and ; keep ), then SG again. The last stage shows both and . Lesson: measuring along re-prepares the spin state and erases previous information. Algebraically,
so a projective measurement in randomizes outcomes. This is the cleanest tabletop demo that incompatible observables do not share simultaneous sharp values.
6.8.9 Electron beams and why Stern–Gerlach used neutrals
You might ask: why not a spin split with electrons? Free electrons in inhomogeneous fields feel huge Lorentz forces and electric-field forces from fringe fields, which swamp the tiny splitting. Neutral atoms dodge that, so Stern–Gerlach reached the quantum without EM steering drama.
6.8.10 Tiny but real deviations:
Dirac’s is insanely accurate but not perfect. Quantum electrodynamics adds a small anomalous magnetic moment
whose leading term is and higher orders keep theorists busy. Precision measurements of are stress tests for the Standard Model and help pin the fine-structure constant .
6.8.11 Worked mini-examples
(a) Zeeman splitting size
For and ,
Corresponding frequency , i.e., microwave EPR territory
(b) SG deflection estimate
With gradient , flight time , and mass of Ag atom, the -kick is
Translate to a spot separation on a screen a few tens of cm away to see that mm-scale splits are reasonable
(c) White dwarf scale (order-of-magnitude)
Electron degeneracy pressure balances gravity; pushing up increases without heat—exclusion is literally structural support. Details live in astrophysics, but the scaling rests on spin- plus antisymmetry
6.8.12 Problem kit
- Derive the two-outcome split of Stern–Gerlach using eigenvalues and estimate spot separation for given field gradient and flight time
- Starting from and , show that spinors precess with
- Using vector addition rules, compute for a given term symbol and predict Zeeman components and polarizations
- Show how splits a given multiplet into and sketch selection-rule-allowed lines
- From the Fermi–Dirac distribution, derive the zero-temperature electron number density and Fermi energy of a free-electron metal including spin degeneracy
In summary: Spin is not a tiny orbit; it is an intrinsic two-level degree of freedom with algebra, matrices, and a magnetic moment. Stern–Gerlach revealed its discreteness, Zeeman patterns and fine structure confirmed its coupling to fields and orbital motion, and Pauli’s exclusion principle turned spin into architecture—filling shells, setting chemistry, and propping up stars. Add in Fermi–Dirac statistics and , and you have the modern, compact story behind a century of “anomalies” that were really just spin waiting to be named
6.9 From Old Quantum to Wave/Matrix Mechanics
By 1925, the “old quantum” patches were running out of duct tape. Selection rules worked, spectra mostly matched, but intensities, multi-electron messiness, and the logic behind the rules begged for a fresh OS. Two different apps booted almost simultaneously: Heisenberg’s matrix mechanics (with Born and Jordan) and Schrödinger’s wave mechanics. Same physics, wildly different vibes. By 1926, they were proved equivalent, and quantum theory finally had a consistent calculus.
6.9.1 From dispersion to Umdeutung
Heisenberg’s leap started with optical dispersion. Classical electrons driven by light oscillate and emit at combination frequencies. Old quantum theory rephrased this via transition frequencies between stationary states, , and amplitudes linked to line strengths. Heisenberg’s “Umdeutung” (reinterpretation) was to ditch unobservable electron orbits and keep only observable transition data, organizing them into arrays with rows and columns indexed by energy levels. Composition of motions became matrix multiplication. Out went trajectories; in came algebras of transitions.
6.9.2 Matrix mechanics in one page
Observables are matrices with elements in the energy eigenbasis. Time dependence carries the Bohr frequencies:
The product rule that respects composition of transitions is matrix multiplication. The Newtonian equation becomes, after the dictionary , ,
Time derivatives act elementwise via the phases, and one demands consistency with the correspondence principle at large quantum numbers. The crucial structure constant is the canonical commutator
This is not an optional spice; it is the algebraic heartbeat that replaces Poisson brackets. With it, the harmonic oscillator, rotor, and hydrogen spectrum drop out in purely algebraic fashion.
6.9.3 Uncertainty from noncommutation
Once is law, spreads of incompatible observables cannot both be squeezed to zero. The general inequality reads
Take and to get the famous bound
This is not about measurement sloppiness; it is geometry in Hilbert space. The bound follows from Cauchy–Schwarz and the operator algebra, independent of any postulated “disturbance.”
6.9.4 Schrödinger’s wave mechanics
Schrödinger arrived from the opposite coast: waves. Starting with de Broglie’s , , require a wave equation whose plane-wave solutions carry the correct dispersion. The result is the time-dependent Schrödinger equation
Looking for stationary states yields the time-independent form
Boundary conditions discretize and generate the spectral lines Bohr had cataloged.
6.9.5 Born’s rule and the meaning of
Max Born supplied the interpretive key in 1926: is a probability density. Expectation values are integrals with that weight. For a normalized state,
and for an operator acting on wavefunctions,
Probability current and continuity follow from Schrödinger’s equation, making the probabilistic reading self-consistent.
6.9.6 Operators and the canonical dictionary
Wave mechanics uses differential operators that mirror the matrix commutators:
These satisfy
so the uncertainty relation and algebra of observables match matrix mechanics. Potentials become multiplication operators ; kinetic energy becomes . The correspondence principle is codified as “replace Poisson brackets by times commutators” in the classical–quantum dictionary.
6.9.7 Equivalence: showing the two apps are the same OS
Schrödinger himself showed that wave mechanics and matrix mechanics are unitarily equivalent. In the basis of energy eigenfunctions ,
That integral is exactly the matrix element used by Heisenberg–Born–Jordan. Conversely, given a matrix one builds by its action on expansion coefficients. Same spectra, same transition amplitudes, same selection rules—different clothes. Dirac later packaged this as bra–ket notation and spectral theory, but the core equivalence is already here.
6.9.8 Worked snapshots: harmonic oscillator and hydrogen
Harmonic oscillator. In matrix mechanics, define ladder operators
which satisfy
The Hamiltonian becomes
giving eigenvalues
Wave mechanics solves the differential equation and gets the same ladder and energies; Hermite polynomials appear, but the spectrum is identical.
Hydrogen. In wave mechanics, separability in spherical coordinates produces radial equations with bound-state quantization
Degeneracies match the old -only dependence. In matrix mechanics, one uses hidden Runge–Lenz symmetry to reproduce the same spectrum algebraically.
6.9.9 Ehrenfest’s theorem and the classical limit
Quantum expectation values obey equations that mirror classical motion:
This is Ehrenfest’s theorem. When the wave packet is narrow and the potential is smooth, , and the centroid follows Newton. That is the correspondence principle rephrased as dynamics.
6.9.10 Stationary states, transitions, and selection rules
In wave language, stationary states are energy eigenfunctions. Time dependence is a global phase , so expectation values in a stationary state are time-independent. Driving the system with a weak periodic field and using time-dependent perturbation theory yields transition rates proportional to matrix elements and a resonance at . The dipole operator enforces selection rules like , matching old empirical rules while giving the intensities from first principles.
6.9.11 Commutators, Poisson brackets, and quantization rules
The formal bridge from classical mechanics uses
and promotes canonical variables to operators obeying . Ordering ambiguities appear when turning nonlinear classical expressions into operators; early prescriptions by Born–Jordan and Weyl tried to systematize this. In practice, symmetries and experimental checks decide the correct operator form. For quadratic Hamiltonians and central potentials, the naive promotion works cleanly.
6.9.12 Variational and spectral viewpoints
Schrödinger also wrote a variational principle: stationary states extremize the functional
Minimizing under normalization yields the lowest eigenvalue; constraining orthogonality gives excited states. This is the ancestor of modern Rayleigh–Ritz methods central to atomic, molecular, and solid-state calculations.
6.9.13 Continuum, scattering, and phase shifts
Not all spectra are discrete. For energies above binding, solutions are scattering states normalized to delta functions in energy or momentum. In 3D with short-range , asymptotic wavefunctions look like plane waves plus outgoing spherical waves, and the physics hides in partial-wave phase shifts . Cross sections follow from them. Matrix mechanics encodes the same data in the -matrix, whose unitarity expresses probability conservation.
6.9.14 What changed conceptually
- Observables as operators. No more hidden orbits; measurable quantities are spectra and expectation values of operators
- States as vectors. A system is a vector in a complex inner-product space, evolved by a linear equation
- Incompatibility via algebra. Noncommutation replaces classical simultaneity; uncertainty is structural, not instrumental
- Probabilities from amplitudes. Interference is about complex phases of amplitudes, not only waves in space
The old rules become corollaries of a single linear framework.
6.9.15 Minimal problem kit
- Show that implies using Cauchy–Schwarz
- Solve the 1D infinite square well with Schrödinger’s equation and extract energies and orthonormal eigenfunctions
- Build the oscillator ladder operators and prove and
- For hydrogen, separate variables in spherical coordinates and identify the three quantum numbers and their degeneracies
- Derive Fermi’s golden rule for a weak periodic perturbation and recover the dipole selection rule
In summary: Heisenberg replaced paths by algebra of transitions; Schrödinger replaced orbits by waves whose squared magnitude is probability. Born provided the measurement rule; commutators supplied the logic; the two formalisms proved equivalent and complete. With this, quantum physics leveled up from clever hacks to a unified theory that predicts spectra and dynamics, computes intensities, and explains why “classical” is just a special zoomed-out limit. Chapter 7 will take this machinery for a spin, solving canonical systems and building the perturbation toolbox that powers modern physics and chemistry