General Chemistry

Empirical vs Molecular Formula

The simplest ratio vs the real count

An empirical formula is the simplest whole-number ratio of the atoms in a compound, while a molecular formula is the actual number of each atom in one molecule. The two are tied together by a single whole number n: molecular formula = empirical formula × n, where n = molar mass ÷ empirical formula mass. Glucose, for example, has the molecular formula C₆H₁₂O₆ but the empirical formula CH₂O (the ratio 6:12:6 reduces to 1:2:1, so n = 6). You get the empirical formula from percent composition — convert each element's mass percent to moles and divide by the smallest — and you get the molecular formula by comparing the empirical formula mass to a measured molar mass. Empirical formulas dominate ionic and network solids (NaCl, SiO₂), where no discrete molecule exists.

  • EmpiricalSimplest whole-number atom ratio
  • MolecularActual atoms per molecule
  • LinkMolecular = empirical × n
  • Finding nn = molar mass ÷ empirical mass
  • GlucoseCH₂O (emp.) → C₆H₁₂O₆ (mol.), n = 6
  • BenzeneCH (emp.) → C₆H₆ (mol.), n = 6

Interactive visualization

Press play, or step through manually. The visualization is yours to drive — try it before reading on.

Open visualization fullscreen ↗

Watch the 60-second explainer

A condensed visual walkthrough — narrated, captioned, under a minute.

Two formulas, one compound

Chemists describe a compound with several layers of formula, and the two most commonly confused are the empirical formula and the molecular formula. The empirical formula answers a question about ratio: in what simplest whole-number proportion do the elements combine? The molecular formula answers a question about count: how many of each atom actually sit in one molecule? For glucose the answers diverge — its molecules contain 6 carbons, 12 hydrogens and 6 oxygens, so the molecular formula is C₆H₁₂O₆, but those subscripts all divide by 6, giving the empirical formula CH₂O.

The relationship is exact and always a whole number. If you let n be that integer, then for every element the molecular subscript equals the empirical subscript times n. Equivalently:

molecular formula = (empirical formula)n   where   n = molar mass ÷ empirical formula mass

Because n is an integer, the molecular formula never carries information the empirical formula plus a molar mass cannot reconstruct. That is precisely why analytical chemistry splits the identification of an unknown into two independent measurements: one experiment fixes the ratio (the empirical formula) and a second, completely different experiment fixes the size (the molar mass, and hence n).

From percent composition to empirical formula

The classic route to an empirical formula starts with percent composition — the mass fraction of each element, which an elemental analyzer reports directly. The procedure is mechanical:

  • Assume 100 g. Each percent becomes a mass in grams, so 40.0% C means 40.0 g of carbon.
  • Convert mass to moles. Divide each mass by the element's molar mass: 40.0 g C ÷ 12.011 g/mol = 3.33 mol C.
  • Divide by the smallest. This normalizes the moles into a ratio. The smallest mole count becomes 1.
  • Clear fractions. If a ratio lands near 1.5, 1.33 or 1.25, multiply all values by 2, 3 or 4 respectively to reach whole numbers.

Worked example: a compound is 40.0% C, 6.7% H and 53.3% O by mass. In 100 g there are 3.33 mol C, 6.64 mol H and 3.33 mol O. Dividing by 3.33 gives 1 : 1.99 : 1, which rounds cleanly to 1 : 2 : 1. The empirical formula is therefore CH₂O — and notice that this single ratio is shared by formaldehyde, acetic acid, lactic acid and glucose. Percent composition alone cannot tell them apart; it has fixed the ratio but not the count.

Finding n: the molar-mass step

To go from CH₂O to a real compound you need the molar mass, measured independently. Mass spectrometry reads the molecular-ion mass directly; for volatile or dissolved substances, colligative methods (freezing-point depression, osmotic pressure) or gas-density measurements work too. Once you have the molar mass M, compute the empirical formula mass and divide:

empirical mass of CH₂O = 12.01 + 2(1.008) + 16.00 = 30.03 g/mol

If the analyzer reports M = 180.16 g/mol, then n = 180.16 ÷ 30.03 = 6.00, so the molecular formula is (CH₂O)₆ = C₆H₁₂O₆, glucose. If instead M came back as 60.05 g/mol, n = 2 and the answer is C₂H₄O₂, acetic acid. The same empirical formula, two different molecules, distinguished only by the second measurement. Rounding matters here: n must be a whole number, and a value like 5.9 or 6.1 should be read as 6 — small deviations come from rounding atomic masses and experimental error, not from a fractional molecule.

A side-by-side comparison

CompoundEmpirical formulaMolecular formulaEmpirical mass (g/mol)Molar mass (g/mol)n
WaterH₂OH₂O18.0218.021
Carbon dioxideCO₂CO₂44.0144.011
Hydrogen peroxideHOH₂O₂17.0134.012
BenzeneCHC₆H₆13.0278.116
AcetyleneCHC₂H₂13.0226.042
GlucoseCH₂OC₆H₁₂O₆30.03180.166
Acetic acidCH₂OC₂H₄O₂30.0360.052
HydrazineNH₂N₂H₄16.0232.052

Two patterns jump out. First, benzene and acetylene share the empirical formula CH, yet they are wildly different substances — an aromatic liquid versus a welding gas — because n differs. Second, water and carbon dioxide have n = 1: their subscripts are already in lowest terms (1 and 2 share no common factor), so empirical and molecular formulas coincide.

Combustion analysis: the experiment behind the ratio

For organic compounds the empirical formula is usually obtained by combustion analysis. A precisely weighed sample is burned in a stream of excess oxygen so that every carbon atom ends up as CO₂ and every hydrogen atom as H₂O. The exhaust passes through absorbers — a desiccant traps the water, a base such as soda lime traps the CO₂ — and the mass gain of each absorber is measured.

  • Carbon: moles C = (mass CO₂ ÷ 44.01) × 1, since each CO₂ carries one C.
  • Hydrogen: moles H = (mass H₂O ÷ 18.02) × 2, since each H₂O carries two H.
  • Oxygen (and other elements): found by difference — subtract the masses of C and H from the original sample mass; whatever remains is the oxygen mass.

Suppose 0.250 g of a compound burns to give 0.366 g CO₂ and 0.150 g H₂O. The CO₂ contains 0.366 ÷ 44.01 = 8.32 × 10⁻³ mol C (0.0999 g C); the H₂O contains 2 × (0.150 ÷ 18.02) = 0.0166 mol H (0.0168 g H). Oxygen by difference: 0.250 − 0.0999 − 0.0168 = 0.133 g, or 8.33 × 10⁻³ mol O. The mole ratio C : H : O is 8.32 : 16.6 : 8.33 ≈ 1 : 2 : 1, giving CH₂O once more. Combustion analysis is the workhorse that turned nineteenth-century organic chemistry from guesswork into a quantitative science, and refined versions still calibrate modern CHN elemental analyzers.

Ionic and network solids: empirical formulas only

For molecular substances the molecular formula is the "real" description, but a huge class of materials has no molecular formula at all. Ionic compounds such as sodium chloride are extended lattices: there is no discrete NaCl "molecule," only a repeating array in which each Na⁺ is surrounded by six Cl⁻ and vice versa. The formula NaCl is therefore strictly an empirical formula — the simplest cation-to-anion ratio that keeps the crystal electrically neutral. The same is true of network covalent solids: quartz is written SiO₂ even though a single grain contains Avogadro-scale numbers of Si and O atoms bonded into one continuous framework.

This is why a "formula unit" rather than a "molecule" is the counting entity for ionic compounds. Calcium chloride is CaCl₂ because charge balance demands two Cl⁻ per Ca²⁺; iron(III) oxide is Fe₂O₃ for the same reason. The empirical formula is not a simplification here — it is the only formula that exists.

Why ratio alone is never enough

The deepest lesson is that an empirical formula is an underdetermined description. It pins down composition but not molecular size, and it says nothing about connectivity. CH₂O is shared by at least four common compounds spanning molar masses from 30 to 180 g/mol; CH is shared by benzene, acetylene and many polycyclic aromatics. Even the molecular formula can be ambiguous: C₂H₆O is both ethanol (CH₃CH₂OH) and dimethyl ether (CH₃OCH₃), two isomers with the same atom count but different structures and properties. To name a compound unambiguously you climb the ladder — empirical formula fixes the ratio, molecular formula fixes the count, and a structural formula fixes the arrangement. Each step needs its own experiment: elemental analysis for the ratio, mass spectrometry for the molar mass, and spectroscopy (NMR, IR) or diffraction for the structure.

Frequently asked questions

What is the difference between empirical and molecular formula?

An empirical formula gives the simplest whole-number ratio of atoms in a compound; a molecular formula gives the actual number of each atom in one molecule. Glucose has the molecular formula C₆H₁₂O₆ but the empirical formula CH₂O (dividing 6:12:6 by 6 gives 1:2:1). The molecular formula is always a whole-number multiple of the empirical formula: molecular = empirical × n.

How do you find an empirical formula from percent composition?

Assume a 100 g sample so each percent becomes grams. Divide each element's mass by its atomic mass to get moles. Divide every mole value by the smallest of them to get the ratio. If the ratios are not whole numbers, multiply all of them by a small integer (e.g., 2 for 1.5, 3 for 1.33). Example: 40.0% C, 6.7% H, 53.3% O gives 3.33 : 6.6 : 3.33 mol, which divides to 1 : 2 : 1, so the empirical formula is CH₂O.

How do you find the molecular formula from the empirical formula?

You need the compound's molar mass, usually from mass spectrometry or a colligative-property measurement. Compute the empirical formula mass, then n = molar mass ÷ empirical formula mass, rounded to the nearest whole number. Multiply every subscript in the empirical formula by n. For glucose: empirical CH₂O has mass 30.03 g/mol; the measured molar mass is 180.16 g/mol; n = 180.16 ÷ 30.03 ≈ 6, so the molecular formula is C₆H₁₂O₆.

Can the empirical and molecular formula be the same?

Yes. When the molecular formula is already in lowest terms, n = 1 and the two formulas are identical. Water (H₂O), carbon dioxide (CO₂), ammonia (NH₃) and methane (CH₄) are all cases where the empirical and molecular formulas coincide. Subscripts that share no common factor — like 1 and 2 in H₂O — cannot be reduced further.

Why can different compounds share the same empirical formula?

Because the empirical formula only fixes the ratio, not the size or arrangement of the molecule. CH₂O is the empirical formula of formaldehyde (CH₂O, molar mass 30 g/mol), acetic acid (C₂H₄O₂, 60 g/mol), glyceraldehyde (C₃H₆O₃, 90 g/mol) and glucose (C₆H₁₂O₆, 180 g/mol). They differ in molar mass, structure and chemistry, so molecular formula plus a structural formula is needed to identify a compound uniquely.

What is combustion analysis used for?

Combustion analysis burns a known mass of an organic compound in excess oxygen and traps the products to find its empirical formula. All carbon becomes CO₂ and all hydrogen becomes H₂O, which are absorbed and weighed. The CO₂ mass gives moles of C, the H₂O mass gives moles of H, and oxygen is found by difference from the original sample mass. The mole ratio yields the empirical formula; a separate molar-mass measurement then gives the molecular formula.