tonesmandarinpronunciationbeginners

The 4 Tones of Mandarin Chinese — A Visual & Audio Guide

TonePerfect··6 {minutes} min read

If you have started learning Mandarin Chinese, you have already heard the warning: "Get the tones right or you will be misunderstood." That advice is absolutely true. But it is also easy to make tones sound more mysterious than they actually are.

Tones are not magic. They are pitch contours — the same kind of pitch movements you already make in English when you ask a question, contradict someone, or sound surprised. The trick is that in Mandarin those contours are part of the word itself, not just an emotional flavour on top of it.

This guide gives you a clear, audio-friendly walkthrough of all four Mandarin tones, plus the neutral tone, with concrete tips for getting each one right.

Why Mandarin tones matter

Mandarin is a tonal language. The same syllable spoken with a different tone is, for all practical purposes, a different word. The classic example uses the syllable ma:

  • (妈, T1) — mother
  • (麻, T2) — hemp
  • (马, T3) — horse
  • (骂, T4) — to scold
  • ma (吗, neutral) — a question particle

Five different meanings, one set of consonants and vowels. The only difference is the tone. If you skip tones, native listeners have to do the work of guessing, which is exactly the gap between "I understood you with effort" and "I understood you naturally."

You can experiment with this directly on the interactive pinyin chart — pick any cell, then change tones at the top and listen.

Tone 1: high level (ā)

Tone 1 is a steady, high pitch. Think of holding a single note. The voice doesn't move up or down — it just sustains.

Common mistake: English speakers tend to start too low and let the pitch fall. Stay up. If you are speaking from the bottom of your range, you are not making T1 yet.

Tip: Sing the syllable on a single high note. Once you can do that, just shorten it — that is T1.

Tone 2: rising (á)

Tone 2 starts in the middle of your range and rises to the top. It feels exactly like the way an English speaker says "What?" in surprise.

Common mistake: Rising too late. T2 should already be moving up by the time you start the vowel — it is not a flat tone followed by a quick lift at the end.

Tip: Practise saying the English word "huh?" and pay attention to the rising pitch. That contour is T2.

Tone 3: dip then rise (ǎ)

Tone 3 is the most-discussed tone for a reason: in citation form (the way a teacher pronounces it slowly) it falls and then rises again. It feels like saying "Really?" with a hint of disbelief.

In real, fast speech, T3 almost never appears with the full dip-rise. Native speakers typically pronounce it as just a low tone, and only fully realise the rise when the syllable is final or emphasised.

Common mistake: Producing the full dip-rise on every T3 syllable in a sentence. It will sound textbook-correct but unnatural.

Tip: Start by learning the citation form so you have a clear contrast with the others, then relax it to a low tone in connected speech.

There is also a tone-change rule called third-tone sandhi: when two T3 syllables come back to back, the first one becomes T2. So 你好 (nǐ hǎo) is actually pronounced "ní hǎo." We've written a deeper explanation in our third-tone sandhi guide.

Tone 4: falling (à)

Tone 4 is a sharp drop from high to low. It feels decisive — like saying "No!" or "Stop!" in English.

Common mistake: Underdoing the fall. If you stop halfway down, T4 ends up sounding like a flat low tone, which is too close to T3.

Tip: Imagine slamming a book shut on the syllable. The pitch should land firmly at the bottom.

The neutral tone

The neutral tone is unstressed, short, and pronounced without a clear contour. Its actual pitch depends on the tone of the syllable before it. You will see it on grammatical particles (吗, 了, 的, 们) and on the second syllable of many common compounds (爸爸 bàba, 妈妈 māma).

Common mistake: Forcing a tone onto a syllable that should be neutral. If a textbook gives you 妈妈 with a tone mark only on the first syllable, the second syllable really is light and floppy. Do not try to give it weight.

How to practise tones effectively

Listening to tones is not the same as producing them. Here is a four-step routine that actually moves the needle:

  1. Listen first, then mimic immediately. Always hear the target before you try to produce it. Do not read a tone mark and guess.
  2. Compare your voice to the model. Record yourself, then play the model and your attempt back-to-back. The differences become obvious.
  3. Drill in pairs, not singles. The hard part of tones is contrast. Practising "mā" and "má" alternately is more useful than practising "mā" thirty times.
  4. Get specific feedback. Native ears or AI feedback can tell you which segment of the tone is wrong (start, middle, end). General "your tones are a bit off" comments are not actionable.

That last point is exactly what the TonePerfect app is built for. It records you saying each syllable, scores your tone separately from your initial and final, and points to the segment of the contour where you went wrong. You can take the free 2-minute pronunciation test to see how your tones land.

Summary

  • Mandarin has 4 lexical tones plus a neutral tone.
  • Tones are pitch contours, not stress or volume.
  • The hard part is not producing one tone — it is contrasting them in connected speech.
  • The fastest way to improve is targeted, repeated comparison with feedback.

Once you can hear and produce all four tones reliably, the rest of Mandarin pronunciation becomes much easier. Tones are the gatekeeper. The good news is they are perfectly learnable. Open the pinyin chart or the app and start drilling.

Want to perfect your Chinese pronunciation?

TonePerfect uses AI to analyze your tones, initials, and finals — giving you instant, detailed feedback.