cmu-pronouncing-dictionary

The 134,000+ words and their pronunciations in the CMU pronouncing dictionary

Usage (no npm install needed)

<script type="module">
  import { dictionary } from 'https://cdn.skypack.dev/cmu-pronouncing-dictionary';
</script>

The CMU Pronouncing Dictionary (also known as cmudict) is a public domain pronouncing dictionary created by Carnegie Mellon University (CMU). It defines a mapping from English words to their North American pronunciations, and is commonly used in speech processing applications.

Crawled from cmusphinx/cmudict.

Install

This package is ESM only: it requires Node 12+ and must be loaded with import, not require.

npm:

npm install cmu-pronouncing-dictionary

Use

import { dictionary } from 'cmu-pronouncing-dictionary'

console.log(dictionary)

Yields:

{
  a: 'AH0',
  'a(1)': 'EY1',
  "a's": 'EY1 Z',
  'a.': 'EY1',
  "a.'s": 'EY1 Z',
  'a.s': 'EY1 Z',
  a42128: 'EY1 F AO1 R T UW1 W AH1 N T UW1 EY1 T',
  aa: 'EY2 EY1',
  aaa: 'T R IH2 P AH0 L EY1',
  aaberg: 'AA1 B ER0 G',
  aachen: 'AA1 K AH0 N',
  aachener: 'AA1 K AH0 N ER0',
  aah: 'AA1',
  aaker: 'AA1 K ER0',
  aaliyah: 'AA2 L IY1 AA2',
  aalseth: 'AA1 L S EH0 TH',
  aamodt: 'AA1 M AH0 T',
  aancor: 'AA1 N K AO2 R',
  // …and many more
}
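Looking up a single word is a plain property access. The sketch below is one way to wrap it, not something the package provides: lookup is a hypothetical helper, and it lowercases the query on the assumption (drawn only from the sample output above) that keys are stored in lowercase. A small inline sample stands in for the real dictionary so the snippet runs on its own.

```javascript
// A few entries copied from the sample output above. In real use, replace
// this object with: import { dictionary } from 'cmu-pronouncing-dictionary'
const sample = {
  a: 'AH0',
  aachen: 'AA1 K AH0 N',
  aah: 'AA1'
}

// Hypothetical helper (not part of the package): returns the transcription
// for a word, or undefined when the word is not listed.
function lookup(dict, word) {
  // Assumption: keys are lowercase, as in the sample output.
  const key = word.trim().toLowerCase()
  // Own-property check so inherited names such as "constructor" do not match.
  return Object.prototype.hasOwnProperty.call(dict, key) ? dict[key] : undefined
}

console.log(lookup(sample, 'Aachen')) // AA1 K AH0 N
console.log(lookup(sample, 'zzz')) // undefined
```

The own-property check matters because the dictionary is a plain object, so a bare dict[key] would also return inherited properties.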

API

This package exports one identifier: dictionary. There is no default export.

dictionary

Object.<string> — Map of English words to ARPABET phonetic transcription codes.
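Each value is a space-separated string of codes, and in the entries shown in this document the vowel codes end in a stress digit (0, 1, or 2). As a rough sketch under that assumption, a transcription can be split into phonemes and the stressed codes counted to estimate syllables. Both parseTranscription and countSyllables are hypothetical helpers, not part of the package.

```javascript
// Hypothetical helper: splits a transcription such as 'AA1 K AH0 N' into
// phonemes, separating the trailing stress digit when there is one.
function parseTranscription(transcription) {
  return transcription.trim().split(/\s+/).map((code) => {
    // Assumption: a code is uppercase letters plus an optional digit 0-2.
    const match = /^([A-Z]+)([0-2])?$/.exec(code)
    if (!match) throw new Error('Unexpected code: ' + code)
    return {
      phoneme: match[1],
      stress: match[2] === undefined ? null : Number(match[2])
    }
  })
}

// Counts the codes that carry a stress digit. Assuming only vowels carry
// one, this equals the number of syllables.
function countSyllables(transcription) {
  return parseTranscription(transcription).filter((p) => p.stress !== null).length
}

console.log(parseTranscription('AA1 K AH0 N')[0]) // { phoneme: 'AA', stress: 1 }
console.log(countSyllables('AA1 K AH0 N')) // 2
```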

Some words have multiple possible pronunciations. The additional ones are stored under keys of the form $word($counter), like so:

{
  // …
  "unnatural": "AH0 N N AE1 CH ER0 AH0 L",
  "unnaturally": "AH0 N N AE1 CH ER0 AH0 L IY0",
  "unnaturally(2)": "AH0 N N AE1 CH ER0 L IY0",
  "unnaturally(3)": "AH0 N AE1 CH ER0 L IY0",
  "unnaturally(4)": "AH0 N N AE1 CH R AH0 L IY0",
  "unnecessarily": "AH0 N N EH1 S AH0 S EH2 R AH0 L IY0",
  "unnecessary": "AH0 N N EH1 S AH0 S EH2 R IY0",
  // …
}
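Because variants live under separate keys, collecting every pronunciation of a word means matching both the plain key and the $word($counter) keys. The following is a minimal sketch of that idea. pronunciationsOf is a hypothetical helper, and it scans all keys rather than probing specific counters, since the counter's starting value is not specified here.

```javascript
// Entries copied from the excerpt above. In real use, replace this object
// with: import { dictionary } from 'cmu-pronouncing-dictionary'
const variantsSample = {
  unnatural: 'AH0 N N AE1 CH ER0 AH0 L',
  unnaturally: 'AH0 N N AE1 CH ER0 AH0 L IY0',
  'unnaturally(2)': 'AH0 N N AE1 CH ER0 L IY0',
  'unnaturally(3)': 'AH0 N AE1 CH ER0 L IY0',
  'unnaturally(4)': 'AH0 N N AE1 CH R AH0 L IY0'
}

// Hypothetical helper (not part of the package): returns every
// transcription stored for a word, in the object's key order.
function pronunciationsOf(dict, word) {
  const base = word.toLowerCase()
  const results = []
  for (const [key, value] of Object.entries(dict)) {
    // Strip a trailing "(n)" counter, if present, to recover the base word.
    const match = /^(.*)\((\d+)\)$/.exec(key)
    const keyBase = match ? match[1] : key
    if (keyBase === base) results.push(value)
  }
  return results
}

console.log(pronunciationsOf(variantsSample, 'unnaturally').length) // 4
console.log(pronunciationsOf(variantsSample, 'unnatural').length) // 1
```

This scan is linear in the number of entries, so for repeated queries it is likely better to build an index from base word to transcriptions once and reuse it.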

License

ISC © Zeke Sikelianos