The Project Gutenberg Etext of Moby Pronunciator II by Grady Ward **** PLEASE SEE AT THE BOTTOM OF THIS PAGE INSTRUCTIONS FOR DOWNLOADING THIS FILE**** Copyright laws are changing all over the world, be sure to check the laws for your country before redistributing these files!!! Please take a look at the important information in this header. We encourage you to keep this file on your own disk, keeping an electronic path open for the next readers. Do not remove this. This should be the first thing seen when anyone opens the book. Do not change or edit it without written permission. The words are carefully chosen to provide users with the information they need about what they can legally do with the texts. **Welcome To The World of Free Plain Vanilla Electronic Texts** **Etexts Readable By Both Humans and By Computers, Since 1971** *These Etexts Prepared By Hundreds of Volunteers and Donations* Information on contacting Project Gutenberg to get Etexts, and further information is included below. We need your donations. Presently, contributions are only being solicited from people in: Texas, Nevada, Idaho, Montana, Wyoming, Colorado, South Dakota, Iowa, Indiana, and Vermont. As the requirements for other states are met, additions to this list will be made and fund raising will begin in the additional states. These donations should be made to: Project Gutenberg Literary Archive Foundation PMB 113 1739 University Ave. Oxford, MS 38655 Title: Moby Pronunciator II Author: Grady Ward, grady@gradyward.com Release Date: May, 2002 [Etext #3205] Edition: 1.0 The Project Gutenberg Etext of Moby Pronunciator II by Grady Ward ******This file should be named 3205.txt****** This etext was prepared by Mike Pullen, globaltraveler5565@yahoo.com. Project Gutenberg Etexts are usually created from multiple editions, all of which are in the Public Domain in the United States, unless a copyright notice is included. Therefore, we usually do NOT keep any of these books in compliance with any particular paper edition. We are now trying to release all our books one year in advance of the official release dates, leaving time for better editing. Please be encouraged to send us error messages even years after the official publication date. Please note: neither this list nor its contents are final till midnight of the last day of the month of any such announcement. The official release date of all Project Gutenberg Etexts is at Midnight, Central Time, of the last day of the stated month. A preliminary version may often be posted for suggestion, comment and editing by those who wish to do so. Most people start at our sites at: http://gutenberg.net/pg http://promo.net/pg Those of you who want to download our Etexts before announcment can surf to them as follows, and just download by date; this is also a good way to get them instantly upon announcement, as the indexes our cataloguers produce obviously take a while after an announcement goes out in the Project Gutenberg Newsletter. http://metalab.unc.edu/pub/docs/books/gutenberg/etext01 or ftp://metalab.unc.edu/pub/docs/books/gutenberg/etext01 Or /etext00, 99, 98, 97, 96, 95, 94, 93, 92, 92, 91 or 90 Just search by the first five letters of the filename you want, as it appears in our Newsletters. Information about Project Gutenberg (one page) We produce about two million dollars for each hour we work. The time it takes us, a rather conservative estimate, is fifty hours to get any etext selected, entered, proofread, edited, copyright searched and analyzed, the copyright letters written, etc. This projected audience is one hundred million readers. If our value per text is nominally estimated at one dollar then we produce $2 million dollars per hour this year as we release fifty new Etext files per month, or 500 more Etexts in 2000 for a total of 3000+ If they reach just 1-2% of the world's population then the total should reach over 300 billion Etexts given away by year's end. The Goal of Project Gutenberg is to Give Away One Trillion Etext Files by December 31, 2001. [10,000 x 100,000,000 = 1 Trillion] This is ten thousand titles each to one hundred million readers, which is only about 4% of the present number of computer users. At our revised rates of production, we will reach only one-third of that goal by the end of 2001, or about 3,333 Etexts unless we manage to get some real funding. Something is needed to create a future for Project Gutenberg for the next 100 years. We need your donations more than ever! Presently, contributions are only being solicited from people in: Texas, Nevada, Idaho, Montana, Wyoming, Colorado, South Dakota, Iowa, Indiana, and Vermont. As the requirements for other states are met, additions to this list will be made and fund raising will begin in the additional states. All donations should be made to the Project Gutenberg Literary Archive Foundation and will be tax deductible to the extent permitted by law. Mail to: Project Gutenberg Literary Archive Foundation PMB 113 1739 University Avenue Oxford, MS 38655 [USA] We are working with the Project Gutenberg Literary Archive Foundation to build more stable support and ensure the future of Project Gutenberg. We need your donations more than ever! You can get up to date donation information at: http://www.gutenberg.net/donation.html *** You can always email directly to: Michael S. Hart hart@pobox.com forwards to hart@prairienet.org and archive.org if your mail bounces from archive.org, I will still see it, if it bounces from prairienet.org, better resend later on. . . . We would prefer to send you this information by email. Example command-line FTP session: ftp metalab.unc.edu login: anonymous password: your@login cd pub/docs/books/gutenberg cd etext90 through etext99 or etext00 through etext01, etc. dir [to see files] get or mget [to get files. . .set bin for zip files] GET GUTINDEX.?? [to get a year's listing of books, e.g., GUTINDEX.99] GET GUTINDEX.ALL [to get a listing of ALL books] **The Legal Small Print** (Three Pages) ***START**THE SMALL PRINT!**FOR PUBLIC DOMAIN ETEXTS**START*** Why is this "Small Print!" statement here? You know: lawyers. They tell us you might sue us if there is something wrong with your copy of this etext, even if you got it for free from someone other than us, and even if what's wrong is not our fault. So, among other things, this "Small Print!" statement disclaims most of our liability to you. It also tells you how you can distribute copies of this etext if you want to. *BEFORE!* YOU USE OR READ THIS ETEXT By using or reading any part of this PROJECT GUTENBERG-tm etext, you indicate that you understand, agree to and accept this "Small Print!" statement. If you do not, you can receive a refund of the money (if any) you paid for this etext by sending a request within 30 days of receiving it to the person you got it from. If you received this etext on a physical medium (such as a disk), you must return it with your request. ABOUT PROJECT GUTENBERG-TM ETEXTS This PROJECT GUTENBERG-tm etext, like most PROJECT GUTENBERG-tm etexts, is a "public domain" work distributed by Professor Michael S. Hart through the Project Gutenberg Association (the "Project"). Among other things, this means that no one owns a United States copyright on or for this work, so the Project (and you!) can copy and distribute it in the United States without permission and without paying copyright royalties. Special rules, set forth below, apply if you wish to copy and distribute this etext under the Project's "PROJECT GUTENBERG" trademark. To create these etexts, the Project expends considerable efforts to identify, transcribe and proofread public domain works. Despite these efforts, the Project's etexts and any medium they may be on may contain "Defects". Among other things, Defects may take the form of incomplete, inaccurate or corrupt data, transcription errors, a copyright or other intellectual property infringement, a defective or damaged disk or other etext medium, a computer virus, or computer codes that damage or cannot be read by your equipment. LIMITED WARRANTY; DISCLAIMER OF DAMAGES But for the "Right of Replacement or Refund" described below, [1] the Project (and any other party you may receive this etext from as a PROJECT GUTENBERG-tm etext) disclaims all liability to you for damages, costs and expenses, including legal fees, and [2] YOU HAVE NO REMEDIES FOR NEGLIGENCE OR UNDER STRICT LIABILITY, OR FOR BREACH OF WARRANTY OR CONTRACT, INCLUDING BUT NOT LIMITED TO INDIRECT, CONSEQUENTIAL, PUNITIVE OR INCIDENTAL DAMAGES, EVEN IF YOU GIVE NOTICE OF THE POSSIBILITY OF SUCH DAMAGES. If you discover a Defect in this etext within 90 days of receiving it, you can receive a refund of the money (if any) you paid for it by sending an explanatory note within that time to the person you received it from. If you received it on a physical medium, you must return it with your note, and such person may choose to alternatively give you a replacement copy. If you received it electronically, such person may choose to alternatively give you a second opportunity to receive it electronically. THIS ETEXT IS OTHERWISE PROVIDED TO YOU "AS-IS". NO OTHER WARRANTIES OF ANY KIND, EXPRESS OR IMPLIED, ARE MADE TO YOU AS TO THE ETEXT OR ANY MEDIUM IT MAY BE ON, INCLUDING BUT NOT LIMITED TO WARRANTIES OF MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Some states do not allow disclaimers of implied warranties or the exclusion or limitation of consequential damages, so the above disclaimers and exclusions may not apply to you, and you may have other legal rights. INDEMNITY You will indemnify and hold the Project, its directors, officers, members and agents harmless from all liability, cost and expense, including legal fees, that arise directly or indirectly from any of the following that you do or cause: [1] distribution of this etext, [2] alteration, modification, or addition to the etext, or [3] any Defect. DISTRIBUTION UNDER "PROJECT GUTENBERG-tm" You may distribute copies of this etext electronically, or by disk, book or any other medium if you either delete this "Small Print!" and all other references to Project Gutenberg, or: [1] Only give exact copies of it. Among other things, this requires that you do not remove, alter or modify the etext or this "small print!" statement. You may however, if you wish, distribute this etext in machine readable binary, compressed, mark-up, or proprietary form, including any form resulting from conversion by word pro- cessing or hypertext software, but only so long as *EITHER*: [*] The etext, when displayed, is clearly readable, and does *not* contain characters other than those intended by the author of the work, although tilde (~), asterisk (*) and underline (_) characters may be used to convey punctuation intended by the author, and additional characters may be used to indicate hypertext links; OR [*] The etext may be readily converted by the reader at no expense into plain ASCII, EBCDIC or equivalent form by the program that displays the etext (as is the case, for instance, with most word processors); OR [*] You provide, or agree to also provide on request at no additional cost, fee or expense, a copy of the etext in its original plain ASCII form (or in EBCDIC or other equivalent proprietary form). [2] Honor the etext refund and replacement provisions of this "Small Print!" statement. [3] Pay a trademark license fee to the Project of 20% of the gross profits you derive calculated using the method you already use to calculate your applicable taxes. If you don't derive profits, no royalty is due. Royalties are payable to "Project Gutenberg Literary Archive Foundation" the 60 days following each date you prepare (or were legally required to prepare) your annual (or equivalent periodic) tax return. Please contact us beforehand to let us know your plans and to work out the details. WHAT IF YOU *WANT* TO SEND MONEY EVEN IF YOU DON'T HAVE TO? The Project gratefully accepts contributions of money, time, public domain etexts, and royalty free copyright licenses. If you are interested in contributing scanning equipment or software or other items, please contact Michael Hart at: hart@pobox.com *END THE SMALL PRINT! FOR PUBLIC DOMAIN ETEXTS*Ver.04.07.00*END* Moby (tm) Pronunciator II Documentation Notes This documentation, the software and/or database are: Public Domain material by grant from the author, January, 2001. Moby (tm) Pronunciator II for MSDOS operating systems is compressed and distributed as a single zip file. After decompression the hyphanation file included with this product is in ordinary ASCII format with CRLF (ASCII 13/10) delimiters. MOBY Pronunciator II CONTENTS Read Me First File (aaREADME.txt) Acknowledgments (abreadme.txt) CMU Dictionary list (cmudict.txt) Pronunciation List (mpron.txt) Phone Set List (phoneset.txt) Quick Start 1) Insure you have at least 9Mb of free disk space to hold the contents of this zip file. 2) Create a directory to hold these files listed above. 3) Extract the contents of this zip file into the destination directory using any compatible zip file extraction utility. 4) Delete the original zip file from your disk to save space. (optional) LEGEND Each pronunciation vocabulary entry consists of a word or phrase field followed by a field delimiter of space " " and the IPA-equivalent field that is coded using the following ASCII symbols (case is significant). Spaces between words in the word or phrase or pronunciation field is denoted with underbar "_". /&/ sounds like the "a" in "dab" /(@)/ sounds like the "a" in "air" /A/ sounds like the "a" in "far" /eI/ sounds like the "a" in "day" /@/ sounds like the "a" in "ado" or the glide "e" in "system" (dipthong schwa) /-/ sounds like the "ir" glide in "tire" or the "dl" glide in "handle" or the "den" glide in "sodden" (dipthong little schwa) /b/ sounds like the "b" in "nab" /tS/ sounds like the "ch" in "ouch" /d/ sounds like the "d" in "pod" /E/ sounds like the "e" in "red" /i/ sounds like the "e" in "see" /f/ sounds like the "f" in "elf" /g/ sounds like the "g" in "fig" /h/ sounds like the "h" in "had" /hw/ sounds like the "w" in "white" /I/ sounds like the "i" in "hid" /aI/ sounds like the "i" in "ice" /dZ/ sounds like the "g" in "vegetably" /k/ sounds like the "c" in "act" /l/ sounds like the "l" in "ail" /m/ sounds like the "m" in "aim" /N/ sounds like the "ng" in "bang" /n/ sounds like the "n" in "and" /Oi/ sounds like the "oi" in "oil" /A/ sounds like the "o" in "bob" /AU/ sounds like the "ow" in "how" /O/ sounds like the "o" in "dog" /oU/ sounds like the "o" in "boat" /u/ sounds like the "oo" in "too" /U/ sounds like the "oo" in "book" /p/ sounds like the "p" in "imp" /r/ sounds like the "r" in "ire" /S/ sounds like the "sh" in "she" /s/ sounds like the "s" in "sip" /T/ sounds like the "th" in "bath" /D/ sounds like the "th" in "the" /t/ sounds like the "t" in "tap" /@/ sounds like the "u" in "cup" /@r/ sounds like the "u" in "burn" /v/ sounds like the "v" in "average" /w/ sounds like the "w" in "win" /j/ sounds like the "y" in "you" /Z/ sounds like the "s" in "vision" /z/ sounds like the "z" in "zoo" Stress or emphasis is marked in the data with the primary "'" or secondary "," marks: "'" (uncurled apostrophe) marks primary stress "," (comma) marks secondary stress. Moby Pronunciator contains many common names and phrases borrowed from other languages; special sounds include (case is significant): "A" sounds like the "a" in "ami" "N" sounds like the "n" in "Francoise" "R" sounds like the "r" in "Der" /x/ sounds like the "ch" in "Bach" /y/ sounds like the "eu" in "cordon bleu" "Y" sounds like the "u" in "Dubois" Words and Phrases adopted from languages other than English have the unaccented form of the roman spelling. For example, "etude" has an initial accented "e" but is spelled without the accent in the Moby Pronunciator II database. Each two-part vocabulary record is delimited from others with CRLF (ASCII 13/10). SPECIAL FEATURE OF THIS LEXICON: several hundred words pronounced differently because of their part-speech have been distinguished. For example, the entries: close/v kl/oU/z and close/aj kl/oU/s (terminal sibilant varies) or effect/n '/I/,f/E/kt and effect/v ,/I/'f/E/kt (stress varies) distinguish those two parts of speech. (Any word with this information will terminate with the virgule (slash) in the vocabulary field, followed by one or more of the following part-of-speech abbreviations: n, v, av, aj, interj, followed by the rest of the pronunciation record. Acknowledgements: Date: 9-15-93 This directory contains a pronunciation dictionaries (cmudict.txt is the most up-to-date) containing approximately 100k words and their transcriptions. We use these dictionaries at CMU in our speech understanding systems. The phone set for this dictionary contains 39 phones, which can be found in phoneset.txt. Stress is indicated by means of a numeral [012] attached to a vowel: 0 = no stress 1 = primary stress 2 = secondary stress Alternate transcriptions are identified with a numeral in parentheses as part of the lexical entry. We generated this dictionary using the following independent sources: - a 20k+ general English dictionary, built by hand at CMU (extensively proofed and used). - a 200k+ UCLA-proofed version of the shoup dictionary. - a 32k subset of the Dragon dictionary. - a 53k+ dictionary of proper names, synthesiser-generated, unproofed. - a 200k dictionary generated with Orator, unproofed. - a 200k dictionary generated with Mitalk, unproofed. All entries that occur solely in copyrighted sources, like the Dragon dictionary, are not currently included in this dictionary. If you have words and transcriptions that you would like included in this unrestricted resource, please send them to Robert L. Weide (weide@cs. cmu.edu) and we will consider them for an upcoming version. All of the above sources were preprocessed and the transcriptions in the current cmudict.0.1 were selected from the transcriptions in the sources or a combination thereof. We have removed some potentially unreliable transcriptions from this dictionary, including those based on only one source, and will reintroduce them once we have verified the transcriptions. CMU does not guarantee the accuracy of this dictionary, nor its suitablity for any specific purpose. In fact, we expect a number of errors, omissions and inconsistencies to remain in the current result. We intend to continually update the dictionary as we make progress in correcting them. We will make subsequent versions available via anonymous ftp, and those who would like notification when updated versions are available should send email to weide@cs.cmu.edu. We welcome input from users: send e-mail to Robert L. Weide (weide@cs. cmu.edu) for comments and suggestions on the content of the dictionary, or to Peter Jansen (pjj@cs.cmu.edu) for questions regarding the combination process. The Carnegie Mellon Pronouncing Dictionary [cmudict.0.1] is Copyright 1993 by Carnegie Mellon University. Use of this dictionary, for any research or commercial purpose, is completely unrestricted. If you make use of or redistribute this material, we would appreciate acknowlegement of its origin. Finally, if you add words to or correct words in this dictionary, we would like the additions and corrections sent to us (weide@cs) for consideration in a subsequent version. All final entries will be approved by Robert L. Weide and Peter Jansen, editors of the dictionary. Quick Start 1) Create a destination directory to hold the files listed above. 2) On the PG Catalog page click on the selection "More Files". You will see a "files.zip" folder in the list. Move this zipped folder to your computer. On your computer open "files.zip", double click on its "files" subdirectory and copy the contents into the destination directory on your computer. End of this Project Gutenberg etext of Moby Pronunciator II by Grady Ward.