Need an abbreviation translation database
Is there such a thing?
I want to translate something like "mfg" => "manufacturing". I suppose I need a mapping database, but I cannot find one. I don't know much about lexical translation, and the budget doesn't allow for much research, so are there any quick-and-dirty tools out there?
I took the list from cookre and modified it. It is only a start and mostly for my own use, but it might help someone else:
<?php
$abb = array();
$abb['abbr'] = "abbreviation";
$abb['abr'] = "abridged";
$abb['acad'] = "academy";
$abb['adj'] = "adjective";
$abb['adm'] = "administration";
$abb['agr'] = "agricultural";
$abb['agri'] = "agricultural";
$abb['agric'] = "agricultural";
$abb['anon'] = "anonymous";
$abb['app'] = "appended";
$abb['approx'] = "approximately";
$abb['assn'] = "association";
$abb['bact'] = "bacteria";
$abb['bib'] = "Bible";
$abb['bibliog'] = "bibliography";
$abb['biog'] = "biographer";
$abb['biol'] = "biology";
$abb['bk'] = "bank";
$abb['bkg'] = "banking";
$abb['bldg'] = "building";
$abb['blvd'] = "boulevard";
$abb['bot'] = "botanical";
$abb['bp'] = "bishop";
$abb['brig'] = "brigade";
$abb['bur'] = "bureau";
$abb['ca'] = "chartered accountant";
$abb['cal'] = "calendar";
$abb['cap'] = "capital";
$abb['capt'] = "captain";
$abb['cath'] = "cathedral";
$abb['cent'] = "central";
$abb['ch'] = "church";
$abb['chem'] = "chemist";
$abb['chm'] = "chairman";
$abb['chron'] = "chronology";
$abb['cit'] = "citizen";
$abb['civ'] = "civil";
$abb['clk'] = "clerk";
$abb['co'] = "company";
$abb['col'] = "collected";
$abb['com'] = "committee";
$abb['comdr'] = "commander";
$abb['cond'] = "conductor";
$abb['conf'] = "conference";
$abb['cong'] = "congress";
$abb['consol'] = "consolidated";
$abb['constr'] = "construction";
$abb['cont'] = "continued";
$abb['corp'] = "corporation";
$abb['cp'] = "compare";
$abb['cr'] = "credit";
$abb['ct'] = "court";
$abb['dep'] = "department";
$abb['der'] = "derivative";
$abb['diag'] = "diagram";
$abb['dial'] = "dialect";
$abb['dict'] = "dictionary";
$abb['dipl'] = "diplomatic";
$abb['dir'] = "director";
$abb['disc'] = "discount";
$abb['dist'] = "district";
$abb['distr'] = "distribution";
$abb['div'] = "division";
$abb['doc'] = "document";
$abb['dpt'] = "department";
$abb['dr'] = "doctor";
$abb['dup'] = "duplicate";
$abb['ea'] = "each";
$abb['ecol'] = "ecology";
$abb['econ'] = "economic";
$abb['ed'] = "editor";
$abb['elec'] = "electrician";
$abb['elect'] = "electrical";
$abb['eng'] = "engineer";
$abb['ency'] = "encyclopedia";
$abb['encyc'] = "encyclopedia";
$abb['encycl'] = "encyclopedia";
$abb['est'] = "estimate";
$abb['entom'] = "entomologist";
$abb['exch'] = "exchange";
$abb['exec'] = "executive";
$abb['fac'] = "facsimile";
$abb['fax'] = "facsimile";
$abb['fed'] = "federal";
$abb['ff'] = "following";
$abb['fin'] = "finance";
$abb['fl'] = "fluid";
$abb['ft'] = "foot";
$abb['fwd'] = "foreword";
$abb['gal'] = "gallon";
$abb['gall'] = "gallon";
$abb['gaz'] = "gazette";
$abb['geog'] = "geographer";
$abb['geol'] = "geologic";
$abb['geom'] = "geometry";
$abb['gloss'] = "glossary";
$abb['gov'] = "governor";
$abb['govt'] = "government";
$abb['gram'] = "grammar";
$abb['hist'] = "history";
$abb['her'] = "heraldry";
$abb['hort'] = "horticulture";
$abb['hr'] = "hour";
$abb['ht'] = "height";
$abb['illus'] = "illustrator";
$abb['in'] = "inch";
$abb['inc'] = "incorporated";
$abb['ins'] = "insurance";
$abb['inst'] = "institution";
$abb['intl'] = "international";
$abb['intro'] = "introduction";
$abb['is'] = "island";
$abb['jour'] = "journal";
$abb['jr'] = "junior";
$abb['jud'] = "judicial";
$abb['lab'] = "laboratory";
$abb['lang'] = "language";
$abb['lat'] = "latitude";
$abb['lib'] = "library";
$abb['lieut'] = "lieutenant";
$abb['lt'] = "lieutenant";
$abb['lit'] = "literature";
$abb['ltd'] = "limited";
$abb['mach'] = "machinist";
$abb['mag'] = "magazine";
$abb['maj'] = "major";
$abb['math'] = "mathematics";
$abb['mech'] = "mechanics";
$abb['med'] = "medical";
$abb['mfg'] = "manufacturing";
$abb['mfr'] = "manufacturer";
$abb['mgr'] = "manager";
$abb['misc'] = "miscellaneous";
$abb['mo'] = "month";
$abb['mod'] = "modern";
$abb['ms'] = "manuscript";
$abb['mt'] = "mountain";
$abb['mus'] = "museum";
$abb['narr'] = "narrator";
$abb['natl'] = "national";
$abb['nav'] = "naval";
$abb['neg'] = "negative";
$abb['no'] = "number";
$abb['obit'] = "obituary";
$abb['obj'] = "object";
$abb['orch'] = "orchestra";
$abb['orig'] = "original";
$abb['p'] = "page";
$abb['par'] = "paragraph";
$abb['pat'] = "patent";
$abb['pct'] = "percent";
$abb['perf'] = "performer";
$abb['philos'] = "philosopher";
$abb['phys'] = "physician";
$abb['prod'] = "production";
$abb['prin'] = "principal";
$abb['psych'] = "psychologist";
$abb['pt'] = "port";
$abb['pub'] = "publisher";
$abb['qr'] = "quarter";
$abb['ques'] = "question";
$abb['quot'] = "quotation";
$abb['rec'] = "receipt";
$abb['ref'] = "referee";
$abb['reg'] = "region";
$abb['rev'] = "review";
$abb['rel'] = "release";
$abb['rpt'] = "reprint";
$abb['s'] = "south";
$abb['sc'] = "scene";
$abb['sch'] = "school";
$abb['sci'] = "scientist";
$abb['sculp'] = "sculptor";
$abb['sec'] = "secretary";
$abb['secy'] = "secretary";
$abb['sect'] = "section";
$abb['ser'] = "series";
$abb['serg'] = "sergeant";
$abb['sergt'] = "sergeant";
$abb['sing'] = "singular";
$abb['sol'] = "solicitor";
$abb['sp'] = "species";
$abb['sub'] = "suburb";
$abb['subj'] = "subject";
$abb['sup'] = "supply";
$abb['supt'] = "superintendent";
$abb['surg'] = "surgeon";
$abb['sym'] = "symbol";
$abb['t'] = "town";
$abb['tel'] = "telephone";
$abb['temp'] = "temperature";
$abb['terr'] = "territory";
$abb['theol'] = "theology";
$abb['topog'] = "topographical";
$abb['trans'] = "translator";
$abb['tr'] = "translator";
$abb['treas'] = "treasury"; // also "treasurer"; an array key holds one value, so a second assignment would overwrite this one
$abb['twp'] = "township";
$abb['ult'] = "ultimate";
$abb['univ'] = "university";
$abb['usu'] = "usually";
$abb['var'] = "variant";
$abb['vet'] = "veteran";
$abb['vol'] = "volume";
$abb['vp'] = "vice president";
$abb['writ'] = "writer";
$abb['wt'] = "weight";
$abb['yd'] = "yard";
$abb['yr'] = "year";
?>
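To show how a map like this might be applied, here is a minimal sketch. The function name `expand_abbreviations` and the three-entry `$sample` array are made up for illustration; in practice you would pass the full `$abb` array. It assumes abbreviations appear as whole words and that matching should ignore case.

```php
<?php
// Hypothetical helper (not part of the original list): replaces every whole
// word that is a key in the lookup array with its expansion.
function expand_abbreviations(string $text, array $abb): string
{
    $result = preg_replace_callback(
        '/\b[A-Za-z]+\b/',
        function (array $m) use ($abb): string {
            $key = strtolower($m[0]);
            // Words that are not in the map are left untouched.
            return $abb[$key] ?? $m[0];
        },
        $text
    );
    // preg_replace_callback returns null on a regex error; fall back to the input.
    return $result ?? $text;
}

// A small excerpt of the list so the sketch runs on its own.
$sample = array(
    'mfg'    => 'manufacturing',
    'bldg'   => 'building',
    'approx' => 'approximately',
);

echo expand_abbreviations('Mfg bldg approx 40 units', $sample), "\n";
// prints: manufacturing building approximately 40 units
```

This works best on labels or column names. On ordinary prose, short keys in the list such as 'in', 'is' and 'no' would also match everyday words, so you would probably want to drop or special-case those.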
For a small domain, a short list of acronyms and their expansions can be enough.
However, if you need a more complete solution, you should take the following problems into account:
- The domain may grow, and you will need to maintain a dictionary of abbreviations and expansions.
- Ambiguous abbreviations have more than one expansion depending on the context, and automatically picking the right one may not be trivial.
- Ad hoc abbreviations: for example, programmers tend to abbreviate words that would not be abbreviated in a normal context.
For the growing-domain problem, you can use any of the online abbreviation dictionaries:
- http://www.abbreviations.com/
- http://ciir.cs.umass.edu/irdemo/acronym/index.html
- http://acronyms.silmaril.ie/cgi-bin/uncgi/acronyms
- http://www.all-acronyms.com/
- http://www.acronymfinder.com/
For the problem of ambiguous abbreviations, you will probably need more information to choose an expansion. For example, the online dictionary Abbreviations.com mentioned above includes the domain of each expansion and its "popularity". I used a programmable API combined with domain recognition algorithms to automate the expansion process.
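As a rough sketch of that selection step, assuming each candidate expansion carries a domain label and a popularity score: the `$candidates` data, the domain names, the scores and the `choose_expansion` function are all invented for illustration and do not reflect the Abbreviations.com API or its data.

```php
<?php
// Illustrative data only: these domains and popularity scores are made up.
$candidates = array(
    'sec' => array(
        array('expansion' => 'secretary', 'domain' => 'office',     'popularity' => 0.5),
        array('expansion' => 'second',    'domain' => 'time',       'popularity' => 0.3),
        array('expansion' => 'section',   'domain' => 'publishing', 'popularity' => 0.2),
    ),
);

// Hypothetical helper: prefer candidates from the known domain, then take
// the most popular of whatever is left.
function choose_expansion(array $options, ?string $domain = null): ?string
{
    if (count($options) === 0) {
        return null;
    }
    if ($domain !== null) {
        $inDomain = array_filter($options, function (array $o) use ($domain): bool {
            return $o['domain'] === $domain;
        });
        // Narrow the choice only if the domain matched something.
        if (count($inDomain) > 0) {
            $options = $inDomain;
        }
    }
    // Sort by popularity, highest first; usort also reindexes from 0.
    usort($options, function (array $a, array $b): int {
        return $b['popularity'] <=> $a['popularity'];
    });
    return $options[0]['expansion'];
}

echo choose_expansion($candidates['sec'], 'time'), "\n"; // prints: second
echo choose_expansion($candidates['sec']), "\n";         // prints: secretary
```

The domain itself has to come from somewhere else, for example the name of the table or schema the field belongs to; that part is what the domain recognition step would supply.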
Expansions of ad hoc abbreviations can usually be found in the text around the acronym and matched with simple regular expressions.
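One possible form of this, sketched under the assumption that the text introduces an acronym as "long form (ACRONYM)" and that the acronym is built from the initials of the preceding words. The function name `find_local_expansions` is hypothetical, and real text will need looser patterns than this.

```php
<?php
// Hypothetical helper: collects "long form (ACRONYM)" pairs defined in the text.
function find_local_expansions(string $text): array
{
    $found = array();
    // A parenthesised token of 2 to 10 capital letters, e.g. "(CMS)".
    preg_match_all('/\(([A-Z]{2,10})\)/', $text, $matches, PREG_OFFSET_CAPTURE);
    foreach ($matches[1] as $match) {
        list($acronym, $offset) = $match;
        // Everything before the opening parenthesis, split into words.
        $before = substr($text, 0, $offset - 1);
        $words = preg_split('/\s+/', trim($before), -1, PREG_SPLIT_NO_EMPTY);
        // Take as many preceding words as the acronym has letters.
        $candidate = array_slice($words, -strlen($acronym));
        if (count($candidate) !== strlen($acronym)) {
            continue;
        }
        // Accept the candidate only if its initials spell the acronym.
        $initials = '';
        foreach ($candidate as $word) {
            $initials .= strtoupper($word[0]);
        }
        if ($initials === $acronym) {
            $found[$acronym] = implode(' ', $candidate);
        }
    }
    return $found;
}

$text = 'Orders live in the content management system (CMS) and the '
      . 'product information database (PID).';
print_r(find_local_expansions($text));
// CMS maps to "content management system", PID to "product information database"
```

Abbreviations that drop letters rather than take initials (such as "mfg") will not be caught by this check and still need a dictionary.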
Disclaimer: I worked on the problem of finding the correct acronym expansion while trying to integrate two databases or XML files.