DIMAP provides the capability for creating and maintaining dictionaries for use in natural language processing and language technology applications. DIMAP contains basic functionality for managing multiple dictionaries, search and analysis of dictionary data (such as parsing definitions to create semantic networks, extracting subdictionaries to focus on specific lexical items, and mapping definitions between dictionaries), import and export of data (e.g., into XML), and automatic creation of dictionaries using integrated resources (such as WordNet). All parsing functionality uses the Proximity Parser, with integrated and dynamic auxiliary dictionary lookup, using either WordNet or licensed dictionaries (e.g., from Oxford University Press (The Oxford Dictionary of English)). Several publicly-available dictionaries are available, including alphabetic versions of WordNet, the UMLS Specialist Lexicon, and FrameNet.
DIMAP-4 is available at $350 for single users (ordering information), with a 45-day money back guarantee, and with guaranteed updates for at least one year. The demo version contains significant portions of the full functionality, excluding those functions requiring parsing or licensed dictionaries.
|Dictionary Management (open any number of dictionaries at a time, with quick overview of dictionary entries; rapid opening of recently used dictionaries; merge several dictionaries; and easy backup or renaming)|
|Entry Management (easy editing, deletion, and creation of new entries; quick overview of an entry's senses; create bilingual dictionaries entries in source language and dictionary information in English (enabling use of most of the DIMAP functionality)|
|Entry Maintenance (multiple senses; special fields for: part of speech (customizable), definition identifier, label number, usage label, definition, usage note, multiple superconcepts (hypernym, genus), multiple instances (hyponyms), multiple feature structures (attribute-value pairs), multiple roles (semantic relations) each with multiple links; customized editing for superconcepts, instances, feature structures, and roles)|
|Drop-down Feature Attributes and Value (user can predefine commonly used feature attributes and their associated values to be used as a drop-down list that can facilitate reuse of these items with single keystrokes)|
|Hypernym Disambiguation (superconcept or hypernym links can access the sense inventory of the auxiliary dictionary, e.g., WordNet, a licensed dictionary, or a user-developed dictionary, for easy lookup and selection of one or more appropriate senses)|
|Search of Definitions|
|Regular expression search (on all fields, with search results shown on screen or printed to a file, with format to your specifications)|
|Extract subdictionaries (using search mechanism to create a file of selected entries that can be uploaded into a new DIMAP dictionary) definition parsing (using the Proximity Parser)|
|Definition Parsing (Using the Proximity Parser)|
|Parse individual definitions or all definitions, in step or batch mode; start at any entry, with position remembered between sessions; automatically identify and/or add semantic relations discovered during parsing (including synonyms), with user-customizable regular expression patterns for recognition; diagnostic definition parsing aids (print to files such things as parse output, identified semantic relations, bad parses, definitions with no identified semantic relations, comparison to WordNet hierarchy, and unknown words)|
|Compare and map definitions (across dictionaries, useful for mapping among a main dictionary and independently developed derived dictionaries; all or individual entries, with or without stop list; word overlap using best fit and componential analysis using score based on matches between hypernyms and other semantic relations, using WordNet synsets to allow "fuzzy" matches; visual display of edit distance difference between definitions)|
|Analysis of dictionary digraph (based on hypernym links to identify primitive senses among the definitions, for whole or partial dictionaries such as thesaurus groupings; summarizes hypernym links among entries; identifies non-primitive (derived) entries and senses; identifies primitive defining vocabulary; identifies definitional cycles; particularly useful when thesaurus entries are linked to definitions|
|Conversion (Import and Export)|
|Uploading dictionary data from other sources (requires specific format)|
|Downloading dictionary data for use elsewhere according to your own format (template editor to facilitate format specification, including addition of your own strings (such as SGML, HTML, or XML codes)|
|Create dictionaries based on analyzing your own texts (lists or continuous, including Latin-1 languages): all words (with automatic tokenizer), capitalized phrases (with join words, approximating named-entity acquisition), longest contiguous non-interrupted phrase without a stop word or punctuation (with dictionary lookup, approximating compound noun acquisition)||No lookup|
|Batch creation using integrated licensed dictionary or WordNet||Requires licensed dictionary|
|Integrated Dictionary Lookup|
|WordNet, with all information converted into DIMAP format, thus allowing a word-based use of WordNet, rather than a synset-based use|
|Licensed dictionary (or use WordNet as auxiliary dictionary)||Requires DIMAP
CL Research is seeking beta-testers and others interested in assisting the furtherance of lexicological and text-analysis research objectives for its Windows-based DIMAP dictionary and content analysis software. To obtain DIMAP, you must first complete a license. DIMAP is available without cost for beta-testing and research purposes to academic organisations and by arrangement for beta-testing, evaluation, and research purposes to commercial organisations. For organisations wishing to use DIMAP-3 for commercial purposes, commercial licensing arrangements may be made: contact email@example.com for further details. Source code for both PC and Unix systems is also available for the Proximity parser to interested parties.
The license agreements are standard agreements (to protect CL Research's intellectual property rights) for software evaluation and beta-testing, with only a few differences from boilerplate language. The differences are focused in sections 3.2, 5.2, and 6.2 and are designed to give licensees flexibility. The commercial license agreement is particularly designed for individuals who wish to develop ideas for making use of lexicons in their own applications. The academic agreement is for 12 months (allowing students to use a copy on their own computers) and the commercial agreement for 3 months.
Academic and commercial license agreements are available without cost. Those signing a license agreement will have access to the latest versions of DIMAP as it evolves. CL Research is especially interested in making arrangements with researchers involved in lexicon design and content analysis. (The academic license agreement allows students to use DIMAP on their own personal computers.) Since CL Research maintains the "Lexical Resources" page for Association for Computational Linguistics Special Interest Group on the Lexicon, we are particularly interested in developing dictionaries (lexicons) that can be made freely and publicly available (and especially those which follow procedures for category development, as described in papers available at the Web site). CL Research will also accept beta-test and evaluation license agreements with those interested in using DIMAP for commercial purposes.
Please select the license appropriate for your organisation:
print it, and mail TWO COPIES of the signed, completed license to
9208 Gue Road
Damascus, MD 20879-1025 USA
To facilitate your access, please include your email address, a user name as you wish to type it in, and a 6-character or more password (letters and numbers). Access can then be activated upon receipt instead of waiting for assignment at CL Research and return by regular mail.
Maintained by Ken Litkowski
© 2001-8 CL Research