About | Citing |Questions |Download |Included Tools |Extensions |Release history |Sample output |Online |FAQ
- Mate Universal Tab Translator
- Mate: Universal Tab Translator 6 0 36
- Mate: Universal Tab Translator 6 0 3 0
About
We may not be able to travel to foreign countries right now, but we can bring foreign countries to us. Start learning a new language today with Babbel: https. Translate real-time conversations, menus and street signs while offline, websites, documents, and more using the Translator apps For business use Globalize your business and customer interactions by translating text and speech using the Translator API and Speech service, both in the Azure Cognitive Services family.
Sharon Osbourne Says She & Ozzy Still Have Sex 'Couple of Times a Week': 'It Used to Be 3 Times a Day' People Social media companies go into fire-alarm mode over Trump diagnosis POLITICO. Quickly convert Universal Time (UTC) to time in Brasilia, Brazil with this easy-to-use, modern time zone converter.
A natural language parser is a program that works out the grammaticalstructure of sentences, for instance, which groups of words go together(as 'phrases') and which words are the subject or object of averb. Probabilistic parsers use knowledge of language gained fromhand-parsed sentences to try to produce the most likely analysis of newsentences. These statistical parsers still make some mistakes, butcommonly work rather well. Their development was one of the biggest breakthroughs innatural language processing in the 1990s. You can try out our parseronline.
Package contents
This package is a Java implementation of probabilistic natural languageparsers, both highly optimized PCFG and lexicalized dependency parsers, and alexicalized PCFG parser. The original version of this parser was mainly written by Dan Klein,with support code and linguistic grammar development by Christopher Manning. Extensive additional work (internationalization and language-specificmodeling, flexible input/output, grammar compaction, lattice parsing,k-best parsing,typed dependencies output,user support, etc.) has been done by Roger Levy, Christopher Manning,Teg Grenager, Galen Andrew, Marie-Catherine de Marneffe, BillMacCartney, Anna Rafferty, Spence Green, Huihsin Tseng, Pi-Chuan Chang, WolfgangMaier, and Jenny Finkel.
The lexicalized probabilistic parser implements a factored product model, with separate PCFG phrase structure and lexical dependency experts, whose preferences are combined by efficient exact inference, using an A* algorithm.Or the software can be used simply as an accurate unlexicalized stochasticcontext-free grammar parser.Either of these yields a good performance statistical parsing system.A GUI is provided for viewing the phrase structure tree output of the parser.
As well as providing an English parser, the parser can beand has been adapted to work with other languages.A Chinese parser based on the Chinese Treebank, a Germanparser based on the Negra corpus and Arabic parsers based on the Penn Arabic Treebank are also included.The parser has also been used for other languages, such as Italian,Bulgarian, and Portuguese.
The parser provides Universal Dependencies (v1) and Stanford Dependencies output as well as phrase structure trees. Typed dependencies areotherwise known grammatical relations. This style of output is available only for English and Chinese.For more details, please refer to the Stanford Dependencies webpage and the Universal Dependencies v1 documentation. (See also the current Universal Dependencies documentation, but we are yet to update to it.).
Shift-reduce constituency parser
As of version 3.4 in 2014, the parser includes the code necessary to run a shift reduce parser, a much faster constituent parser with competitive accuracy. Models for this parser are linked below.
Neural-network dependency parser
In version 3.5.0 (October 2014) we released a high-performance dependency parser powered by a neural network. The parser outputs typed dependency parses for English and Chinese. The models for this parser are included in the general Stanford Parser models package.
Dependency scoring
The package includes a tool for scoring of generic dependency parses, in a class edu.stanford.nlp.trees.DependencyScoring. This tool measures scores for dependency trees, doing F1 and labeled attachment scoring. The included usage message gives a detailed description of how to use the tool.
Usage notes
The current version of the parser requires Java 8 or later.(You can also download an old version of the parser, version 1.4,which runs under JDK 1.4, version 2.0 which runs under JDK 1.5, version 3.4.1which runs under JDK 1.6, but those distributions are no longer supported.)The parser also requires a reasonable amount of memory (at least 100MB to run as a PCFG parser on sentences up to 40 words in length; typically around 500MB of memory to be able to parse similarly long typical-of-newswire sentences using the factored model).
The parser is available for download,licensed under the GNUGeneral Public License (v2 or later). Source is included. The packageincludes components for command-line invocation, a Java parsingGUI, and a Java API.
The download is a 261 MB zipped file (mainly consisting of included grammar data files). If you unpack the zip file, you should have everything needed. Simple scripts are included to invoke the parser on a Unix or Windows system. For another system, you merely need to similarly configure the classpath.
Licensing
The parser code is dual licensed (in a similar manner to MySQL, etc.). Open source licensing is under the full GPL,which allows many free uses.For distributors of proprietarysoftware, commercial licensing is available.(Fine print: The traditional (dynamic programmed) Stanford Parser does part-of-speech tagging as it works, but the newer constituency and neural network dependency shift-reduce parsers require pre-tagged input. For convenience, we include the part-of-speech tagger code, but not models with the parser download. However, if you want to use these parsers under a commercial license, then you need a license to both the Stanford Parser and the Stanford POS tagger. Or you can get the whole bundle of Stanford CoreNLP.)If you don't need a commercial license, but would like to supportmaintenance of these tools, we welcome gift funding: use this form and write 'Stanford NLP Group open source software' in the Special Instructions.
Citing the Stanford Parser
The main technical ideas behind how these parsers work appear in thesepapers. Feel free to cite one or more of the following papers or people depending on what youare using. Since the parser is regularly updated, we appreciate it ifpapers with numerical results reflecting parser performance mention theversion of the parser being used!
For the neural-network dependency parser:
Danqi Chen and Christopher D Manning. 2014. A Fast and Accurate Dependency Parser using Neural Networks. Proceedings of EMNLP 2014
For the Compositional Vector Grammar parser (starting at version 3.2):
Richard Socher, John Bauer, Christopher D. Manning and Andrew Y. Ng. 2013.Parsing With Compositional Vector Grammars. Proceedings of ACL 2013
For the Shift-Reduce Constituency parser (starting at version 3.2):
This parser was written by John Bauer. You can thank him and cite the web page describing it: https://nlp.stanford.edu/software/srparser.html. You can also cite the original research papers of others mentioned on that page.
For the PCFG parser (which also does POS tagging):
Dan Klein and Christopher D. Manning. 2003. Accurate Unlexicalized Parsing. Proceedings of the 41st Meeting of the Association for Computational Linguistics, pp. 423-430.
For the factored parser (which also does POS tagging):
Dan Klein and Christopher D. Manning. 2003. Fast Exact Inference with a Factored Model for Natural Language Parsing. In Advancesin Neural Information Processing Systems 15 (NIPS 2002), Cambridge, MA: MIT Press, pp. 3-10.
For the Universal Dependencies representation:
Joakim Nivre, Marie-Catherine de Marneffe, Filip Ginter, Yoav Goldberg, Jan Hajič,Christopher D. Manning, Ryan McDonald, Slav Petrov, Sampo Pyysalo, Natalia Silveira,Reut Tsarfaty, and Daniel Zeman. 2016. Universal Dependencies v1: A Multilingual Treebank Collection. In LREC 2016.
For the English Universal Dependencies converter and the enhanced English Universal Dependencies representation:
Sebastian Schuster and Christopher D. Manning. 2016. Enhanced English Universal Dependencies: An Improved Representation for Natural Language Understanding Tasks.In LREC 2016.
For the (English) Stanford Dependencies representation:
Marie-Catherine de Marneffe, Bill MacCartney and Christopher D. Manning. 2006. GeneratingTyped Dependency Parses from Phrase Structure Parses. In LREC 2006.
For the German parser:
Anna Rafferty and Christopher D. Manning. 2008.Parsing Three German Treebanks: Lexicalized and Unlexicalized Baselines.In ACL Workshop on Parsing German.
For the Chinese Parser:
Roger Levy and Christopher D. Manning.2003.Is it harder to parse Chinese, or the Chinese Treebank?ACL 2003, pp. 439-446.
For the Chinese Stanford Dependencies:
Pi-Chuan Chang, Huihsin Tseng, Dan Jurafsky, and Christopher D. Manning.2009.Discriminative Reordering with Chinese Grammatical Relations Features.In Proceedings of the Third Workshop on Syntax and Structure in Statistical Translation.
For the Arabic parser:
Spence Green and Christopher D. Manning.2010.Better Arabic Parsing: Baselines, Evaluations, and Analysis.In COLING 2010.
For the French parser:
Spence Green, Marie-Catherine de Marneffe, John Bauer, and Christopher D. Manning.2010.Multiword Expression Identification with Tree Substitution Grammars: A Parsing tour de force with French..In EMNLP 2011.
For the Spanish parser:
Most of the work on Spanish was by Jon Gauthier. There is no published paper, but you can thank him and/or citethis webpage:https://nlp.stanford.edu/software/spanish-faq.html
Questions about the parser?
- If you're new to parsing, you can start by running the GUI to tryout the parser. Scripts are included for linux (lexparser-gui.sh) andWindows (lexparser-gui.bat).
- Take a look at the Javadoc
lexparser
packagedocumentation andLexicalizedParser
class documentation.(Point your web browser at theindex.html
file in the includedjavadoc
directory and navigate to those items.) - Look at the parser FAQ for answers to common questions.
- If none of that helps, please see our emailguidelines for instructions on how to reach us for further assistance.
Download
Download Stanford Parser version 4.0.0The standard download includes models for Arabic, Chinese, English, French, German, and Spanish. Thereare additional models we do not release with the standalone parser, including shift-reduce models, thatcan be found in the models jars for each language. Below are links to those jars.
Arabic Models Chinese Models English Models French Models German Models Spanish Models
Extensions: Packages by others using the parser
Java
- tydevi Typed DependencyViewer that makes a picture of the Stanford Dependencies analysis of a sentence. By Bernard Bou.
- DependenSee A Dependency Parse Visualisation Tool that makespictures of Stanford Dependency output. By Awais Athar. (GitHub)
- GATEplug-in. By the GATE Team (esp. Adam Funk).
- GrammarScopegrammatical relation browser. GUI, especially focusing on grammatical relations (typed dependencies), including an editor. ByBernard Bou.
PHP
- PHP-Stanford-NLP. Supports POS Tagger, NER, Parser. By Anthony Gentile (agentile).
Python/Jython
- Pythoninterface built using JPype by Stefanie Tellex.
- Jython interface.by Viktor Pekar.
Ruby
- Ruby wrapper to theStanford Natural Language Parser. By Bill McNeill. An extended andbetter packaged version of this by John Wilkinson is available at github.
.NET / F# / C#
- Sergey Tihon has ported the Stanford Parser to F# (or any .NET language, including C#), using IKVM. See his blog post, his Github site, or the listing on NuGet.
OS X
- If you use Homebrew, you can install the Stanford Parser with:brew install stanford-parser
Release history
Version 4.0.0 | 2020-05-22 | Model tokenization updated to UDv2.0 | |
Version 3.9.2 | 2018-10-17 | Updated for compatibility | |
Version 3.9.1 | 2018-02-27 | new French and Spanish UD models, misc. UD enhancements, bug fixes | |
Version 3.8.0 | 2017-06-09 | Updated for compatibility | |
Version 3.7.0 | 2016-10-31 | new UD models | |
Version 3.6.0 | 2015-12-09 | Updated for compatibility | |
Version 3.5.2 | 2015-04-20 | Switch to universal dependencies | shift reduce parser models |
Version 3.5.1 | 2015-01-29 | Dependency parser fixes and model improvements | shift reduce parser models |
Version 3.5.0 | 2014-10-31 | Upgrade to Java 8; add neural-network dependency parser | shift reduce parser models |
Version 3.4.1 | 2014-08-27 | Add Spanish models | shift reduce parser models |
Version 3.4 | 2014-06-16 | Shift-reduce parser, dependency improvements, French parser uses CC tagset | shift reduce parser models |
Version 3.3.1 | 2014-01-04 | English dependency 'infmod' and 'partmod' combined into 'vmod', other minor dependency improvements | |
Version 3.3.0 | 2013-11-12 | English dependency 'attr' removed, other dependency improvements, imperative training data added | |
Version 3.2.0 | 2013-06-20 | New CVG based English model with higher accuracy | |
Version 2.0.5 | 2013-04-05 | Dependency improvements, -nthreads option, ctb7 model | |
Version 2.0.4 | 2012-11-12 | Improved dependency code extraction efficiency, other dependency changes | |
Version 2.0.3 | 2012-07-09 | Minor bug fixes | |
Version 2.0.2 | 2012-05-22 | Some models now support training with extra tagged, non-tree data | |
Version 2.0.1 | 2012-03-09 | Caseless English model included, bugfix for enforced tags | |
Version 2.0 | 2012-02-03 | Threadsafe! | |
Version 1.6.9 | 2011-09-14 | Improved recognition of imperatives, dependencies now explicitely include a root, parser knows osprey is a noun | |
Version 1.6.8 | 2011-06-19 | New French model, improved foreign language models, bug fixes | |
Version 1.6.7 | 2011-05-18 | Minor bug fixes. | |
Version 1.6.6 | 2011-04-20 | Internal code and API changes (ArrayLists rather than Sentence; use of CoreLabel objects) to match tagger and CoreNLP. | |
Version 1.6.5 | 2010-11-30 | Further improvements to English Stanford Dependencies and other minor changes | |
Version 1.6.4 | 2010-08-20 | More minor bug fixes and improvements to English Stanford Dependencies and question parsing | |
Version 1.6.3 | 2010-07-09 | Improvements to English Stanford Dependencies and question parsing, minor bug fixes | |
Version 1.6.2 | 2010-02-26 | Improvements to Arabic parser models, and to English and Chinese Stanford Dependencies | |
Version 1.6.1 | 2008-10-26 | Slightly improved Arabic and German parsing, and Stanford Dependencies | |
Version 1.6 | 2007-08-19 | Added Arabic, k-best PCCFG parsing; improved English grammatical relations | |
Version 1.5.1 | 2006-06-11 | Improved English and Chinese grammatical relations; fixed UTF-8 handling | |
Version 1.5 | 2005-07-21 | Added grammatical relations output; fixed bugs introduced in 1.4 | |
Version 1.4 | 2004-03-24 | Made PCFG faster again (by FSA minimization); added German support | |
Version 1.3 | 2003-09-06 | Made parser over twice as fast; added tokenization options | |
Version 1.2 | 2003-07-20 | Halved PCFG memory usage; added support for Chinese | |
Version 1.1 | 2003-03-25 | Improved parsing speed; included GUI, improved PCFG grammar | |
Version 1.0 | 2002-12-05 | Initial release |
Sample input and output
The parser can read various forms of plain text input and can outputvarious analysis formats, including part-of-speech tagged text, phrasestructure trees, and a grammatical relations (typed dependency) format.For example, consider the text:
The strongest rain ever recorded in India shut down the financialhub of Mumbai, snapped communication lines, closed airports and forcedthousands of people to sleep in their offices or walk home during thenight, officials said today.
Mate Universal Tab Translator
The following output showspart-of-speech tagged text, then a context-free phrase structure grammarrepresentation, and finally a typed dependency representation. All ofthese are different views of the output of the parser.
Mate: Universal Tab Translator 6 0 36
This output was generated with the command:
Mate: Universal Tab Translator 6 0 3 0
java -mx200m edu.stanford.nlp.parser.lexparser.LexicalizedParser-retainTMPSubcategories -outputFormat'wordsAndTags,penn,typedDependencies' englishPCFG.ser.gz mumbai.txt