COMPAS (COMpiler for PArsing Schemata)
On this page, you can download the source code and binary files for the COMPAS parsing schema compiler.
About the COMPAS system
COMPAS (COMpiler for PArsing Schemata) is a system that can be used to
automatically compile formal specifications of parsing algorithms (in
the form of parsing schemata) to efficient Java implementations of the corresponding parsers.
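To give a rough idea of what a parsing schema describes, the following is a minimal hand-written sketch of an agenda-based deduction loop for the CYK schema, whose single binary step combines items [B, i, k] and [C, k, j] into [A, i, j] whenever the grammar contains A -> B C. It is only an illustration: it is not code generated by COMPAS, it does not use the COMPAS input syntax, and the class name CykSchemaSketch, the Item class and the array-based rule encoding are assumptions made for this example.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Deque;
import java.util.HashSet;
import java.util.List;
import java.util.Objects;
import java.util.Set;

// Hypothetical illustration only; not produced by COMPAS.
public class CykSchemaSketch {

    /** Item [symbol, start, end]: symbol derives the words at positions start..end-1. */
    static final class Item {
        final String symbol;
        final int start;
        final int end;

        Item(String symbol, int start, int end) {
            this.symbol = symbol;
            this.start = start;
            this.end = end;
        }

        @Override
        public boolean equals(Object o) {
            if (!(o instanceof Item)) return false;
            Item other = (Item) o;
            return symbol.equals(other.symbol) && start == other.start && end == other.end;
        }

        @Override
        public int hashCode() {
            return Objects.hash(symbol, start, end);
        }
    }

    /**
     * Binary rules are encoded as {lhs, left, right}; lexical rules as {lhs, word}.
     * Returns true if the goal item [startSymbol, 0, n] can be deduced.
     */
    public static boolean recognize(List<String[]> binaryRules, List<String[]> lexicalRules,
                                    String startSymbol, List<String> words) {
        Set<Item> chart = new HashSet<>();      // every item deduced so far
        Deque<Item> agenda = new ArrayDeque<>(); // items not yet used as premises

        // Initial step: from A -> w_i, deduce [A, i, i+1].
        for (int i = 0; i < words.size(); i++) {
            for (String[] rule : lexicalRules) {
                if (rule[1].equals(words.get(i))) {
                    addIfNew(new Item(rule[0], i, i + 1), chart, agenda);
                }
            }
        }

        // Exhaust the agenda, trying each item as left and as right premise.
        while (!agenda.isEmpty()) {
            Item current = agenda.poll();
            // Iterate over a copy, because combine() may add items to the chart.
            for (Item other : new ArrayList<>(chart)) {
                combine(current, other, binaryRules, chart, agenda);
                combine(other, current, binaryRules, chart, agenda);
            }
        }
        return chart.contains(new Item(startSymbol, 0, words.size()));
    }

    /** Binary step: [B, i, k] and [C, k, j] with A -> B C yield [A, i, j]. */
    private static void combine(Item left, Item right, List<String[]> binaryRules,
                                Set<Item> chart, Deque<Item> agenda) {
        if (left.end != right.start) return;
        for (String[] rule : binaryRules) {
            if (rule[1].equals(left.symbol) && rule[2].equals(right.symbol)) {
                addIfNew(new Item(rule[0], left.start, right.end), chart, agenda);
            }
        }
    }

    private static void addIfNew(Item item, Set<Item> chart, Deque<Item> agenda) {
        if (chart.add(item)) agenda.add(item);
    }

    public static void main(String[] args) {
        List<String[]> binary = Arrays.asList(
                new String[] {"S", "NP", "VP"}, new String[] {"VP", "V", "NP"});
        List<String[]> lexical = Arrays.asList(
                new String[] {"NP", "she"}, new String[] {"NP", "fish"}, new String[] {"V", "eats"});
        System.out.println(recognize(binary, lexical, "S", Arrays.asList("she", "eats", "fish")));
    }
}
```

The sketch pairs each new item with every item already in the chart, which is simple but slow; the journal article and the index-generation paper listed below discuss how the real system avoids this kind of exhaustive search.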
The system was registered with the Xunta de Galicia Intellectual Property Office, with registration number C-40-2009.
The system's design and implementation are described in this journal article:
Carlos Gómez-Rodríguez, Jesús Vilares and Miguel A. Alonso,
A compiler for parsing schemata,
Software: Practice and Experience, 39(5):441-470, 2009. ISSN 0038-0644. DOI 10.1002/spe.904
[GomVilAlo2009a.pdf]
Some design aspects are described in less detail in the following publications:
Carlos Gómez-Rodríguez, Jesús Vilares and Miguel A. Alonso,
Compiling Declarative Specifications of Parsing Algorithms,
in R. Wagner, R. Newell and G. Pernul
(eds.), Database and Expert Systems Applications, volume 4653 of
Lecture Notes in Computer Science, pp. 529-538, Springer-Verlag,
Berlin-Heidelberg-New York, 2007. ISSN 0302-9743.
[GomVilAlo2007a.pdf]
Carlos Gómez-Rodríguez, Miguel A. Alonso and Manuel Vilares,
Generation of indexes for compiling efficient parsers from formal specifications,
in Roberto Moreno-Díaz, Franz
Pichler, and Alexis Quesada-Arencibia (eds.), Computer Aided Systems
Theory, volume of Lecture Notes in Computer Science, Springer-Verlag,
Berlin-Heidelberg-New York, 2007. ISSN 0302-9743.
[GomAloVil2007b.pdf]
Carlos Gómez-Rodríguez, Jesús Vilares and Miguel A. Alonso,
Prototyping Efficient Natural Language Parsers,
in Proc. of International Conference RANLP 2007, Recent Advances in Natural Language Processing, Borovets, Bulgaria, 2007.
[GomVilAlo2007b.pdf]
Carlos Gómez-Rodríguez, Jesús Vilares and Miguel A. Alonso,
Automatic Generation of Natural Language Parsers from Declarative Specifications,
in Loris Penserini, Pavlos Peppas and
Anna Perini (eds.), STAIRS 2006 - Proceedings of the Third Starting AI
Researchers' Symposium, Riva del Garda, Italy, August 28-29, 2006,
volume 142 of Frontiers in Artificial Intelligence and Applications,
pp. 259-260, IOS Press, Amsterdam/Berlin/Oxford/Tokyo/Washington DC,
2006. ISSN 0922-6389 / ISBN 1-58603-645-9.
[GomVilAlo2006a.pdf]
Additionally, the system has been used to perform the empirical tests reported in the following publications:
Carlos Gómez-Rodríguez, Miguel A. Alonso and Manuel Vilares,
On Theoretical and Practical Complexity of TAG Parsers,
in Paola Monachesi, Gerald Penn,
Giorgio Satta and Shuly Wintner (eds.), FG 2006: The 11th conference on
Formal Grammar. Malaga, Spain, July 29-30, 2006, chapter 5, pp. 61-75,
Center for the Study of Language and Information, Stanford, 2006.
[GomAloVil2006b.pdf]
Carlos Gómez-Rodríguez, Miguel A. Alonso and Manuel Vilares,
Generating XTAG Parsers from Algebraic Specifications,
in Proceedings of the 8th
International Workshop on Tree Adjoining Grammar and Related
Formalisms. Sydney, July 2006, pp. 103-108, Association for
Computational Linguistics, East Stroudsburg, PA, 2006. ISBN:
1-932432-85-X.
[GomAloVil2006a.pdf]
Both a description of the system and the empirical tests conducted with it are reported in the following Ph.D. thesis:
Carlos Gómez-Rodríguez,
Parsing schemata for practical text analysis,
Ph.D. Thesis, Departamento de Computación, Universidade da Coruña, Spain, 2009 (xviii + 290 pp).
How to use
The COMPAS system is written in Java, so it runs on any system for
which a Java Virtual Machine (JVM) is available, including Windows
and Linux.
You need to have the following software installed in order to use the system:
- A Java Runtime Environment (JRE), version 1.4 or higher. Download it here.
- The Apache Ant build system
is not strictly necessary, but it is highly recommended because it
makes it easy to compile the code generated by the system.
Once you have this software, download the system and refer to "readme.txt" for detailed usage instructions. If you understand Spanish, you can also download a user manual in that language.
Download (source code + binaries)
The COMPAS system can be downloaded here. The archive contains both the binaries and complete source code.
A 20-page user manual in Spanish is also available; download it here.
Licensing terms
COMPAS is free software: you can redistribute it and/or modify it under
the terms of the GNU General Public License (version 3) as published by
the Free Software Foundation.
COMPAS is distributed in the hope that it will be useful, but WITHOUT
ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or
FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public
License for more details.
The text of the GNU General Public License can be found inside the distribution or at http://www.gnu.org/licenses/
Contact
The system is maintained by Carlos Gómez Rodríguez. Please direct any questions or comments by email to <cgomezr@udc.es>.
Last update: 2009-06-14.