GATE 2: 18-month report
This is the second in a series of progress reports for the GATE 2 project
(EPSRC grant GR/M31699, running from July 1999 to summer 2002).
The previous report is here.
Our starting points are the original
GATE project (GR/K25267 - available at http://www.dcs.shef.ac.uk/nlp/gate/),
and the proposal document for the new project.
Our work over the last 9 months, from April through December
2000 has been to implement the second version of the GATE system
in order to satisfy as many as possible of the requirements identified
in the first phase of the project.
Use cases
We have identified a number
requirements
for Software
Architecture for Language Engineering (SALE); the version of the system
at half-way through the project meets (partially or completely) the
following subset:
- Processing resources
- Locating, loading and initialising components from local and non-local
machines. [Fully met]
- Executing processing components, serially or in parallel.
[Partially met]
- Representing information about components. [Fully met]
- Factoring out commonalities amongst components.
- Language Resources, corpora and annotation
- Accessing data components.
- Managing collections of documents (including recordings) and their
formats. [Partially met]
- Representing information about text and speech. [Fully met]
- Representing information about language.
- Indexing and retrieving information. [Partially met]
- Methods and applications
- Method support
- FST, unification and statistics over information. [Partially
met]
- Comparing different versions of information. [Partially
met]
- Application issues
- Storing information. [Partially met]
- Deployment and embedding. [Fully met]
- Development issues
- Interoperation with other infrastructures. [Partially
met]
- Viewing and editing data components and information.
[Partially met]
- UI access to architectural facilities (development environments).
[Partially met]