GitHub - XiliangSong/relationfactory: End-to-end relation extraction and knowledge base population pipeline.

RelationFactory is a relation extraction and knowledge-base population system.
It was the top-ranked system in TAC KBP 2013 English Slot-filling (http://www.nist.gov/tac/2013/KBP/index.html).
If you want to use RelationFactory in a TAC benchmark, please contact the authors (see LICENSE for details).
RelationFactory uses SVMLight (http://svmlight.joachims.org/) for classification, so you must agree to the
SVMLight license, in particular its restriction to scientific use only.

QUICK START
===========

0. Prerequisites

Make sure the following software is installed:

ghc, version >= 7.4.1
cabal, version >= 1.14.0
java / JDK, version >= 6 (the Oracle one)
unix tools, including wget
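
The list above can be checked before compiling. The following is a minimal sketch, not part of RelationFactory: the function names version_ge and check_prereqs are hypothetical, the version comparison assumes GNU `sort -V` is available, and it assumes the cabal requirement refers to the `cabal` command-line tool. It only tests that java and wget are on the PATH and does not verify the Java version or vendor.

```shell
#!/usr/bin/env bash
# Hypothetical helper functions; they are not shipped with RelationFactory.

# version_ge A B: succeed if dotted version A is >= version B.
# Assumes GNU coreutils `sort -V` (version sort).
version_ge() {
  [ "$(printf '%s\n%s\n' "$1" "$2" | sort -V | head -n 1)" = "$2" ]
}

# check_prereqs: report missing tools and ghc/cabal versions that are too old.
# Returns 0 if everything looks fine, 1 otherwise.
check_prereqs() {
  local status=0 tool
  for tool in ghc cabal java wget; do
    if ! command -v "$tool" >/dev/null 2>&1; then
      echo "missing: $tool" >&2
      status=1
    fi
  done
  if command -v ghc >/dev/null 2>&1 &&
     ! version_ge "$(ghc --numeric-version)" 7.4.1; then
    echo "ghc is older than 7.4.1" >&2
    status=1
  fi
  if command -v cabal >/dev/null 2>&1 &&
     ! version_ge "$(cabal --numeric-version)" 1.14.0; then
    echo "cabal is older than 1.14.0" >&2
    status=1
  fi
  return "$status"
}
```

After sourcing the file, run check_prereqs; a non-zero exit status means at least one requirement was reported as missing or too old.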

1. Download models

If you want to use pre-trained models, download them from our server:

wget https://www.lsv.uni-saarland.de/fileadmin/data/relationfactory_models.tar.gz
tar xzf relationfactory_models.tar.gz

2. Set paths

For example, put the following lines in your ~/.bashrc:

# relationfactory clone
export TAC_ROOT=/path/to/relationfactory
# pre-trained models
export TAC_MODELS=/path/to/relationfactory_models

The TAC_ROOT variable has to be set. The TAC_MODELS variable is optional.
If it is not set, the models have to be specified in the config file.
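
To confirm the environment before compiling or running, a small check along the following lines may help. This is a sketch under assumptions: the function name check_tac_env is hypothetical, and it only tests that the variables point to existing directories, not that their contents are complete.

```shell
#!/usr/bin/env bash
# Hypothetical helper; it is not shipped with RelationFactory.

# check_tac_env: fail if TAC_ROOT is unset or not a directory.
# TAC_MODELS is optional, but if set it must be a directory.
check_tac_env() {
  if [ -z "${TAC_ROOT:-}" ]; then
    echo "TAC_ROOT is not set" >&2
    return 1
  fi
  if [ ! -d "$TAC_ROOT" ]; then
    echo "TAC_ROOT is not a directory: $TAC_ROOT" >&2
    return 1
  fi
  if [ -z "${TAC_MODELS:-}" ]; then
    # Allowed: the models then have to be specified in the config file.
    echo "TAC_MODELS is not set; specify the models in the config file"
  elif [ ! -d "$TAC_MODELS" ]; then
    echo "TAC_MODELS is not a directory: $TAC_MODELS" >&2
    return 1
  fi
  return 0
}
```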

3. Compile system

$TAC_ROOT/bin/generate_system.sh

4. Index corpus

See the corresponding README in $TAC_ROOT/indexing

5. Configure run

Start from the settings in $TAC_ROOT/config/system2013.config and
adapt them to your model and index locations.
Also point the config to the TAC queries file for which you want results, and
specify a rundir where the files for that run are written.
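
One way to start a new configuration is to copy the 2013 template and list the lines that look like paths, since those are the likely candidates for adaptation. This is only a sketch: the function name new_run_config is hypothetical, and treating every line that contains a slash as a path is an assumption about the config file, whose exact format is not described here.

```shell
#!/usr/bin/env bash
# Hypothetical helper; it is not shipped with RelationFactory.

# new_run_config DEST: copy the template config to DEST and print, with line
# numbers, every line containing a '/' (assumed to be a path to review).
new_run_config() {
  local dest="$1"
  local template="${TAC_ROOT:?TAC_ROOT must be set}/config/system2013.config"
  cp "$template" "$dest" || return 1
  # grep exits non-zero when nothing matches; that is not an error here.
  grep -n '/' "$dest" || true
}
```

For example, `new_run_config your_system.config` creates the file passed to run.sh in the next step; the listed lines still have to be edited by hand.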

6. Run

$TAC_ROOT/bin/run.sh your_system.config

7. Check response

Check the output file, /your/rundir/response_fast_pp13. It should contain,
for each query, a mixture of NIL answers and other answers, many of them
with a score of 1.0 and others with a lower score.
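
A quick sanity check of the response file is to count NIL and non-NIL answers. The sketch below assumes that the response file is tab-separated and that the fourth field is the literal string NIL for NIL answers, as in the TAC KBP 2013 slot-filling response format; please verify this against your own output. The function name summarize_response is hypothetical.

```shell
#!/usr/bin/env bash
# Hypothetical helper; it is not shipped with RelationFactory.

# summarize_response FILE: count NIL and non-NIL answer lines.
# Assumption: tab-separated fields, with "NIL" in field 4 for NIL answers.
summarize_response() {
  awk -F '\t' '
    $4 == "NIL" { nil++; next }   # NIL answer
    NF > 0      { other++ }       # any other non-empty line
    END { printf "NIL answers: %d\nother answers: %d\n", nil, other }
  ' "$1"
}
```

For example, `summarize_response /your/rundir/response_fast_pp13` prints the two counts. A file that contains only NIL answers may indicate a problem with the index or model paths.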

Evaluate your run using the official TAC scorer.
Note that, due to refactoring, the answers returned differ slightly from those in TAC 2013.
The 'exact' evaluation, which depends on document ids and offsets being included in the answer pool,
is very sensitive to such differences.
Use the 'anydoc' evaluation mode to obtain more robust scores.

8. How to change the pipeline

Edit $TAC_ROOT/bin/makefile and insert a rule describing your new target.
