Implementation of confabulation theory at the symbolic level and experiments

Implementation of different sentence completion architectures using confabulation , module hierarchies and multiconfabulation using a symbol-level model of modules and knowledge links.

This implementation is initially the product of a master thesis by Bernard Paulus and Cédric Snauwaert, relased under the GPL - see LICENSE.txt .

Each of the directories here correspond to an eclipse project. See Setup, build and run with Eclipse for an overview on how tho run them.

Overview of the different programs

Here are the programs that might be interesting:

.
|-- java_corpus_preprocessor
|   `-- src
|       `-- Main.java
`-- java_sentence_completion
    `-- src
        |-- colt
        |       `-- SparseMatricesBenchmarks.java
        `-- confabulation
            |-- Main.java
            `-- tests
                `-- BatchCompletionTest.java

In java_corpus_preprocessor, Main.java pre-processes a corpus UTF8 text file into a form suitable for the sentence completion program. It opens a GUI to request the location of the file to pre-process.

In java_sentence_completion,

Main.java is the main sentence completion program. It is REPL that completes the sentences that are inputted in the command line.
SparseMatricesBenchmarks.java is the file where we carried out our test to check whether it was appropriate to walk away of parallelcolt. All that files in that packages are the only ones left that need parallelcolt.
BatchCompletionTest.java is the program that runs multiple completions at once. It was the one used to generate the example in the chapter 5 of our master thesis.

Setup, build and run with Eclipse

Here are the instructions to set up and run the sentence completion project with Eclipse

Prerequisites

You need

At least 1.5 GB of RAM for the intermediary sized corpus (mille et une nuits). Architectures with less RAM are can still run, but for smaller corpus's.
An installation of Eclipse IDE at least Indigo, with JUnit4, downloadable as a single program here (download the "classic" version)
Some preprocessed corpus files. We uploaded some here.

Setup

Launch eclipse and start a new project
Create the project and give java_sentence_completion as the project location
Click next and open the libraries tab
Add java_sentence_completion/src/parallelcolt-0.9.4.jar to the set of external libraries. This is required to compile and run the matrix benchmarks.
Add JUnit4 to the project libraries
Click finish.

Setup: done!

Build and run

This assumes you have your corpus preprocessed / unzipped from the above archive.

Open the Main.java file, and click on the build and run button The program will open a dialog to choose the preprocessed corpus file.
Select your preprocessed corpus file. Beware: if you plan to use an intermediary-sized corpus, like the full text of "les contes des mille et une nuits", apply first the next step first.
If you run the project with a corpus that necessitates too much memory, it crashes and prints the following message
```
Exception in thread "main" java.lang.OutOfMemoryError: Java heap space
    ...
```
To solve this problem, we will rise the limit on memory usage of the Java virtual machine.
1. go to the properties of Main.java
2. open the run/debug settings, and click "edit"
3. Augment the maximal memory usage allowed for the JVM by inserting
```
-Xmx1000m
```
  in the VM field of the Argument tab This allows the JVM to use up to 1000 MB of memory. Launch Main.java again.
You are done!

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
java_sentence_completion		java_sentence_completion
LICENSE.txt		LICENSE.txt
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

java_sentence_completion

java_sentence_completion

LICENSE.txt

LICENSE.txt

README.md

README.md

Repository files navigation

Implementation of confabulation theory at the symbolic level and experiments

Overview of the different programs

Setup, build and run with Eclipse

Prerequisites

Setup

Build and run

About

Releases

Packages

Languages

License

confabulation/symbolic

Folders and files

Latest commit

History

Repository files navigation

Implementation of confabulation theory at the symbolic level and experiments

Overview of the different programs

Setup, build and run with Eclipse

Prerequisites

Setup

Build and run

About

Resources

License

Stars

Watchers

Forks

Languages