Neuw84/wikipediaminer

wikipediaminer

An open source toolkit for mining Wikipedia, forked from https://github.com/dnmilne/wikipediaminer

Contains some improvements to the web services and many bug fixes to Milne's sources.

Documentation: https://github.com/dnmilne/wikipediaminer/wiki

TODO:

Add support for live snapshots of Wikipedia (the DBpedia approach) to stay up to date.
Implement other disambiguation approaches, such as http://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=6354382
Support binary data in the web services (Thrift, for example) to avoid problems with UTF-8 characters.

Add this repository to your pom.xml:

<repository>
    <id>galan-maven-repo</id>
    <name>galan-maven-repo-releases</name>
    <url>http://galan.ehu.es/artifactory/ext-release-local</url>
</repository>

Then add the required subproject, for example:

<dependency>
    <groupId>org.wikipedia-miner</groupId>
    <artifactId>wikipedia-miner-core</artifactId>
    <version>1.2.4</version>
</dependency>
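
For orientation, here is a minimal sketch of how the two fragments might sit together in a complete pom.xml. A repository entry goes inside a `<repositories>` element and a dependency entry inside a `<dependencies>` element, as in any Maven project. The project coordinates (`com.example`, `my-wikipedia-app`, `0.1.0-SNAPSHOT`) are hypothetical placeholders, not part of this repository; replace them with your own.

```xml
<project xmlns="http://maven.apache.org/POM/4.0.0">
    <modelVersion>4.0.0</modelVersion>

    <!-- Hypothetical coordinates for your own project (assumption) -->
    <groupId>com.example</groupId>
    <artifactId>my-wikipedia-app</artifactId>
    <version>0.1.0-SNAPSHOT</version>

    <!-- Repository that hosts the wikipedia-miner artifacts -->
    <repositories>
        <repository>
            <id>galan-maven-repo</id>
            <name>galan-maven-repo-releases</name>
            <url>http://galan.ehu.es/artifactory/ext-release-local</url>
        </repository>
    </repositories>

    <!-- Subproject to pull in; wikipedia-miner-core is the example used here -->
    <dependencies>
        <dependency>
            <groupId>org.wikipedia-miner</groupId>
            <artifactId>wikipedia-miner-core</artifactId>
            <version>1.2.4</version>
        </dependency>
    </dependencies>
</project>
```

If your pom.xml already has `<repositories>` or `<dependencies>` sections, add the inner `<repository>` and `<dependency>` entries to the existing sections rather than creating new ones.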


