jurgenprins/WebXtractor-Appengine
WebXtractor-Appengine is a Java implementation based on the concepts of the WebXtractor-PHP library. It uses:

1) Google App Engine
   * JDO persistence
   * Task Queues
2) Spring Framework (integration)
   * Servlet Dispatcher
   * Bean injection
3) Google Web Toolkit

to implement a demo of the WebXtraction library, which extracts normalized web items (links or images) from any URL and lets the robot automatically follow subsequent navigation links.

A demo is deployed at http://webxtractor.appspot.com/
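To illustrate what "normalized web items" can mean in practice, here is a minimal sketch that pulls link and image URLs out of an HTML string and resolves them against the page URL. It is not the WebXtractor code or API: the class name `LinkExtractorSketch`, the method `extractItems`, and the regex-based matching are all assumptions made for this example. It uses only the Java standard library, and a real extractor would more likely rely on a proper HTML parser.

```java
import java.net.URI;
import java.util.ArrayList;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, not part of WebXtractor-Appengine.
public class LinkExtractorSketch {

    // Finds an href or src attribute inside an <a> or <img> tag.
    // Group 1 = tag name, group 2 = attribute name, group 4 = attribute value.
    private static final Pattern ITEM = Pattern.compile(
            "<(a|img)\\b[^>]*?\\b(href|src)\\s*=\\s*([\"'])(.*?)\\3",
            Pattern.CASE_INSENSITIVE | Pattern.DOTALL);

    // Returns absolute, de-duplicated URLs in the order they first appear.
    public static List<String> extractItems(String html, String baseUrl) {
        URI base = URI.create(baseUrl);
        Set<String> seen = new LinkedHashSet<>();
        Matcher m = ITEM.matcher(html);
        while (m.find()) {
            String tag = m.group(1).toLowerCase();
            String attr = m.group(2).toLowerCase();
            // Only accept <a href=...> and <img src=...>.
            boolean wanted = (tag.equals("a") && attr.equals("href"))
                    || (tag.equals("img") && attr.equals("src"));
            if (!wanted) {
                continue;
            }
            String raw = m.group(4).trim();
            // Skip empty values, in-page anchors, and script pseudo-links.
            if (raw.isEmpty() || raw.startsWith("#")
                    || raw.toLowerCase().startsWith("javascript:")) {
                continue;
            }
            try {
                // Normalize: resolve relative references against the page URL.
                seen.add(base.resolve(raw).normalize().toString());
            } catch (IllegalArgumentException e) {
                // Malformed URL in the page; ignore it in this sketch.
            }
        }
        return new ArrayList<>(seen);
    }

    public static void main(String[] args) {
        String html = "<a href=\"/a\">A</a><img src='img/pic.png'>";
        for (String url : extractItems(html, "http://example.com/dir/page.html")) {
            System.out.println(url);
        }
    }
}
```

Following navigation links, as the demo's robot does, would mean feeding selected results back in as new page URLs. That step is left out here because it depends on how pages are fetched and which links count as navigation.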
About
Extracting lists of relevant links or images from HTML, including automatically following navigation links