-
What's PigML?
PigML is a machine learning pack leveraging Apache Pig.
Machine learning algorithms in PigML will typically consists a) Pig UDFs - the atomic and core part algorithm b) PigLatin scripts - connect and make data flow between UDFs, including data loading, storing, filtering, joining, grouping etc c) shell scripts - connect and make data flow between PigLatin scripts.
-
Why PigML?
When talking about big data and machine learning, there exists alternatives like Apache Mahout. Yet we believe Pig is a great utility doing this job in both development efficiency and runtime flexibility.
-
How to use it?
Description of algorithms and easy to go sample Pig scripts (PigLatin) will come along with the codes.
-
How to contribute?
[TODO]
hanborq/pigml
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
An Apache Pig based machine learning pack for bigdata
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published