Apache Pig

Pig is a dataflow programming environment for processing very large files. Pig's language is called Pig Latin. A Pig Latin program consists of a directed acyclic graph where each node represents an operation that transforms data. Operations are of two flavors: (1) relational-algebra style operations such as join, filter, project; (2) functional-programming style operators such as map, reduce.

Pig compiles these dataflow programs into (sequences of) map-reduce or Apache Tez jobs and executes them using Hadoop. It is also possible to execute Pig Latin programs in a "local" mode (without Hadoop cluster), in which case all processing takes place in a single local JVM.

General Info

For the latest information about Pig, please visit our website at:

http://pig.apache.org/

and our wiki, at:

http://wiki.apache.org/pig/

Getting Started

To learn about Pig, try http://wiki.apache.org/pig/PigTutorial
To build and run Pig, try http://wiki.apache.org/pig/BuildPig and http://wiki.apache.org/pig/RunPig
To check out the function library, try http://wiki.apache.org/pig/PiggyBank

Contributing to the Project

We welcome all contributions. For the details, please, visit http://wiki.apache.org/pig/HowToContribute.

Name		Name	Last commit message	Last commit date
Latest commit History 2,188 Commits
.eclipse.templates		.eclipse.templates
bin		bin
conf		conf
contrib		contrib
ivy		ivy
lib-src/bzip2/org/apache		lib-src/bzip2/org/apache
license		license
shims		shims
src		src
test		test
tutorial		tutorial
.gitignore		.gitignore
CHANGES.txt		CHANGES.txt
KEYS		KEYS
LICENSE		LICENSE
NOTICE.txt		NOTICE.txt
README.md		README.md
RELEASE_NOTES.txt		RELEASE_NOTES.txt
autocomplete		autocomplete
build.xml		build.xml
doap_Pig.rdf		doap_Pig.rdf
ivy.xml		ivy.xml

License

hrishikeshvganu/spork

Folders and files

Latest commit

History

Repository files navigation

Apache Pig

General Info

Getting Started

Contributing to the Project

About

Resources

License

Stars

Watchers

Forks

Languages