pentaho-hadoop-shims

Hadoop Configurations, also known as shims and the Pentaho Big Data Adaptive layer, are collections of Hadoop libraries required to communicate with a specific version of Hadoop (and related tools: Hive, HBase, Sqoop, Pig, etc.). They are designed to be easily configured.

Pre-requisites for building the project:

  • Maven, version 3+
  • Java JDK 1.8
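
Both prerequisites can be checked from the command line, since `mvn -version` reports the Maven version together with the Java runtime it will use. The sketch below wraps that in a small helper. `require_tool` is a hypothetical name used for illustration only and is not part of this project.

```shell
# Hypothetical helper (not part of this repository): fail early if a
# required build tool is not on the PATH.
require_tool() {
  if command -v "$1" >/dev/null 2>&1; then
    echo "found: $1"
  else
    echo "missing: $1" >&2
    return 1
  fi
}

# Usage: check both tools, then print the Maven and Java versions in use.
# require_tool mvn && require_tool java && mvn -version
```

The usage line is left commented out so that defining the helper has no side effects. Compare the printed versions against the list above.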

Building it

Build for nightly/release

$ mvn clean install -Drelease

This will build, unit test, and package the whole project (all of the sub-modules). Every sub-module in pentaho-hadoop-shims is an independent set of libraries and resources for one Hadoop vendor.

Running the tests

Unit tests

This will run all tests in the project (and sub-modules).

$ mvn test

To remote-debug a single Java unit test (the default port is 5005):

$ cd server/core
$ mvn test -Dtest=PlaceTest -Dmaven.surefire.debug

Running tests on Windows

Running tests on Windows requires additional environment setup because of known problems running Hadoop on Windows (see https://wiki.apache.org/hadoop/WindowsProblems).

Specifically, the hadoop.home.dir property must point to a directory that contains \bin\winutils.exe.

To set it, pass the property on the Maven command line:

$ mvn test -Dhadoop.home.dir="d:\TEMP_DIR"

or

$ mvn clean install -Dhadoop.home.dir="d:\TEMP_DIR"
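
A wrong path tends to surface only later as a Hadoop error, so it can help to check the directory layout before starting the build. The following is a minimal sketch, assuming a POSIX-style shell on Windows (for example Git Bash). `check_hadoop_home` is a hypothetical helper and not part of this project, and the path in the usage line is only the example directory from the commands above.

```shell
# Hypothetical helper: succeed only if the given directory contains
# bin/winutils.exe, which is what hadoop.home.dir is expected to point at.
check_hadoop_home() {
  if [ -f "$1/bin/winutils.exe" ]; then
    echo "ok: $1"
  else
    echo "missing: $1/bin/winutils.exe" >&2
    return 1
  fi
}

# Usage (example path only; adjust to your own directory):
# check_hadoop_home "d:/TEMP_DIR" && mvn test -Dhadoop.home.dir="d:\TEMP_DIR"
```

The helper only verifies that the file exists. It does not check that the winutils.exe build matches the Hadoop version in use.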

Integration tests

In addition to the unit tests, there are integration tests in the core project.

$ mvn verify -DrunITs

To run a single integration test:

$ mvn verify -DrunITs -Dit.test=<<YourIT>>

To run a single integration test in debug mode (for remote debugging in an IDE) on the default port of 5005:

$ mvn verify -DrunITs -Dit.test=<<YourIT>> -Dmaven.failsafe.debug

PentahoMapReduceIT.java

IntelliJ

  • Don't use IntelliJ's built-in Maven. Make it use the same one you use from the command line.
    • Project Preferences -> Build, Execution, Deployment -> Build Tools -> Maven -> Maven home directory
