HDFS UI
Open the HDFS UI in a browser; the URL is {hadoop_master_ip}:50070.
This article aims to explain everything in the HDFS Overview UI in detail, oth
...
Background
Setting up, installing, and upgrading environments is never easy. This article describes how to use and create Debian packages to help set up
...
Purpose
Operating a Hadoop cluster is not easy. There are many parameters, and no single configuration fits all situations. Monitorin
...
Wiki description
According to the wiki, the CAP theorem states that it is impossible for a distributed data store to simultaneously provide more than two
...
Background
This article discusses how the Hive generic user-defined function (GenericUDF) works. The Javadoc says GenericUDF can do short-c
...
This article explains how to create Git hooks on macOS and how to use customized Git hook chains.
Preparation for creating hook chain
(This part ref
...
Whether a job runs in local mode or distributed mode is decided by mapreduce.framework.name. In local mode, the mapper and reducer will run locally in
...
This article introduces the manually configured settings that override the defaults when using Hive.
Environment
This article is bas
...
This article explains the Hive scratch directory.
Scratch directory usage
The Hive scratch directory is a temporary working space for storing the pla
...
Background
When running Hive jobs on MapReduce in Hadoop clusters, we always limit how much local and HDFS disk space a job can use.
...