Hadoop is an Apache open-source project that provides software for reliable and scalable distributed computing.
I just downloaded the Hortonworks sandbox VM; inside it there is Hadoop version 2.7.1. I am adding some files by using …
hadoop hdfs hortonworks-data-platform
Based on the Hive doc below: Rename Table ALTER TABLE table_name RENAME TO new_table_name; This statement lets …
hadoop hive hiveql
Is there a Hive query to quickly find table size (i.e. number of rows) without launching a time-consuming MapReduce …
hadoop hive
I'm new to Spark. Now I can run Spark 0.9.1 on YARN (2.0.0-cdh4.2.1). But there is no log after execution. The …
hadoop logging apache-spark cloudera yarn
I'm planning to use one of the Hadoop file formats for my Hadoop-related project. I understand Parquet is efficient …
hadoop avro parquet
Is there an HDFS command to see the available free space in HDFS? We can see that through the browser at master:…
hadoop hdfs
As far as I understand: sort by only sorts within the reducer; order by orders things globally but shoves …
hadoop hql hive
I have installed Cloudera CDH 5 by using Cloudera Manager. I can easily do hadoop fs -ls /input/war-and-peace.txt hadoop …
hadoop apache-spark cloudera-cdh
I have the following string representation of a timestamp in my Hive table: 20130502081559999 I need to convert it to a …
hadoop hive hiveql
I am trying to set up Hadoop version 0.20.203.0 in a pseudo-distributed configuration using the following guide: http://www.javacodegeeks.com/2012/01/…
hadoop hdfs