Popular "hdfs" questions | Page 11

We've realized a bit too late that archiving our files in GZip format for Hadoop processing isn't such a great …

hadoop gzip hdfs bzip2

I tried to define what the high throughput vs low latency means in HDFS in my own words, and came …

hadoop hdfs low-latency throughput

I installed the Cloudera CDH4 distribution on a single machine in pseudo-distributed mode and successfully tested that it was working …

hadoop hdfs cloudera

I have data that's already grouped and aggregated, it looks like so: user value count ---- -------- ------ Alice third 5 …

hadoop hdfs apache-pig

I have a big distributed file on HDFS and each time I use sqlContext with spark-csv package, it first loads …

apache-spark pyspark hdfs apache-spark-sql spark-csv

How can I view how many blocks has a file been broken into, in a Hadoop file system?

hadoop hdfs

i have a problem in setting hadoop file permissions in hortonworks and cloudera. My requirement is: 1. create a new user …

hadoop permissions hdfs cloudera hortonworks-data-platform

Is there a way to delete files older than 10 days on HDFS? In Linux I would use: find /path/to/…

hadoop hdfs

Do we need to verify checksum after we move files to Hadoop (HDFS) from a Linux server through a Webhdfs ? …

hadoop hdfs checksum

I am a newbie in Hadoop trying to install Hbase in pseudo distributed mode, version hbase-0.98.10.1-hadoop1-bin, with Hadoop 2.5.2 . …

hadoop hbase hdfs

Top "Hdfs" questions