Top "Partitioning" questions

Partitioning is a performance strategy whereby you divide possibly very large groups of data into some number of smaller groups of data.

Algorithm for finding nearby points?

Given a set of several million points with x,y coordinates, what is the algorithm of choice for quickly finding …

algorithm gis partitioning distance linear-algebra
Spark lists all leaf node even in partitioned data

I have parquet data partitioned by date & hour, folder structure: events_v3 -- event_date=2015-01-01 -- event_…

apache-spark amazon-s3 apache-spark-sql partitioning parquet
Changing a partition with fdisk shows a warning like "partition#x contains ext4-signature"

I'm shrinking a partion size with #Reduce Partition Size fsck -f /dev/sdb2 resize2fs /dev/sdb2 -M -p #Limit …

filesystems partitioning partition ext4
How does range partitioner work in Spark?

I'm not so clear about how range partitioner works in Spark. It uses (Reservoir Sampling) to take samples. And I …

apache-spark partitioning
How to partition MySQL table by column that is not in the unique index

Let's have a simple table of products. Each produch has its unique ID and category. Users often search by category …

mysql partitioning unique-index
Is it possible to partially refresh a materialized view in Oracle?

I have a very complex Oracle view based on other materialized views, regular views as well as some tables (I …

oracle data-warehouse partitioning materialized-views
SQL Error: ORA-14006: invalid partition name

I am trying to partition an existing table in Oracle 12C R1 using below SQL statement. ALTER TABLE TABLE_NAME …

oracle partitioning partition database-partitioning
Clustering, Sharding or simple Partition / Replication

I need some advice from you experts on this subject. The thing is that we have created a Facebook application …

mysql database-design partitioning cluster-computing sharding
Dynamic MySQL partitioning based on UnixTime

My DB design includes multiple MYISAM tables with measurements collected online, Each row record contains auto-incremented id, some data and …

mysql dynamic partitioning myisam unix-timestamp
Partitioning in spark while reading from RDBMS via JDBC

I am running spark in cluster mode and reading data from RDBMS via JDBC. As per Spark docs, these partitioning …

apache-spark jdbc apache-spark-sql partitioning