Recently Azure announced Data Lake Gen 2 preview. As far as I know the main difference between Gen 1 and Gen 2 (in terms of functionality) is the Object Store and File System access over the same data at the same time. Other differences would be the price, available location etc. Can anyone explain what are the other key differences between Gen 1 and Gen 2?
Basically, think of gen2 as a superset of gen1 plus all of the best parts of blob storage: tiers, HDFS and object store API's and presumably the ability to efficiently handle the management of over 35K files and efficiently dealing with many small sizes and more trickle write type operations.. plus its cheaper.
I'm trying to get some clarity on a few specifics but not finding much in the meantime try these links:
https://azure.microsoft.com/en-us/blog/a-closer-look-at-azure-data-lake-storage-gen2/
https://docs.microsoft.com/en-us/azure/storage/data-lake-storage/introduction