
parquet


parquet is a binary data format created in 2014 by Doug Cutting and Julien Le Dem.

#1994 on PLDB, 10 years old

Apache Parquet is a free and open-source column-oriented data storage format in the Apache Hadoop ecosystem. It is similar to RCFile and ORC, the other columnar-storage file formats in Hadoop, and is compatible with most of the data processing frameworks around Hadoop. It provides efficient data compression and encoding schemes with enhanced performance to handle complex data in bulk.
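
To make "column-oriented" concrete, here is a minimal sketch in plain Python. It is not Parquet's actual on-disk layout or API; the sample records and the helper names `to_columns` and `run_length_encode` are hypothetical, chosen only to illustrate why grouping values by column tends to help compression and column-wise scans.

```python
from itertools import groupby

# Hypothetical sample records; field names and values are illustrative only.
rows = [
    {"country": "US", "year": 2014, "clicks": 10},
    {"country": "US", "year": 2014, "clicks": 12},
    {"country": "US", "year": 2015, "clicks": 7},
    {"country": "FR", "year": 2015, "clicks": 3},
]


def to_columns(records):
    """Pivot a list of row dicts into a dict mapping column name to value list."""
    return {name: [record[name] for record in records] for name in records[0]}


def run_length_encode(values):
    """Collapse consecutive repeated values into (value, count) pairs."""
    return [(value, len(list(group))) for value, group in groupby(values)]


# Column-oriented view: all values of one field sit next to each other.
columns = to_columns(rows)

# Neighbouring values in a column are often equal, so a simple encoding shrinks them.
encoded_country = run_length_encode(columns["country"])

# An aggregate over one field reads only that column, not every full row.
total_clicks = sum(columns["clicks"])
```

In this toy layout, the `country` column collapses to two runs, and summing `clicks` touches a single list. Real Parquet files add row groups, pages, schemas, and several encodings on top of this basic idea.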

