You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Parquet-MR contains the java implementation of the [Parquet format](https://github.com/apache/parquet-format).
24
-
Parquet is a columnar storage format for Hadoop; it provides efficient storage and encoding of data.
25
-
Parquet uses the [record shredding and assembly algorithm](https://github.com/julienledem/redelm/wiki/The-striping-and-assembly-algorithms-from-the-Dremel-paper) described in the Dremel paper to represent nested structures.
23
+
This repository contains a Java implementation of [Apache Parquet](https://parquet.apache.org/)
24
+
25
+
Apache Parquet is an open source, column-oriented data file format
26
+
designed for efficient data storage and retrieval. It provides high
27
+
performance compression and encoding schemes to handle complex data in
28
+
bulk and is supported in many programming language and analytics
29
+
tools.
26
30
27
-
You can find some details about the format and intended use cases in our [Hadoop Summit 2013 presentation](http://www.slideshare.net/julienledem/parquet-hadoop-summit-2013)
31
+
The [parquet-format](https://github.com/apache/parquet-format)
32
+
repository contains the file format specificiation.
33
+
34
+
Parquet uses the [record shredding and assembly algorithm](https://github.com/julienledem/redelm/wiki/The-striping-and-assembly-algorithms-from-the-Dremel-paper) described in the Dremel paper to represent nested structures.
35
+
You can find additional details about the format and intended use cases in our [Hadoop Summit 2013 presentation](http://www.slideshare.net/julienledem/parquet-hadoop-summit-2013)
0 commit comments