Skip to content

Commit 6809a18

Browse files
authored
PARQUET-2478: Update README with link to parquet website (#1355)
Update the introductory content to reduce confusion about parquet in general.
1 parent f12229a commit 6809a18

1 file changed

Lines changed: 12 additions & 4 deletions

File tree

README.md

Lines changed: 12 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -20,11 +20,19 @@
2020
Parquet MR [![Build Status](https://github.com/apache/parquet-mr/workflows/Test/badge.svg)](https://github.com/apache/parquet-mr/actions)
2121
======
2222

23-
Parquet-MR contains the java implementation of the [Parquet format](https://github.com/apache/parquet-format).
24-
Parquet is a columnar storage format for Hadoop; it provides efficient storage and encoding of data.
25-
Parquet uses the [record shredding and assembly algorithm](https://github.com/julienledem/redelm/wiki/The-striping-and-assembly-algorithms-from-the-Dremel-paper) described in the Dremel paper to represent nested structures.
23+
This repository contains a Java implementation of [Apache Parquet](https://parquet.apache.org/)
24+
25+
Apache Parquet is an open source, column-oriented data file format
26+
designed for efficient data storage and retrieval. It provides high
27+
performance compression and encoding schemes to handle complex data in
28+
bulk and is supported in many programming language and analytics
29+
tools.
2630

27-
You can find some details about the format and intended use cases in our [Hadoop Summit 2013 presentation](http://www.slideshare.net/julienledem/parquet-hadoop-summit-2013)
31+
The [parquet-format](https://github.com/apache/parquet-format)
32+
repository contains the file format specificiation.
33+
34+
Parquet uses the [record shredding and assembly algorithm](https://github.com/julienledem/redelm/wiki/The-striping-and-assembly-algorithms-from-the-Dremel-paper) described in the Dremel paper to represent nested structures.
35+
You can find additional details about the format and intended use cases in our [Hadoop Summit 2013 presentation](http://www.slideshare.net/julienledem/parquet-hadoop-summit-2013)
2836

2937
## Building
3038

0 commit comments

Comments
 (0)