-
Notifications
You must be signed in to change notification settings - Fork 143
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
- Loading branch information
1 parent
08053f5
commit 37f0fe2
Showing
1 changed file
with
34 additions
and
1 deletion.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1 +1,34 @@ | ||
WIP | ||
--- | ||
title: "Release 0.1.0-incubating" | ||
sidebar_position: 1 | ||
--- | ||
|
||
## [Release 0.1.0-incubating](https://github.com/apache/incubator-xtable/releases/tag/0.1.0-incubating) ([docs](https://xtable.apache.org/docs/how-to)) | ||
This is the first official apache release for Apache XTable, an incubating project under the Apache Software Foundation. | ||
Apache XTable™ (Incubating) is a cross-table converter for table formats that facilitates omni-directional interoperability across data processing systems and query engines. | ||
Currently, Apache XTable™ supports widely adopted open-source table formats such as Apache Hudi, Apache Iceberg, and Delta Lake. | ||
|
||
## Features | ||
Apache XTable™ (Incubating) provides users with the ability to translate metadata from one table format to another. | ||
|
||
Apache XTable™ (Incubating) provides two sync modes, "incremental" and "full." The incremental mode is more lightweight and has better performance, especially on large tables. If there is anything that prevents the incremental mode from working properly, the tool will fall back to the full sync mode. | ||
|
||
This sync provides users with the following: | ||
|
||
1. Syncing of data files along with their column level statistics and partition metadata | ||
2. Schema updates in the source are reflected in the target table metadata | ||
3. Metadata maintenance for the target table formats. | ||
* For Hudi, unreferenced files will be marked as [cleaned](https://hudi.apache.org/docs/hoodie_cleaner/) to control the size of the metadata table. | ||
* For Iceberg, snapshots will be [expired](https://iceberg.apache.org/docs/latest/maintenance/#expire-snapshots) after a configured amount of time. | ||
* For Delta, the transaction log will be [retained](https://docs.databricks.com/en/sql/language-manual/delta-vacuum.html) for a configured amount of time. | ||
|
||
|
||
## Improvements | ||
1. Added apache release guide and infra components to be compliant with ASF release process. | ||
2. Fix bugs related to dependency convicts, few edge cases related when parsing column stats. | ||
3. Improved README, docker demo and website docs based on feedback provided by users. | ||
4. Refactored the codebase to follow apache naming practices. | ||
[apache-xtable-0.1.0-incubating.src (1).tgz](https://github.com/user-attachments/files/16709894/apache-xtable-0.1.0-incubating.src.1.tgz) | ||
|
||
## Raw Release Notes | ||
https://github.com/apache/incubator-xtable/compare/v0.1.0-beta1...0.1.0-incubating |