Docs update quickstart iceberg (#239)
* add note for Iceberg sourceFormat

* Revert "add note for Iceberg sourceFormat"

This reverts commit 245e08e.

* add note for Iceberg sourceFormat

* Update how-to.md

add `people` to tableBasePath

* Update how-to.md

fixed typos

* clarified context on tableDataPath
sagarlakshmipathy authored Nov 18, 2023
1 parent f737637 commit 83fb7e3
Showing 1 changed file with 13 additions and 6 deletions.
19 changes: 13 additions & 6 deletions website/docs/how-to.md
@@ -270,9 +270,13 @@ targetFormats:
 datasets:
   -
     tableBasePath: file:///tmp/iceberg-dataset/people
-    dataBasePath: file:///tmp/iceberg-dataset/people/data
+    tableDataPath: file:///tmp/iceberg-dataset/people/data
     tableName: people
 ```
+:::note Note:
+Add `tableDataPath` for ICEBERG sourceFormat if the `tableBasePath` is different from the path to the data.
+:::

 </TabItem>
 </Tabs>

@@ -296,7 +300,7 @@ targetFormats:
   - ICEBERG
 datasets:
   -
-    tableBasePath: s3://path/to/hudi-data # replace this with gs://path/to/hudi_data if your data is in GCS.
+    tableBasePath: s3://path/to/hudi-dataset/people # replace this with gs://path/to/hudi-dataset/people if your data is in GCS.
     tableName: people
     partitionSpec: city:VALUE
 ```
@@ -311,7 +315,7 @@ targetFormats:
   - ICEBERG
 datasets:
   -
-    tableBasePath: s3://path/to/delta-data # replace this with gs://path/to/delta_data if your data is in GCS.
+    tableBasePath: s3://path/to/delta-dataset/people # replace this with gs://path/to/delta-dataset/people if your data is in GCS.
     tableName: people
 ```

@@ -325,10 +329,13 @@ targetFormats:
   - DELTA
 datasets:
   -
-    tableBasePath: s3://path/to/iceberg # replace this with gs://path/to/iceberg_data if your data is in GCS.
-    tableDataPath: s3://path/to/iceberg/data
+    tableBasePath: s3://path/to/iceberg-dataset/people # replace this with gs://path/to/iceberg-dataset/people if your data is in GCS.
+    tableDataPath: s3://path/to/iceberg-dataset/people/data
     tableName: people
 ```
+:::note Note:
+Add `tableDataPath` for ICEBERG sourceFormat if the `tableBasePath` is different from the path to the data.
+:::

 </TabItem>
 </Tabs>
@@ -357,4 +364,4 @@ that can be used to query the source table in different target table formats.

 ## Next steps
 Go through the [Catalog Integration guides](/docs/catalogs-index) to register the OneTable synced tables
-in different data catalogs.
+in different data catalogs.
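
The note added in this commit states one rule: for an ICEBERG `sourceFormat`, set `tableDataPath` only when the data lives somewhere other than `tableBasePath`. Below is a minimal Python sketch of that rule, assuming an Iceberg source. `dataset_entry` and `render_dataset` are hypothetical helpers written for illustration and are not part of OneTable. The two-space YAML indentation is also an assumption, and the paths are the placeholders used in the diff.

```python
def dataset_entry(table_base_path, table_name, data_path=None):
    """Build one `datasets` entry for an Iceberg source (hypothetical helper)."""
    entry = {"tableBasePath": table_base_path}
    # Emit tableDataPath only when the data path differs from the base path;
    # a trailing slash is not treated as a difference.
    if data_path is not None and data_path.rstrip("/") != table_base_path.rstrip("/"):
        entry["tableDataPath"] = data_path
    entry["tableName"] = table_name
    return entry


def render_dataset(entry):
    """Render the entry as the `datasets:` YAML fragment shown in the docs."""
    lines = ["datasets:", "  -"]
    lines += [f"    {key}: {value}" for key, value in entry.items()]
    return "\n".join(lines)


if __name__ == "__main__":
    base = "s3://path/to/iceberg-dataset/people"
    print(render_dataset(dataset_entry(base, "people", base + "/data")))
```

If `data_path` is omitted or matches the base path, the sketch leaves `tableDataPath` out of the entry.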