Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[docs] added run_syn_tool.sh with HMS example and also added more on S3 changes needed for Trino. #458

Open
wants to merge 3 commits into
base: main
Choose a base branch
from
Open
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
26 changes: 23 additions & 3 deletions website/docs/hms.md
Original file line number Diff line number Diff line change
Expand Up @@ -87,8 +87,9 @@ datasets:
:::

From your terminal under the cloned Apache XTable™ (Incubating) directory, run the sync process using the below command.
```shell md title="shell"
java -jar xtable-utilities/target/xtable-utilities-0.2.0-SNAPSHOT-bundled.jar --datasetConfig my_config.yaml
```shell in JDBC md title="shell in JDBC"
java -jar xtable-utilities/target/xtable-utilities-0.1.0-SNAPSHOT-bundled.jar --datasetConfig my_config.yaml
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We updated the version to 0.2.0-SNAPSHOT in main branch now.

I feel it's better to use the downloads from maven for the demo going forward. WDYT ?
https://xtable.apache.org/releases/downloads/


```

:::tip Note:
Expand All @@ -114,6 +115,17 @@ A Hudi table can directly be synced to the Hive Metastore using Hive Sync Tool
and subsequently be queried by different query engines. For more information on the Hive Sync Tool, check
[Hudi Hive Metastore](https://hudi.apache.org/docs/syncing_metastore) docs.

```shell md title="shell with HMS example"
cd $HUDI_HOME/hudi-sync/hudi-hive-sync ; \
./run_sync_tool.sh \
--metastore-uris '<thrift_url>' \
--partitioned-by <partition_field> \
--base-path '<path/to/s3/synced/hudi/table>' \
--database <database_name> \
--table <tableName> \
--sync-mode hms
```

```shell md title="shell"
alberttwong marked this conversation as resolved.
Show resolved Hide resolved
cd $HUDI_HOME/hudi-sync/hudi-hive-sync

Expand All @@ -122,7 +134,7 @@ cd $HUDI_HOME/hudi-sync/hudi-hive-sync
--user <username> \
--pass <password> \
--partitioned-by <partition_field> \
--base-path <'/path/to/synced/hudi/table'> \
--base-path '<path/to/synced/hudi/table>' \
--database <database_name> \
--table <tableName>
```
Expand Down Expand Up @@ -218,6 +230,14 @@ using query engines like `Presto` and/or `Trino`. Check out the guides for query
SELECT * FROM iceberg_db.<table_name>;
```

```sql md title="sql for Iceberg, Hudi and Delta Lake in S3"
alberttwong marked this conversation as resolved.
Show resolved Hide resolved
select * from hudi.hudi_db.people;
select * from delta.delta_db.people;
select * from iceberg.iceberg_db.people;
```

In the example demo docker image, you can find the catalog configurations.

</TabItem>
</Tabs>

Expand Down