Apache Spark - Apache HBase Connector

Change Log

This version is based on Hortonworks SHC. It can write a DataFrame to HBase using bulk load. For example:

df.write
  .format("hbase")
  .option(HBaseTableCatalog.tableName, "test_table")
  .option(HBaseTableCatalog.rowKey, "rk")
  .option(HBaseTableCatalog.cf, "f")
  .option(HBaseRelation.WRITE_MODE, HBaseRelation.Restrictive.BULKLOAD)
  .option(HBaseRelation.HFILE_TEMP_PATH, "hdfs:///tmp/hfile")
  .save()

// Structured Streaming is also supported
df.writeStream
  .format("hbase")
  .option("checkpointLocation", "hdfs:///tmp/structured-streaming-checkpoint/")
  .option(HBaseTableCatalog.tableName, "test_table")
  .option(HBaseTableCatalog.rowKey, "rk")
  .option(HBaseTableCatalog.cf, "f")
  .option(HBaseRelation.WRITE_MODE, HBaseRelation.Restrictive.BULKLOAD)
  .option(HBaseRelation.HFILE_TEMP_PATH, "hdfs:///zmk/hfile")
  .outputMode(OutputMode.Append())
  .trigger(Trigger.ProcessingTime(Seconds(10).milliseconds))
  .start()
  .awaitTermination()

More information can be found here.

