15.3. HBase Data Store Parameters

Use the following parameters for an HBase data store (required parameters are marked with *):

Parameter

Type

Description

hbase.catalog *

String

The name of the GeoMesa catalog table, including the HBase namespace (if any) separated by a :. For example, myCatalog or myNamespace:myCatalog

hbase.zookeepers

String

A comma-separated list of servers in the HBase zookeeper ensemble. This is optional, the preferred method for defining the HBase connection is with hbase-site.xml

hbase.coprocessor.url

String

Path to the GeoMesa jar containing coprocessors, for auto registration

hbase.config.paths

String

Additional HBase configuration resource files (comma-delimited)

hbase.config.xml

String

Additional HBase configuration properties, as a standard XML <configuration> element

hbase.connections.reuse

Boolean

Re-use and share HBase connections, or create a new one for this data store

hbase.remote.filtering

Boolean

Can be used to disable remote filtering and coprocessors, for environments where custom code can’t be installed

hbase.security.enabled

Boolean

Enable HBase security (visibilities)

geomesa.security.auths

String

Comma-delimited superset of authorizations that will be used for queries

geomesa.security.force-empty-auths

Boolean

Forces authorizations to be empty

geomesa.query.audit

Boolean

Audit queries being run. Queries will be written to a log file

geomesa.query.timeout

String

The max time a query will be allowed to run before being killed. The timeout is specified as a duration, e.g. 1 minute or 60 seconds

geomesa.query.threads

Integer

The number of threads to use per query

hbase.coprocessor.threads

Integer

The number of HBase RPC threads to use per coprocessor query

geomesa.query.loose-bounding-box

Boolean

Use loose bounding boxes - queries will be faster but may return extraneous results

hbase.ranges.max-per-extended-scan

Integer

Max ranges per extended scan. Ranges will be grouped into scans based on this setting

hbase.ranges.max-per-coprocessor-scan

Integer

Max ranges per coprocessor scan. Ranges will be grouped into scans based on this setting

hbase.coprocessor.arrow.enable

Boolean

Disable coprocessor scans for Arrow queries, and use local encoding instead

hbase.coprocessor.bin.enable

Boolean

Disable coprocessor scans for Bin queries, and use local encoding instead

hbase.coprocessor.density.enable

Boolean

Disable coprocessor scans for density queries, and use local processing instead

hbase.coprocessor.stats.enable

Boolean

Disable coprocessor scans for stat queries, and use local processing instead

hbase.coprocessor.yield.partial.results

Boolean

Toggle coprocessors yielding partial results

hbase.coprocessor.scan.parallel

Boolean

Toggle extremely parallel coprocessor scans (bounded by RPC threads)

geomesa.stats.enable

Boolean

Toggle collection of statistics (currently not implemented)

geomesa.partition.scan.parallel

Boolean

For partitioned schemas, execute scans in parallel instead of sequentially

Note

The HBase data store will also load any hbase-site.xml file that is available on the classpath.

Note

The hbase.coprocessor.*.enable parameters will be superseded by hbase.remote.filtering=false.

The HBase data store needs an hbase-site.xml file in order to connect to HBase. This can be provided in multiple ways. For simple use cases, the Zookeeper connection can be specified through the parameter hbase.zookeepers. For normal use, hbase-site.xml can be provided at the root of the classpath, at an arbitrary location pointed to by hbase.config.paths, inline using hbase.config.xml, or any combination of the above.

15.3.1. Programmatic Access

An instance of an HBase data store can be obtained through the normal GeoTools discovery methods, assuming that the GeoMesa code is on the classpath.

Map<String, Serializable> parameters = new HashMap<>();
parameters.put("hbase.catalog", "geomesa");
org.geotools.api.data.DataStore dataStore =
    org.geotools.api.data.DataStoreFinder.getDataStore(parameters);

More information on using GeoTools can be found in the GeoTools user guide.