13.1. Installing GeoMesa Cassandra¶
13.1.1. Connecting to Cassandra¶
The first step to getting started with Cassandra and GeoMesa is to install Cassandra itself. You can find good directions for downloading and installing Cassandra online. For example, see Cassandra’s official getting started documentation.
Once you have Cassandra installed, the next step is to prepare your Cassandra installation
to integrate with GeoMesa. First, create a key space within Cassandra. The easiest way to
do this with cqlsh
, which should have been installed as part of your Cassandra installation.
Start cqlsh
, then type:
CREATE KEYSPACE mykeyspace WITH REPLICATION = {'class': 'SimpleStrategy', 'replication_factor' : 3};
This creates a key space called “mykeyspace”. This is a top-level name space within Cassandra and it will provide a place for GeoMesa to put all of its data, including data for spatial features and associated metadata.
Next, you’ll need to set the CASSANDRA_HOME
environment variable. GeoMesa uses this variable
to find the Cassandra jars. These jars should be in the lib
directory of your Cassandra
installation. To set the variable add the following line to your .profile
or .bashrc
file:
export CASSANDRA_HOME=/path/to/cassandra
Finally, make sure you know a contact point for your Cassandra instance.
If you are just trying things locally, and using the default Cassandra settings,
the contact point would be 127.0.0.1:9042
. You can check and configure the
port you are using using the native_transport_port
in the Cassandra
configuration file (located at conf/cassandra.yaml
in your Cassandra
installation directory).
13.1.2. Installing from the Binary Distribution¶
GeoMesa Cassandra artifacts are available for download or can be built from source. The easiest way to get started is to download the most recent binary version (2.1.3) from GitHub.
Extract it somewhere convenient:
# download and unpackage the most recent distribution:
$ wget "https://github.com/locationtech/geomesa/releases/download/geomesa_2.11-$VERSION/geomesa-cassandra_2.11-$VERSION-bin.tar.gz"
$ tar xvf geomesa-cassandra_2.11-$VERSION-bin.tar.gz
$ cd geomesa-cassandra_2.11-$VERSION
$ ls
bin/ conf/ dist/ docs/ examples/ lib/ LICENSE.txt logs/
13.1.3. Building from Source¶
GeoMesa Cassandra may also be built from source. For more information refer to Building from Source
in the developer manual, or to the README.md
file in the the source distribution.
The remainder of the instructions in this chapter assume the use of the binary GeoMesa Cassandra
distribution. If you have built from source, the distribution is created in the target
directory of
geomesa-cassandra/geomesa-cassandra-dist
.
More information about developing with GeoMesa may be found in the Developer Manual.
13.1.4. Setting up the Cassandra Command Line Tools¶
GeoMesa Cassandra comes with a set of command line tools for managing Cassandra features located in
geomesa-cassandra_2.11-$VERSION/bin/
of the binary distribution.
Note
You can configure environment variables and classpath settings in geomesa-cassandra_2.11-$VERSION/conf/geomesa-env.sh
.
Note
geomesa-cassandra
will read the $CASSANDRA_HOME
and $HADOOP_HOME
environment variables to load the
appropriate JAR files for Cassandra and Hadoop. In addition, geomesa-cassandra
will pull any
additional jars from the $GEOMESA_EXTRA_CLASSPATHS
environment variable into the class path.
Use the geomesa classpath
command in order to see what JARs are being used.
If you do not have a local Cassandra installation you will need to manually install the Cassandra JARs into the
tools lib
folder. To do this, use the scripts provided with the distribution:
$ bin/install-cassandra-jars.sh lib
$ bin/install-tools-jar.sh lib
Due to licensing restrictions, dependencies for shape file support must be separately installed. Do this with the following commands:
$ bin/install-jai.sh
$ bin/install-jline.sh
Run geomesa-cassandra
without arguments to confirm that the tools work.
$ bin/geomesa-cassandra
INFO Usage: geomesa-cassandra [command] [command options]
Commands:
...
13.1.5. Installing GeoMesa Cassandra in GeoServer¶
Warning
- GeoServer 2.13.0 and 2.13.1 are not recommended due to two serious bugs:
- GeoMesa WPS processes are not triggered correctly, and will run slowly or not at all
- GeoMesa count optimizations are bypassed, potentially resulting in large duplicate scans for WFS queries
The GeoMesa Cassandra distribution includes a GeoServer plugin for including
Cassandra data stores in GeoServer. The plugin files are in the
dist/gs-plugins/geomesa-cassandra-gs-plugin_2.11-$VERSION-install.tar.gz
archive within the
GeoMesa Cassandra distribution directory.
To install the plugins, extract the archive and copy the contents to the WEB-INF/lib
directory of your GeoServer installation. You will also need to install the Cassandra JARs; these
are not bundled to allow for different versions. The distribution includes a script to download
the JARs: bin/install-cassandra-jars.sh
. Call it with the path to the GeoServer WEB-INF/lib
directory.
By default, it will install the following JARs:
- cassandra-all-3.0.11.jar
- cassandra-driver-core-3.0.0.jar
- cassandra-driver-mapping-3.0.0.jar
- netty-all-4.0.33.Final.jar
- metrics-core-3.1.2.jar
Restart GeoServer after the JARs are installed.
13.1.5.1. Jackson Version¶
Warning
Some GeoMesa functions (in particular Arrow conversion) requires jackson-core-2.6.x
. Some versions
of GeoServer ship with an older version, jackson-core-2.5.0.jar
. After installing the GeoMesa
GeoServer plugin, be sure to delete the older JAR from GeoServer’s WEB-INF/lib
folder.