Skip to content

Setting up the Clojure collector

Alexander Dean edited this page Aug 8, 2013 · 8 revisions

HOME > SNOWPLOW SETUP GUIDE > Step 1: setup a Collector > Clojure collector setup

Overview of the Clojure Collector

The Clojure collector has been designed to enable cross-domain Snowplow deployments. As such, the Clojure collector performs one key function not performed by the [Cloudfront collector](setting up the cloudfront collector): it sets the user_id (used to identify unique visitors) server side, so that it is possible to reliably identify the same user across domains. This differs from the [Cloudfront collector](setting up the cloudfront collector), where user IDs are set on the client side.

The Clojure collector has been designed to run on Amazon's Elastic Beanstalk. This provides two key advantages:

  1. It makes it easy to deploy the Clojure collector in a scalable way
  2. It makes use of Elastic Beanstalk's built in support for saving Tomcat logs into S3. These are the logs that are processed by the Enrichment stage to generate Snowplow event data.

Contents

Setting up the Clojure collector is a 6 step process:

  1. Download the Clojure collector WAR file, or compile it from source. (Required)
  2. Create a new application in Elastic Beanstalk, and upload the WAR file into it. (Required)
  3. [Enable logging to S3](Enable logging to S3). (Required)
  4. [Enable support for HTTPS](Enable support for HTTPS). (Optional, but recommended.)

In addition, we document [additional configuration options](additional configuration options) at the end of this guide.

Note: We recommend running all Snowplow AWS operations through an IAM user with the bare minimum permissions required to run Snowplow. Please see our IAM user setup page for more information on doing this.

HOME > SNOWPLOW SETUP GUIDE > Collectors > Clojure collector setup

Setup Snowplow

  1. [Setup a Collector] (setting-up-a-collector)
  1. [Step 2: Setup a Tracker] (setting-up-a-tracker)
  2. [Step 3: Setup EmrEtlRunner] (setting-up-EmrEtlRunner)
  3. [Step 4: Setup the StorageLoader] (setting-up-storageloader)
  4. [Step 5: Analyze your data!] (Getting started analyzing Snowplow data)

Useful resources

Clone this wiki locally