Rokku

A security layer between s3 user (eg. application using aws sdk) and s3 backend (eg. ceph RadosGW).

Rokku uses NPA to access data so it needs to rewrite client signature

Rokku NPA

For setting NPA storage credentials you need to set in the env:

ROKKU_STORAGE_S3_ADMIN_ACCESSKEY

ROKKU_STORAGE_S3_ADMIN_SECRETKEY

What do you need

To get started with Rokku you only need a few applications set up:

Docker
AWS CLI
[Optional for coding] SBT

We've added a small description on how to setup the AWS CLI here.

How to run

To test the proxy (both live and integration tests), we need all dependencies to be running. For this we use a docker-compose.yml which defines all dependencies, run it using:
```
 docker-compose up
```
Before we can run our proxy, we have to specify the configuration for Apache Ranger. Ranger can be configured by creating a file called ranger-s3-security.xml on the classpath. There are 2 places you can put it:
1. REPO_ROOTDIR/src/main/resources/ranger-s3-security.xml
2. /etc/rokku/ranger-s3-security.xml
An example of this file can be found here. No modification to this is needed if you run this project with the accompanying docker containers.
When all is running, we can start the proxy:
```
 sbt run
```

for windows docker runs on different ip so you need to: set environmental variables:

ROKKU_STS_HOST

ROKKU_STORAGE_S3_HOST

ROKKU_KEYCLOAK_TOKEN_URL

change ROKKU_KEYCLOAK_URL in the docker-compose.yml

change the ranger.plugin.s3.policy.rest.url in ranger-s3-security.xml

if you want to use notifications before running sbt run set kafka and zookeeper as a host name in /etc/hosts:

127.0.0.1   localhost kafka zookeeper

thanks to that you will be able to write events to kafka.

Proxy as docker image

When proxy is started as docker image, the ranger-s3-security.xml file can be added in the following way:

    docker run -d -v /host/dir/with/xmls:/etc/rokku -p 8010:8010 rokku

Getting Started

This guide assumes you're using the default docker containers provided, see: How to run

Now you've got everything running, you may wonder: what now? This section we'll describe a basic flow on how to use Rokku to perform operations in S3. You may refer to the What is Rokku? document before diving in here. That will introduce you to the various components used.

Authorise with keycloak to request a keycloak token:

 curl -s \
      -d 'client_id=sts-rokku' \
      -d 'username=testuser' \
      -d 'password=password' \
      -d 'grant_type=password' \
      'http://localhost:8080/auth/realms/auth-rokku/protocol/openid-connect/token'

Search for the field access_token which contains your token.

Retrieve your short term session credentials from the STS service:
```
 aws sts get-session-token --endpoint-url http://localhost:12345 --token-code YOUR_KEYCLOAK_TOKEN_HERE
```
This should give you an accessKeyId, secretAccessKey, sessionToken and expirationDate.

Note: Alternatively you can run script helper to get credentials dev_sts_get_credentials.sh

Setup your local environment to use the credentials received from STS. You can do this in 2 ways.
1. Set them in your environment variables:
```
 export AWS_ACCESS_KEY_ID=YOUR_ACCESSKEY
 export AWS_SECRET_ACCESS_KEY=YOUR_SECRETKEY
 export AWS_SESSION_TOKEN=YOUR_SESSIONTOKEN
```
2. Set them in your ~/.aws/config file. See the Setting up AWS CLI guide on how to do this.
NOTE: This session expires at the expiration date specified by the STS service. You'll need to repeat these steps everytime your session expires.
Technically you're now able to use the aws cli to perform any commands through Rokku to S3. Since the authorisation is completely handled by ranger, authorization in Ceph should be removed to avoid conflicts. For this reason, Rokku uses NPA which must be manually configured as system user:
```
     docker-compose exec ceph radosgw-admin user modify --uid ceph-admin --system
```
Go nuts with the default bucket called demobucket that exists on Ceph already:
```
 aws s3api list-objects --bucket demobucket
 aws s3api put-object --bucket demobucket --key SOME_FILE
```
!BOOM! What happened?!

Well, your policy in Ranger only allows you to read objects from the demobucket. So we'll need to allow to write as well.
1. Go to Ranger on http://localhost:6080 and login with admin:admin.
2. Go to the testservice under the S3 header.
3. Edit the one existing policy. You'll have to allow the testuser to write, but also don't forget to remove the deny condition!
4. Save the policy at the bottom of the page.
Now it'll take maximum 30 seconds for this policy to sync to the Proxy. Then you should be able to retry:
```
 aws s3api put-object --bucket demobucket --key SOME_FILE
 aws s3api list-objects --bucket demobucket
 aws s3api get-object --bucket demobucket --key SOME_FILE SOME_TARGET_FILE
```

Verified AWS clients

We've currently verified that the following set of AWS clients work with Rokku:

CLI (s3 and s3api) using region us-east-1
Java SDK (using signerType: S3SignerType)

Other options may work but haven't been checked yet by us. There are known limitiations for other signer types within the Java SDK.

Known unsupported s3 functions

Chunk upload is not supported because Rokku needs to rewrite s3 signature and payload modification is expensive.

https://docs.aws.amazon.com/AmazonS3/latest/API/sigv4-streaming.html

In most tools you can disable the functionality - e.g java sdk has .disableChunkedEncoding()

Architecture

Dependencies:

Keycloak for MFA authentication of users.
STS Service to provide authentication and short term access to resources on S3.
STS persistence storage to maintain the user and session tokens issued. Current implementation uses MariaDB. Information regarding the tables database and the tables to be created in MariaDB can be found here.
Ranger to manage authorisation to resources on S3. The Apache Ranger docker images are created from this repo: https://github.com/ing-bank/rokku-dev-apache-ranger.git
S3 Backend (Current setup contains Ceph image with RadosGW).

A more in-depth discussion of the architecture and interaction of various components can be found here: What is Rokku?

Docker Ceph settings

In order to enable debug logging on Ceph RadosGW:

Edit /etc/ceph/ceph.conf and add following lines, under [global] section

debug rgw = 20
debug civetweb = 20

Restart rgw process (either docker stop <ceph/daemon rgw> or whole ceph/demo)

Events Notification

Rokku can send event notification to message queue based on user requests, in AWS format.

Currently, two types are emitted:

s3:ObjectCreated:*
s3:ObjectRemoved:*

In order to enable update application.conf, set to true:

        bucketNotificationEnabled = ${?ROKKU_BUCKET_NOTIFY_ENABLED}

and configure kafka and topic names:

        kafka.producer.bootstrapServers = ${?ROKKU_KAFKA_BOOTSTRAP_SERVERS}
        kafka.producer.createTopic = ${?ROKKU_KAFKA_CREATE_TOPIC}
        kafka.producer.deleteTopic = ${?ROKKU_KAFKA_DELETE_TOPIC}

Setting Up AWS CLI

It is possible to set up the AWS command-line tools for working with Ceph RadosGW and Rokku. Following are instructions to set this up using virtualenv_wrapper or Anaconda.

Create an environment for this work:

a. virtualenv_wrapper

% mkvirtualenv -p python3 rokku

b. Anaconda

% conda create -n rokku python=3
% conda activate rokku

Install the AWS command-line tools and the endpoint plugin:
```
% pip install awscli awscli-plugin-endpoint
```

Configure profiles and credentials for working with Rokku or the RadosGW directly (more info can be found in the aws documentation):

% mkdir -p ~/.aws

% cat >> ~/.aws/credentials << EOF
[radosgw]
aws_access_key_id = accesskey
aws_secret_access_key = secretkey

[rokku]
aws_access_key_id = YOUR_ACCESSKEY
aws_secret_access_key = YOUR_SECRETKEY
aws_session_token = YOUR_SESSIONTOKEN
EOF

% cat >> ~/.aws/config << EOF
[plugins]
endpoint = awscli_plugin_endpoint

[profile rokku]
output = json
region = us-east-1
s3 =
    endpoint_url = http://localhost:8987/
s3api =
    endpoint_url = http://localhost:8987/
sts =
    endpoint_url = http://localhost:12345/

[profile radosgw]
output = json
region = us-east-1
s3 =
    endpoint_url = http://localhost:8010/
s3api =
    endpoint_url = http://localhost:8010/
EOF

Note: for now, we default to us-east-1 AWS region for signature verification

Configure the default profile and reactivate the virtual environment:

a. virtualenv_wrapper

% cat >> ${WORKON_HOME:-$HOME/.virtualenvs}/rokku/bin/postactivate << EOF
AWS_DEFAULT_PROFILE=rokku
export AWS_DEFAULT_PROFILE
EOF

% cat >> ${WORKON_HOME:-$HOME/.virtualenvs}/rokku/bin/predeactivate << EOF
unset AWS_DEFAULT_PROFILE
EOF

% deactivate

% workon rokku

b. Anaconda

% cat >> /YOUR_CONDA_HOME/envs/rokku/etc/conda/deactivate.d/aws.sh << EOF
unset AWS_DEFAULT_PROFILE
EOF

% cat >> /YOUR_CONDA_HOME/envs/rokku/etc/conda/activate.d/aws.sh << EOF
AWS_DEFAULT_PROFILE=rokku
export AWS_DEFAULT_PROFILE
EOF

% source deactivate

% source activate rokku

By default S3 and STS commands will now be issued against the proxy service. For example:

% aws s3 ls

Commands can also be issued against the underlying RadosGW service:

% aws --profile radosgw s3 ls

The default profile can also be switched by modifying the AWS_DEFAULT_PROFILE environment variable.

Security environment

For kerberos environment (e.g connecting to ranger) you need to provide keytab file and principle name.

ROKKU_KERBEROS_KEYTAB: "keytab_full_path"
ROKKU_KERBEROS_PRINCIPAL: "user"

Authorization plugin

By default rokku uses Apache Ranger for authorization but you can change it. To provide other implementation you need:

implement AccessControl - example is Ranger
configure
- set the access control class in the config access-control.class-name or environment ROKKU_ACCESS_CONTROL_CLASS_NAME=...
- if you need any specific configuration for the plugin add it to access-control.plugin-param
add the access control class to rokku classpath

Authorization Audit Log

To enable the log set:

ROKKU_ENABLED_AUDIT="true"

For AccessControlProviderRanger you need to provide on the classpath the ranger-s3-audit.xml configuration.

ECS multi namespace support

To support many ECS namespaces you need to have dedicated credentials for each namespace with full access right.

To enable the functionality you have to set env var:

ROKKU_NAMESPACES_ENABLED=true

The namespace credentials also have to be provided in env variables. The variable name prefix is configurable by:

ROKKU_NAMESPACES_ENV_VAR_S3_CREDENTIALS_PREFIX  #by default is set to NAMESPACE_S3_CREDENTIALS_

so for example to set two namespaces you need to set:

NAMESPACE_S3_CREDENTIALS_1=accesskey1,secretkey1
NAMESPACE_S3_CREDENTIALS_2=accesskey2,secretkey2

The s3 credentials (accessKey and secretKey) have to be split by comma. The env namespace credentials will be uploaded to server sorted by the env name and thanks to that you can determine which namespace is checked firt to find a bucket.

The example to start the server is in the server_dev.sh script.

Name		Name	Last commit message	Last commit date
Latest commit History 945 Commits
.github/workflows		.github/workflows
dev-setup/ranger		dev-setup/ranger
docs		docs
mockServer		mockServer
project		project
scripts		scripts
src		src
.editorconfig		.editorconfig
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
build.sbt		build.sbt
docker-compose-extra.yaml		docker-compose-extra.yaml
docker-compose.yml		docker-compose.yml
setupS3Env.sh		setupS3Env.sh
waitForContainerSetup.sh		waitForContainerSetup.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Rokku

Rokku NPA

What do you need

How to run

Proxy as docker image

Getting Started

Verified AWS clients

Known unsupported s3 functions

Architecture

Docker Ceph settings

Events Notification

Setting Up AWS CLI

Security environment

Authorization plugin

Authorization Audit Log

ECS multi namespace support

About

Releases 126

Packages

Contributors 11

Languages

License

ing-bank/rokku

Folders and files

Latest commit

History

Repository files navigation

Rokku

Rokku NPA

What do you need

How to run

Proxy as docker image

Getting Started

Verified AWS clients

Known unsupported s3 functions

Architecture

Docker Ceph settings

Events Notification

Setting Up AWS CLI

Security environment

Authorization plugin

Authorization Audit Log

ECS multi namespace support

About

Topics

Resources

License

Stars

Watchers

Forks

Releases 126

Packages 0

Contributors 11

Languages

Packages