dynq

dynq is an analytic query and data processing CLI tool for DynamoDB. It uses jq filters to target, transform and/or aggregate items in a given table, and has a number of QoL features including automatic pagination, table segmentation and index expansion.

Installation

Linux

Linux binaries can be downloaded from https://github.com/benward2301/dynq/releases.

Docker

docker run -i --rm \
    --network=host \
    -v ~/.aws:/root/.aws:ro \
    -e AWS_ACCESS_KEY_ID=$AWS_ACCESS_KEY_ID \
    -e AWS_SECRET_ACCESS_KEY=$AWS_SECRET_ACCESS_KEY \
    -e AWS_REGION=$AWS_REGION \
  benward2301/dynq --version

You may wish to alias this command (omitting --version), or copy the docker/dynq script to somewhere on your path (e.g. /usr/local/bin).

PowerShell

docker run -i --rm `
    --network=host `
    -v $Env:USERPROFILE\.aws:/root/.aws:ro `
    -e AWS_ACCESS_KEY_ID=$Env:AWS_ACCESS_KEY_ID `
    -e AWS_SECRET_ACCESS_KEY=$Env:AWS_SECRET_ACCESS_KEY `
    -e AWS_REGION=$Env:AWS_REGION `
  benward2301/dynq --version

Building from source

A Linux binary can be built at target/dynq using Docker:

git clone https://github.com/benward2301/dynq.git
cd dynq
docker compose up -d
docker/build/mvn verify
docker compose down --rmi all
target/dynq --version

Then a Docker image can be built:

docker build -t benward2301/dynq -f docker/local/Dockerfile .

Starting out

The DVD rental database

You can try dynq out using a local single-table conversion of the PostgreSQL DVD rental sample database:

docker run -d -p 8000:8000 benward2301/dynamodb-local-dvd-rental
sleep 2

# Can be omitted if you have AWS configured
export AWS_ACCESS_KEY_ID=local
export AWS_SECRET_ACCESS_KEY=local
export AWS_REGION=eu-west-2

dynq -E http://localhost:8000 -f dvd_rental -L 1

You can view the database schema with the AWS CLI:

aws dynamodb --endpoint-url http://localhost:8000 describe-table --table-name dvd_rental

Tips

Use --partition-key and --sort-key wherever possible
Use --select to improve performance of high-volume queries
Keep jq filters simple
Consider using jiq or a similar interactive jq tool to model queries
Bear in mind memory usage when processing large datasets

Options

General

`-f, --from`

(table name)

Required

The name of the table containing the requested items; or, if you provide --index, the name of the table to which that index belongs.

`-c, --concurrency`

(integer)

The number of coroutines to launch when reading from DynamoDB. Defaults to 1.

For scan operations, this option is equivalent to the DynamoDB --total-segments option. The optimal number will depend on the size and composition of the table.

For non-scan operations, this option is only applicable if multiple keys are passed.

Incompatible with --scan-limit and --start-key.

`-i, --index`

(index name)

The name of a global secondary index to query.

Requires --partition-key.

Incompatible with --consistent-read.

`-x, --expand`

Retrieve non-projected attributes from the primary table when querying a global secondary index.

Requires --partition-key and --index.

`-s, --select`

(projection expression)

A comma-separated set of attribute names to retrieve. Equivalent to the DynamoDB --projection-expression option.

Can improve performance of queries.

`-e, --stream`

Incrementally write items to stdout.

Incompatible with --aggregate, --reduce, --prune and --meta-only.

`--consistent-read`

Guarantees that all writes completed before the query began will be processable.

`-E, --endpoint-url`

(url)

Send DynamoDB requests to the given URL.

`-p, --profile`

(aws profile)

Profile to use from your AWS credentials file.

`-R, --region`

(aws region)

The AWS region to use. Overrides config/env settings.

Selection filters

`-w, --where`

(jq filter)

jq predicate filter to select items. Equivalent to jq select(f) function.

`-P, --partition-key`

(jq filter)

jq filter producing one or more partition keys to query.

The output must be an object containing a single key (the partition key attribute name), the value of which must be a string, number, or array thereof.

Binary keys are not currently supported.

`-S, --sort-key`

(jq filter)

jq filter producing one or more sort keys or a sort key range to query.

The output must be an object containing a single key (the sort key attribute name).

To target specific items, the value must be a string, number, or array thereof.

To target a range of items, the value must be an object with exactly one of the following operator keys:

lt or less_than
lte or less_than_or_equals
gt or greater_than
gte or greater_than_or_equals
begins_with
between

Operand values must be a string or number. Each operator expects a single operand, except between which expects an array containing a lower and upper bound (both inclusive).

Binary keys are not currently supported.

Requires --partition-key.

Incompatible with --start-key.

`-k, --start-key`

(jq filter)

jq filter producing the last evaluated key from a previous DynamoDB scan or query operation. When applicable, dynq will return the last evaluated key of any such operations via the meta.lastEvaluatedKey field.

If a partition key has already been passed via the --partition-key option, then this filter does not need to output one.

Binary keys are not currently supported.

Incompatible with --sort-key and --concurrency.

Transformation filters

`-t, --transform`

(jq filter)

jq filter to transform individual items. Executes after the --where selection filter.

Incompatible with --meta-only.

`-T, --pretransform`

(jq filter)

jq filter to transform individual items. Executes before the --where selection filter.

`-a, --aggregate`

(jq filter)

jq filter to transform the complete query result set, after all other transformations and exclusions have been applied. The output of this filter is returned to the user via the content field.

The total hit count of the query can be accessed in this filter using the $count variable. This is useful when the result set has been transformed by the --reduce filter.

Incompatible with --stream and --meta-only.

`-u, --prune`

(jq filter)

jq filter to transform the cumulative result set, executed after each request to DynamoDB. Must return an array.

This filter can be used to find the least/greatest n values according to some comparator, or find distinct values.

Where possible, it should be used over --aggregate for high-volume queries to reduce memory usage.

Incompatible with --meta-only.

`-r, --reduce`

(starting value) (jq filter)

Reduce items using the given starting value and jq filter, with items assigned to $item.

Equivalent to jq reduce .[] as $item (<starting value>; <jq filter>)

Incompatible with --stream, --prune and --meta-only.

Limits

`-l, --limit`

(integer)

The maximum number of DynamoDB items to retain after selection.

Note that meta.lastEvaluatedKey will not be returned when this option is given.

`-L, --scan-limit`

(integer)

The maximum number of DynamoDB items to scan across one or more requests.

Unlike --limit, meta.lastEvaluatedKey will be returned when this option is given unless all items have been scanned.

`-Q, --request-limit`

(integer)

The maximum number of requests to send to DynamoDB per coroutine.

`-I, --items-per-request`

(integer)

The maximum number of items scanned per DynamoDB request.

Output flags

`--content-only`

Return only the content of the query output.

Incompatible with --meta-only.

`--meta-only`

Return only the metadata of the query output.

Incompatible with --content-only, --transform, --aggregate, --prune, --reduce, --rearrange-keys and stream.

`-q, --quiet`

Only write to stderr when an error is encountered.

`-C, --colorize`

Colorize JSON output. Enabled by default when destination is a TTY.

Incompatible with --monochrome.

`-M, --monochrome`

Do not colorize JSON output. Enabled by default if destination is not a TTY.

Incompatible with --colorize.

`-m, --compact`

Compact instead of pretty-printed output.

`-g, --rearrange-keys`

Sort keys of objects on output.

Incompatible with --meta-only.

Examples

The examples below are run against the DVD rental sample database.

For brevity, these queries do not use the --select option, but its use is recommended for high-volume queries.

Find a `film` with a `G rating`

dynq --from dvd_rental \
  --partition-key '.entity = "film"' \
  --where '.rating == "G"' \
  --limit 1

Output

Tip

The --partition-key filter above uses assignment to produce an object, however a JSON or JSON5 object literal may be used instead:

--partition-key: '{ entity: "film" }'

Calculate total `amount` of `payments` in `2007-03`

Using --transform and --aggregate

dynq --from dvd_rental \
  --partition-key '.entity = "payment"' \
  --where '.payment_date | startswith("2007-03")' \
  --transform '.amount' \
  --aggregate 'add'

Using --reduce

dynq --from dvd_rental \
  --partition-key '.entity = "payment"' \
  --where '.payment_date | startswith("2007-03")' \
  --reduce 0 '. + $item.amount'

Output

Calculate average `payment amount`

Using --transform and --aggregate

dynq --from dvd_rental \
  --partition-key '.entity = "payment"' \
  --transform '.amount' \
  --aggregate 'add / length'

Using --reduce and --aggregate

dynq --from dvd_rental \
  --partition-key '.entity = "payment"' \
  --reduce 0 '. + $item.amount' \
  --aggregate '. / $count'

Output

Find the `film` with the longest `length`

dynq --from dvd_rental \
  --partition-key '.entity = "film"' \
  --prune '[max_by(.length)]'

Output

Find the three `films` with the shortest `length`

dynq --from dvd_rental \
  --partition-key '.entity = "film"' \
  --prune 'sort_by(.length)[:3]'

Output

Count `films` by `rating`

dynq --from dvd_rental \
  --partition-key '.entity = "film"' \
  --reduce '{}' '.[$item.rating] += 1'

Output

Find `inventory` for `film_id 1` using the `film_id` index

dynq --from dvd_rental \
  --partition-key '.film_id = 1' \
  --where '.entity == "inventory"' \
  --index 'film_id' \
  --expand

Output

Scan from `rental 1`

dynq --from dvd_rental \
  --start-key '{ entity: "rental", id: 1 }'

Get `films` up to `film 100`

dynq --from dvd_rental \
  --partition-key '.entity = "film"' \
  --sort-key '.id.less_than = 100'

Find all `staff`, `customers` and `actors` named `Jon`

dynq --from dvd_rental \
  --partition-key '.entity = ["staff", "customer", "actor"]' \
  --where '.first_name == "Jon"'

Output

Save each item locally

dynq -em -f dvd_rental | while read item; do echo "$item" > $(uuid).json; done

Decode a binary attribute

dynq -e -f dvd_rental -P '.entity = "staff"' -S '.id = 1' -t '.picture' \
  | tr -d \" \
  | base64 -d \
  > staff_1.png

Name		Name	Last commit message	Last commit date
Latest commit History 78 Commits
.idea		.idea
.mvn		.mvn
dev		dev
docker		docker
docs		docs
src/main		src/main
tests		tests
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE.txt		LICENSE.txt
		README.md
docker-compose.yaml		docker-compose.yaml
pom.yaml		pom.yaml

License

benward2301/dynq

Folders and files

Latest commit

History

Repository files navigation

dynq

Contents

Installation

Linux

Docker

PowerShell

Building from source

Starting out

The DVD rental database

Tips

Options

General

-f, --from

-c, --concurrency

-i, --index

-x, --expand

-s, --select

-e, --stream

--consistent-read

-E, --endpoint-url

-p, --profile

-R, --region

Selection filters

-w, --where

-P, --partition-key

-S, --sort-key

-k, --start-key

Transformation filters

-t, --transform

-T, --pretransform

-a, --aggregate

-u, --prune

-r, --reduce

Limits

-l, --limit

-L, --scan-limit

-Q, --request-limit

-I, --items-per-request

Output flags

--content-only

--meta-only

-q, --quiet

-C, --colorize

-M, --monochrome

-m, --compact

-g, --rearrange-keys

Examples

Find a film with a G rating

Calculate total amount of payments in 2007-03

Calculate average payment amount

Find the film with the longest length

Find the three films with the shortest length

Count films by rating

Find inventory for film_id 1 using the film_id index

Scan from rental 1

Get films up to film 100

Find all staff, customers and actors named Jon

Save each item locally

Decode a binary attribute

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 4

Packages 0

Uh oh!

Languages

`-f, --from`

`-c, --concurrency`

`-i, --index`

`-x, --expand`

`-s, --select`

`-e, --stream`

`--consistent-read`

`-E, --endpoint-url`

`-p, --profile`

`-R, --region`

`-w, --where`

`-P, --partition-key`

`-S, --sort-key`

`-k, --start-key`

`-t, --transform`

`-T, --pretransform`

`-a, --aggregate`

`-u, --prune`

`-r, --reduce`

`-l, --limit`

`-L, --scan-limit`

`-Q, --request-limit`

`-I, --items-per-request`

`--content-only`

`--meta-only`

`-q, --quiet`

`-C, --colorize`

`-M, --monochrome`

`-m, --compact`

`-g, --rearrange-keys`

Find a `film` with a `G rating`

Calculate total `amount` of `payments` in `2007-03`

Calculate average `payment amount`

Find the `film` with the longest `length`

Find the three `films` with the shortest `length`

Count `films` by `rating`

Find `inventory` for `film_id 1` using the `film_id` index

Scan from `rental 1`

Get `films` up to `film 100`

Find all `staff`, `customers` and `actors` named `Jon`

Packages