A small project to help me learn a little bit about Flask and Python.

Kimblebrook/hello_bears

Setting up the Snowplow applications

Snowplow configuration files are stored in snowplow_config.

Stream collector:

java -jar snowplow-stream-collector-0.12.0.jar --config application.conf

Stream enrich:

java -jar snowplow-stream-enrich-0.13.0.jar --config enrich.conf --resolver file:iglu_resolver.json

S3 loader:

java -jar snowplow-s3-loader-0.6.0.jar --config s3_loader.conf

Upload schemas to S3:

./igluctl static s3cp schema_repository sb-user-bucket --region eu-west-1

Overall architecture

Taking input from here.

stream collector -> kinesis raw stream -> kinesis enrich -> kinesis good stream -> s3 enriched
                                                         -> kinesis bad stream -> s3 bad
                                       -> s3 raw
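
The flow can also be written down as a small graph, which makes the branching explicit. This is only a sketch: it assumes the raw stream feeds both the enrich step and the raw S3 bucket, and that enrich splits its output into the good and bad streams. `PIPELINE` and `sinks_for` are illustrative names, not part of Snowplow.

```python
# Hypothetical model of the pipeline; node names mirror the diagram.
# The branching (raw -> enrich and raw -> s3 raw, enrich -> good and bad)
# is an assumption about how the stages connect.
PIPELINE = {
    "stream collector": ["kinesis raw stream"],
    "kinesis raw stream": ["kinesis enrich", "s3 raw"],
    "kinesis enrich": ["kinesis good stream", "kinesis bad stream"],
    "kinesis good stream": ["s3 enriched"],
    "kinesis bad stream": ["s3 bad"],
}


def sinks_for(node, graph=PIPELINE):
    """Return the terminal nodes (S3 destinations) reachable from node."""
    children = graph.get(node, [])
    if not children:
        # A node with no outgoing edges is a destination.
        return [node]
    found = []
    for child in children:
        for sink in sinks_for(child, graph):
            if sink not in found:
                found.append(sink)
    return found
```

Calling `sinks_for("stream collector")` lists every bucket a collected event can end up in, which is a quick way to check that each stream has a loader attached.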

Ideas for what to do next

  • Get self-describing events working
  • Write data to S3
  • Try out Athena
  • Replace Kinesis with Kafka so that we can try out KSQL
  • Implement an iOS app for testing purposes
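
For the first idea, a self-describing event is a JSON object with a `schema` field (an Iglu URI) and a `data` field (the payload). A minimal sketch of building one in Python follows; the schema URI and the payload fields are made up for illustration and do not exist in this repository.

```python
import json


def self_describing(schema, data):
    """Wrap a payload in the self-describing JSON envelope."""
    return {"schema": schema, "data": data}


# Hypothetical schema URI and payload, for illustration only.
event = self_describing(
    "iglu:com.example/bear_sighting/jsonschema/1-0-0",
    {"species": "grizzly", "count": 2},
)
payload = json.dumps(event)
```

The enrich step would only accept such an event if the schema it names can be found through the resolver configured in iglu_resolver.json.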

Iglu Repository

http://sb-user-bucket.s3-website-eu-west-1.amazonaws.com
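
A static Iglu repository conventionally serves each schema at `schemas/<vendor>/<name>/<format>/<version>` under its root. Assuming this bucket follows that layout, the sketch below turns an Iglu URI into the URL where the schema should be found. `schema_url` is an illustrative helper, and the example URI in the usage note is hypothetical.

```python
import re

# Base URL of the static Iglu repository listed above.
REPO_BASE = "http://sb-user-bucket.s3-website-eu-west-1.amazonaws.com"

# iglu:<vendor>/<name>/<format>/<model>-<revision>-<addition>
IGLU_URI = re.compile(
    r"^iglu:(?P<vendor>[a-zA-Z0-9_.-]+)/(?P<name>[a-zA-Z0-9_-]+)/"
    r"(?P<format>[a-zA-Z0-9_-]+)/(?P<version>\d+-\d+-\d+)$"
)


def schema_url(iglu_uri, base=REPO_BASE):
    """Map an Iglu URI to its expected location in a static repository."""
    match = IGLU_URI.match(iglu_uri)
    if match is None:
        raise ValueError(f"not a valid Iglu URI: {iglu_uri!r}")
    return "{}/schemas/{vendor}/{name}/{format}/{version}".format(
        base.rstrip("/"), **match.groupdict()
    )
```

For example, `schema_url("iglu:com.example/bear_sighting/jsonschema/1-0-0")` gives a URL that can be fetched with a browser or curl to confirm the upload worked.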
