Getting started with Titan using Cassandra and Solr
Titan comes with several possibilities to configure the storage (BerkleyDb, Cassandra, Hbase) and the underlying search engine (Lucene, Solr, Elastic). Since DataStax aquired Aurelius and DataStax...
View ArticleRealtime Fast Data Analytics with Druid
I have been working with the SMACK stack for a while now and it is great fun from a developer’s point of view. Kafka is a very robust data buffer, Spark is great at streaming all that buffered data and...
View ArticleDeploying Pull Requests with Docker
The Git repositories in my current project are hosted on Bitbucket Cloud. Any code changes have to go through pull requests. Jenkins builds the pull requests and gives its approval if the build is...
View ArticleChange Streams in MongoDB 3.6
MongoDB 3.6 introduces an interesting API enhancement called change streams. With change streams you can watch for changes to certain collections by means of the driver API. This feature replaces all...
View ArticleIoT Analytics Platform
The Internet of Things a.k.a. the next industrial revolution is the current hype, but what kinds of challenges do we face with the consumption of big amounts of data? One variant is to collect all the...
View ArticleFrom PDF data sheets to shared understanding with serverless SHACL
Knowledge contained in PDF files When crawling the web for information about products of a specific category, may it be instances of industrial machine parts, chemical components, or even household...
View Article