All posts by tobias

Functions as Arguments Java vs Scala, Game Set Match Scala Wins!

This is how you would create a function that takes a function as an argument in Java:

import java.util.function.Function;

class Scratch {

    public static void doCallFunc(int num, Function<Integer,String> fn) {
        System.out.println( "Result : "+fn.apply( num ) );
    }

    public static void main(String[] args) {
        Function<Integer,String> myFunc = num -> "Value = " + num;
        System.out.println( myFunc.apply( 7 ) );
        doCallFunc( 7, myFunc );
    }
}

The Function<A,B> myFunc = num -> "Value = " + num;
Here:
A = the type of the argument, in this example an Integer
B = the type of the result/returned value, in this example a String

And for multiple parameters you need to create a functional interface of your own (java.util.function.BiFunction exists for exactly two parameters, but beyond that you are on your own), like this:


@FunctionalInterface
interface TwoParamFunction<A,B,C> {
    public C apply(A a, B b);
}

class Scratch {

    public static void doCallFunc2(int num, TwoParamFunction<Integer,String,String> fn) {
        System.out.println( "Result : "+fn.apply( num, "Value" ) );
    }

    public static void main(String[] args) {
        TwoParamFunction<Integer,String,String> myFunc2 = (num,str) -> str + " : " + num;
        doCallFunc2( 7, myFunc2 );

    }
}

Now with TWO (2) parameters instead, it looks a lot more complicated.
TwoParamFunction<A,B,C>
A = the type of the first parameter
B = the type of the second parameter
C = the type of the result/returned value

If we look at Scala, the code looks a lot simpler and much more intuitive:

def myFunc( num:Int ):String = {
 "Value = " + num
}

def doCallFunc( num:Int, fn:(Int)=>String ):Unit = {
 println("Result :"+fn(num))
}

doCallFunc(123,myFunc)

Here the declaration of the function parameter
fn:(Int)=>String
clearly spells out that the argument is an Int and the return type is a String.

And if we had two or more arguments in Scala, you have probably already guessed it:

def myFunc2( num:Int, str:String ):String = {
  str + num
}

def doCallFunc2( num:Int, fn:(Int,String)=>String ):Unit = {
  println("Result :"+fn(num,"Value = "))
}

doCallFunc2( 123, myFunc2 )

For the functions-as-arguments examples above, Scala wins all week!

Over and out !

Apache Cassandra Secondary Indices

How are Secondary Indices really stored ?

This is based on the article from Datastax found here; https://www.datastax.com/blog/2016/04/cassandra-native-secondary-index-deep-dive

Let’s just create a simple table
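
For example (assuming the keyspace already exists and we are using it):

CREATE TABLE customer (
    id   int PRIMARY KEY,
    city text,
    name text
);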

Or visualized as a table :

Column | Type | Key
id     | int  | Primary Key
city   | text |
name   | text |

If we then create an index like this
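
For example (the index name is arbitrary):

CREATE INDEX customer_city_idx ON customer (city);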

Then this will result in just a "normal" table, just hidden, where the column we created the index for becomes the partition key, and the original table's partition key becomes the clustering key:

Column | Type | Key
city   | text | Primary Key
id     | int  | Clustering Key

With some data it would be like this for the “customer” table.

Id | Name            | City
1  | Italia Pizzeria | Kalmar
2  | Thai Silk       | Kalmar
3  | Royal Thai      | Stockholm
4  | Indian Corner   | Malmö

And the index, which is then itself a "table", would thus look like this:

City      | Id
Kalmar    | 1
Kalmar    | 2
Stockholm | 3
Malmö     | 4

When a cluster is used, the data of the source table is distributed over the nodes using the Murmur3 partitioner. The index table is also distributed, BUT its entries are always kept on the same node as the source table data they point to.

Print stacktraces for all threads on shutdown

If your microservice stops responding from time to time, and the only way out is to kill it with SIGINT or SIGTERM, then adding a shutdown hook might be the way to go. Do note that this will not work if you kill the process with SIGKILL (-9), because that will result in an unclean shutdown.

Some of this code is heavily influenced by Print all of the thread's information and stack traces : Exception « Development « Java Tutorial, but it has been translated into Scala and cleaned up a little.
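
A minimal sketch of such a hook (the object name is just an example):

import scala.collection.JavaConverters._

object StackTraceDump {

  // Register a JVM shutdown hook that dumps the stack trace of every live thread.
  // It runs on a clean shutdown (SIGINT/SIGTERM), but never on SIGKILL.
  def install(): Unit = {
    Runtime.getRuntime.addShutdownHook(new Thread {
      override def run(): Unit = {
        for ((thread, frames) <- Thread.getAllStackTraces.asScala) {
          println(s"Thread: ${thread.getName} (state: ${thread.getState})")
          frames.foreach(frame => println(s"    at $frame"))
        }
      }
    })
  }
}

Call StackTraceDump.install() early in main, and the dump is printed whenever the JVM shuts down cleanly.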

The output would look something like this

 

Apache Zeppelin, with Spark and Cassandra, the perfect tool

Zeppelin has become one of my favourite tools in my toolbox. I am heavily designing stuff for Cassandra and in Scala, and even though I love Cassandra there are times when things just get so complicated with the CQL command line, and creating a small project in IntelliJ just seems like too much hassle. Then using Zeppelin to try things out is just perfect. So this page is a How-To with some useful cookbook recipes.

Setting Up Zeppelin

I use Docker, where things are so much easier, and I pick v0.8.0 because I never got 0.8.2 to work for some reason.

Download and Start Cassandra
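
For example (image tag and container name are just what I happen to use):

docker pull cassandra:3.11
docker run --name my-cassandra -d cassandra:3.11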

 

Download and Start Zeppelin

Download Zeppelin image
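
For example:

docker pull apache/zeppelin:0.8.0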

Start Zeppelin on port 8080
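
For example:

docker run -d -p 8080:8080 --name my-zeppelin apache/zeppelin:0.8.0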

-p hp:cp
hp = host port, the port on your local machine
cp = container port, the port inside the container, which is what Zeppelin exposes

Go to localhost:8080 in your web browser and you should see something like this

Setup Zeppelin

Find out the IP address of Cassandra in your Docker network; as you can see from the inspect output, the IP address is 172.17.0.3.
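
For example:

docker inspect my-cassandra | grep IPAddress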

 

Set up IP address for Cassandra in the Spark Interpreter

Go to the section on “Spark”

Now add a property row that points the Spark Cassandra connector at the container, i.e. spark.cassandra.connection.host set to the IP address found above.

Now also edit the Dependencies

You can do this in many ways: either you specify the Maven coordinates with a version, OR you download the JAR file(s) to disk and copy them into the Docker container. I had to do the latter due to some issue with my network.

You need these two libraries :

Simply click on the JAR file and download the file, then copy it into the docker with

Setup IP address for Cassandra in Cassandra Interpreter

Create your first Notebook

Cookbook Recipes

Load Table into RDD and count rows

This is just to show how you load a table into an RDD; once it is in the RDD you can play around with it and do lots of stuff.
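
Something along these lines (the keyspace and table names, ks1.users, are just the examples I use throughout these recipes):

%spark
import com.datastax.spark.connector._

// Load the Cassandra table into an RDD and count the rows
val rdd = sc.cassandraTable("ks1", "users")
println(rdd.count)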

Show key spaces using the built in Cassandra interpreter using CQL
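
For example:

%cassandra
DESCRIBE KEYSPACES;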

The result :

Create Keyspace and Table using CQL
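
For example (keyspace and table names are just placeholders, and replication_factor 1 is only sensible for a single-node playground):

%cassandra
CREATE KEYSPACE IF NOT EXISTS ks1
  WITH replication = {'class': 'SimpleStrategy', 'replication_factor': 1};

CREATE TABLE IF NOT EXISTS ks1.users (
    id   int PRIMARY KEY,
    name text,
    city text
);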

Insert data by hand using CQL
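
For example:

%cassandra
INSERT INTO ks1.users (id, name, city) VALUES (1, 'Tobias', 'Kalmar');
INSERT INTO ks1.users (id, name, city) VALUES (2, 'Anna', 'Stockholm');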

Fill the table with bogus data using Spark and Scala
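
Something like this (again assuming the ks1.users table from above):

%spark
import com.datastax.spark.connector._

// Generate 1000 bogus rows and write them straight into Cassandra
val bogus = sc.parallelize(1 to 1000).map(i => (i, s"name-$i", s"city-${i % 10}"))
bogus.saveToCassandra("ks1", "users", SomeColumns("id", "name", "city"))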

 

Select data using CQL
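
For example:

%cassandra
SELECT * FROM ks1.users LIMIT 10;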

Create VIEW so that we can run SQL
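
Something like this (the view name is arbitrary):

%spark
// Expose the Cassandra table as a temporary view so it can be queried with SQL
val users = spark.read
  .format("org.apache.spark.sql.cassandra")
  .options(Map("keyspace" -> "ks1", "table" -> "users"))
  .load()
users.createOrReplaceTempView("users")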

 

Run SQL, ohh sweet SQL 🙂
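
For example, against the view created above:

%spark
spark.sql("SELECT city, count(*) AS cnt FROM users GROUP BY city").show()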

By creating temporary views like this, we can also do joins if we would like to.

Obviously this is not how Cassandra was intended to be used, but the point here is more about being able to troubleshoot and tourist around in the data with ease, instead of setting up a project and doing the joins inside the code. Here we can really trial-and-error until we get what we want.

That was all for now

-Tobias

Remove the cardo-updater agent from OSX

I have the intercom from Cardo Systems, and it is really good.
BUT when I updated the firmware some time ago, it decided to install some software that takes port 8080, which is one of those really common ports used by a lot of applications out there. So it really becomes a problem…

Now I figured this out after using lsof

Then I got the PID, so now I could do a ps -ef, to figure out WHICH parent process started it.

Ohh PPID = 1 🙂 That is the launchd-process

 

OK so now we know it is the launchd process.
So first just find it in launchd

Now in order to unload it we need to find the path to the plist file

 

As you can see above the plist file is :

path = /Library/LaunchDaemons/com.cardosystems.cardo-updater.plist

Alright, so now we can unload it:
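
With the path from above, something like:

sudo launchctl unload /Library/LaunchDaemons/com.cardosystems.cardo-updater.plist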

And that is it !

-Tobias

SQL LIKE operation in Cassandra is possible in v3.4+

For a long time it has not been possible to do a SELECT * FROM table WHERE firstname LIKE 't%'; in Cassandra like you could in e.g. MySQL or any other relational database for that matter.

In Cassandra v3.4 this is now possible, BUT it requires a little extra work to do it right, and that is why I created this blog post, because I had trouble finding the information.

The solution is to create a separate index, and not the secondary indexes that Cassandra came with, but a different index, called a SASI index.

This is what I have
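
Roughly a table like this; only the firstname column really matters for the example:

CREATE TABLE bth.employee (
    id        int PRIMARY KEY,
    firstname text,
    lastname  text
);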

And the content of it looks like this

And now I would like to search for all the rows that have a first name starting with a 't'.

In SQL that would have been :

SELECT * FROM bth.employee WHERE firstname LIKE ‘t%’;

In fact we could have done that on any column …. but in Cassandra it would result in something like this:

In Cassandra we first have to decide on which columns this should be possible, by creating an index like this:
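
Something like this (the index name is up to you); the default SASI mode is PREFIX, which is exactly what a LIKE 't%' search needs:

CREATE CUSTOM INDEX employee_firstname_idx ON bth.employee (firstname)
USING 'org.apache.cassandra.index.sasi.SASIIndex';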

And so you can now do the following
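
That is, the same query as the SQL above:

SELECT * FROM bth.employee WHERE firstname LIKE 't%';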

But what if you decide that you would like to know all the employees whose first name ends with an 's', so something like this:
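
In CQL that would be:

SELECT * FROM bth.employee WHERE firstname LIKE '%s';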

So to be able to search for a substring (a "contains" match) we have to change the index like this instead:
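
For example, by dropping it and recreating it with mode CONTAINS:

DROP INDEX bth.employee_firstname_idx;

CREATE CUSTOM INDEX employee_firstname_idx ON bth.employee (firstname)
USING 'org.apache.cassandra.index.sasi.SASIIndex'
WITH OPTIONS = {'mode': 'CONTAINS'};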

And now you can run that query again:
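
-- same query as before, now served by the CONTAINS index
SELECT * FROM bth.employee WHERE firstname LIKE '%s';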

You can read more about the SASI index here https://docs.datastax.com/en/cql/3.3/cql/cql_reference/refCreateSASIIndex.html

Enjoy!

-Tobias

UDF/User Defined Functions in Cassandra 3.x

I was just playing around with Cassandra WRITETIME and thought it was somewhat difficult to figure out the date/timestamp of a number like this (microseconds since the epoch): 1470645914253000.

So in my example it looked like this

So I figured why not create a UDF that would solve this for me

That turned out to be a little bit of a challenge …

I thought that I could do like this
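
Something in this spirit (the function name is just an example; the input is a writetime in microseconds):

CREATE FUNCTION micros2date(writetime bigint)
  RETURNS NULL ON NULL INPUT
  RETURNS timestamp
  LANGUAGE java
  AS 'return new Date(writetime / 1000);';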

BUT NO, YOU CAN NOT!!!

There are several WRONGS in here it turns out

  1. First off you have to turn on
    enable_user_defined_functions: true
    in the conf/cassandra.yaml file
  2. All classes have to be fully qualified, so Date would be java.util.Date, and so on…
  3. The division operator '/' cannot be used!!! However +, - and * work fine. Surely this must be a bug… this called for some thinking…

The error I got when trying to use the code above without fully qualified names was

And the reason, if I got it right, is that you cannot do imports in a UDF.

The error I got when trying to use the division ‘/’ operator was this:

The code that works looks roughly like this; using java.math.BigDecimal to solve it was perhaps a so-so solution, but it works:
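
-- movePointLeft(3) does the microseconds-to-milliseconds division without using '/'
CREATE OR REPLACE FUNCTION micros2date(writetime bigint)
  RETURNS NULL ON NULL INPUT
  RETURNS timestamp
  LANGUAGE java
  AS 'return new java.util.Date(new java.math.BigDecimal(writetime).movePointLeft(3).longValue());';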

So now my output in cqlsh looks like this:

That is a lot better !

Cassandra set the writetime explicitly with a PreparedStatement

This is a quick one: I wanted to set the writetime of a row explicitly when I populate the database for testing purposes. We use the writetime of a column to filter them out.

It required some looking around to find out how to do this… so I figured I'd write an article about it.

The timestamp will be set for ALL cells in this row (well, not the primary key, because it does not have a timestamp, but all the others).

The timestamp is given as microseconds since the epoch (the same unit that WRITETIME reports), so lots of digits :-).

A prepared statement would then look roughly like this (Scala code):
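
A rough sketch with the DataStax Java driver; keyspace, table and column names are made up for the example:

import com.datastax.driver.core.Cluster

val cluster = Cluster.builder().addContactPoint("127.0.0.1").build()
val session = cluster.connect("test")

// The unnamed bind marker after USING TIMESTAMP gets the generated name "[timestamp]"
val insert = session.prepare(
  "INSERT INTO users (id, name) VALUES (?, ?) USING TIMESTAMP ?")

// writetime in microseconds since the epoch
val writetime = System.currentTimeMillis() * 1000L

val bound = insert.bind()
  .setInt(0, 1)            // id
  .setString(1, "tobias")  // name
  .setLong("[timestamp]", writetime)

session.execute(bound)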

TTL and TIMESTAMP can both be set like this, i.e. with [ttl] and [timestamp]

-Tobias

Apache SPARK and Cassandra and SQL

This is a short intro to start using Apache SPARK with Cassandra, running SQL on the Cassandra tables.

Note that I am not running a SPARK cluster, I am running "local"; to me this is really convenient, not having to run a SPARK server and workers for something so small. So for playing around with SPARK and Cassandra this is really good.

I am using Scala and SBT.

Something I was struggling hard with was getting the dependency versions right. It is crucial that you do not do what I did first and use version 1.5.2 of Spark with 1.5.0 of the Spark Cassandra Connector; this will NOT work. I constantly got exceptions with java.lang.NoSuchMethodException, and it is incredibly frustrating to try out version after version.

build.sbt
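
Roughly like this; the versions below are one pair that is documented to match (Spark 1.6.x with connector 1.6.x), so double-check against the connector's compatibility table:

name := "spark-cassandra-test"

version := "1.0"

scalaVersion := "2.10.6"

libraryDependencies ++= Seq(
  "org.apache.spark"   %% "spark-core"                % "1.6.1",
  "org.apache.spark"   %% "spark-sql"                 % "1.6.1",
  "com.datastax.spark" %% "spark-cassandra-connector" % "1.6.0"
)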

A small Scala program to show how it works

SparkTest.scala
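
A minimal sketch; the keyspace and table (test.users) are just examples, and Cassandra is assumed to run on localhost:

import org.apache.spark.{SparkConf, SparkContext}
import org.apache.spark.sql.cassandra.CassandraSQLContext

object SparkTest {

  def main(args: Array[String]): Unit = {
    val conf = new SparkConf()
      .setAppName("SparkTest")
      .setMaster("local[*]")                               // no Spark cluster needed, run locally
      .set("spark.cassandra.connection.host", "127.0.0.1") // where Cassandra lives

    val sc = new SparkContext(conf)
    val csc = new CassandraSQLContext(sc)

    // Plain SQL straight against a Cassandra table
    val df = csc.sql("SELECT * FROM test.users")
    df.show()

    sc.stop()
  }
}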

The output…

 

 

SBT Good to know…

Dependency problems

I have been having some difficulties figuring out what depends on what. I found the following set of plugins, which I think can be really helpful:

https://github.com/jrudolph/sbt-dependency-graph

and

https://github.com/gilt/sbt-dependency-graph-sugar
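
Both are ordinary sbt plugins, so they go into a plugins file (per project or global), something like this; the version is the one the README shows, so check it:

// project/plugins.sbt (or ~/.sbt/0.13/plugins/plugins.sbt)
addSbtPlugin("net.virtual-void" % "sbt-dependency-graph" % "0.8.2")
// add the sugar plugin the same way, with the coordinates from its README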

Be sure to install Graphviz first; I used Homebrew on my Mac:

brew install graphviz

and I also had to create a config file

with the following content

The readme explains how to use it pretty well; simply start the sbt CLI.

It will give you a graph that looks something like this (it is in SVG format, so it is searchable!!!). Now you should see which package/jar is using which, and also where the different versions clash…

 

Show the class path for the run command
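
From the sbt shell it is something like:

show runtime:fullClasspath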