jonathan chow // blog

closures vs. structs in Go

2016-10-25T00:00:00+00:00

Today, I was reminded about this classic story about closures vs. objects:

The venerable master Qc Na was walking with his student, Anton. Hoping to prompt the master into a discussion, Anton said “Master, I have heard that objects are a very good thing - is this true?” Qc Na looked pityingly at his student and replied, “Foolish pupil - objects are merely a poor man’s closures.”

Chastised, Anton took his leave from his master and returned to his cell, intent on studying closures. He carefully read the entire “Lambda: The Ultimate…” series of papers and its cousins, and implemented a small Scheme interpreter with a closure-based object system. He learned much, and looked forward to informing his master of his progress.

On his next walk with Qc Na, Anton attempted to impress his master by saying “Master, I have diligently studied the matter, and now understand that objects are truly a poor man’s closures.” Qc Na responded by hitting Anton with his stick, saying “When will you learn? Closures are a poor man’s object.” At that moment, Anton became enlightened.

The last couple of years for me was a deep dive into a very Haskell-influenced style of Scala and JavaScript programming. Between trying to grok typelevel.scala projects and advanced dependent-typing concepts, my thinking naturally became biased towards modelling problems in terms of types and functions instead of objects.

(To give a general idea of how deep down the rabbit hole I ended up, I wrote both type-level arithmetic operators (including a primality tester!) and type-level SK combinators in Scala.)

Recently, I’ve turned my attention to Go. It’s been a very refreshing experience, having the procedural-functional pendulum swing back the other way. The specific problem that brought this story back to me was in trying to abstract HTTP handlers in Go.

The standard library provides this interface to handle requests:

type HandlerFunc func(ResponseWriter, *Request)

and you’d use it like this, for example:

func MyHandler(w http.ResponseWriter, r *http.Request) {
  http.NotFound(w, r)
}

http.HandleFunc("/", MyHandler)
http.ListenAndServe(":9000", nil)

This API is very barebones, but there’s nothing wrong with it. I really enjoy how straightforward and pragmatic standard Go APIs tend to be.

One problem that creeps in, though, is when your handler needs other dependencies in order to do whatever it needs to do. For me, that was a database connection. Go provides generic database functionality via the sql package, which exposes the *sql.DB type as a database handle of sorts. So how do I get an instance of this struct into my handler?

A basic solution is to simply share what is essentially a global *sql.DB instance, for instance:

func main() {
  db := sql.Open("postgres", "postgres://...")

  myHandler := func(w http.ResponseWriter, r *http.Request) {
    // use `db`
  }

  http.HandleFunc("/", myHandler)
  http.ListenAndServe(":9000", nil)
}

This might be fine for small projects, but there are some common issues with this implementation: you can’t replace the db instance with something else for testing purposes (whether it’s a connection to a different database or a mock), and everything needs to be in the same scope, which doesn’t help with modularization.

Knowing these problems, my immediate thought was to use partially applied functions. In ES6, it’d be something like this:

handler = (db) => (writer, request) => {
  // use `db`
};

This way, the function would still close over the connection, but I’m now free to dictate exactly which instance of db the handler should use in each circumstance.

I did in fact end up writing the equivalent in Go, which looked something like this:

func CreateHandler(db *sql.DB) func(http.ResponseWriter, *http.Request) {
  return func(w http.ResponseWriter, r *http.Request) {
    // use `db`
  }
}

and to test:

func TestFoo() {
  db := getTestConnection()
  handler := CreateHandler(db)

  req := httptest.NewRequest(...)
  res := httptest.NewRecorder()
  handler(res, req)

  // check `res`
}

But sitting back and looking at my handiwork, I couldn’t shake that niggly feeling that this wasn’t quite right - it felt clumsy and not very idiomatic.

Eventually, I stumbled upon a different way to approach the problem, and it (surprise!) involved using structs. It wasn’t complicated, and, to be honest, would very likely have been the first thing to spring to mind for anyone who hadn’t been so heavily invested in modelling problems with functions in the first place. Consider this:

type MyHandler struct {
  db *sql.DB
}

func (h *MyHandler) Handle(w http.ResponseWriter, r *http.Request) {
  // use `h.db`
}

By using a method on a struct instead of a partially applied function, the closing over of the db variable is now pushed into the instance of the struct instead of an instance of a function. This is much more idiomatic Go. And if there’s any doubt that these two implementations are semantically equivalent, and you can see that it really is the case by comparing the test for this version with the previous test:

func TestBar() {
  db := getTestConnection()
  handler := MyHandler{db}

  req := httptest.NewRequest(...)
  res := httptest.NewRecorder()
  handler.Handle(res, req)

  // check `res`
}

This shows precisely what the opening story tries to convey: that objects and closures are different ways to express the same thing. Some languages favour one idiom over the other, but, at the end of the day, the result is the same.

This was a good reminder to me that using the right tool for the right job not only applies to languages and tooling, but to concepts and abstractions as well.

programming with types

2015-01-23T00:00:00+00:00

One of the things that really grows on you after programming with Haskell for a while is the idea that the types alone actually reveal quite a lot about what a program does. The implementation is almost a secondary concern.

To a programmer who has worked in side-effecting languages their whole life, it’s very unnerving to learn that the most common way to search the Haskell documentation is with a type signature. It’s arguably even more disconcerting to see something as terse as this in the official documentation:

const :: a -> b -> a
base Prelude, base Data.Function

Constant function.

Uh… cool. So what does const actually do?

Generics are very generic

To understand how to read type signatures, you first have to realise that generic arguments are precisely that: they are generic. Haskell has no class hierarchy because it’s not an OOP language. In Java, since all types inherit from Object, at a minimum you can always call toString() on a generic type, like so:

<A> void foo(A a) {
    System.out.println(a.toString());
}

In Haskell, absolutely nothing is known about a generic type other than the fact that it is of that type. Trying to compile the Haskell equivalent of the Java above:

foo :: a -> IO ()
foo = print

results in a compile error:

No instance for (Show a)
  arising from a use of `print'
In the expression: print
In an equation for `foo': foo = print

You can’t even test for equality! This code snippet:

bar :: a -> Bool
bar x = x == x

also results in a compile error:

No instance for (Eq a)
  arising from a use of `=='
In the expression: x == x
In an equation for `bar': bar x = x == x

Show and Eq are examples of what is called a typeclass. On a high level, these are similar to interfaces in Java: for example, Show tells the Haskell compiler that show, the equivalent of Java’s toString, is available for that type. The only way to make these functions compile is to explicitly say that you expect the type a to have instances declared for the appropriate typeclass:

foo :: (Show a) => a -> IO ()
foo = print

bar :: (Eq a) => a -> Bool
bar x = x == x

These compile. The closest Java equivalent would be something like this:

<A extends Show> void foo(A a) {
    System.out.println(a.show());
}

The takeaway here is that Haskell is that strict about its types. If you don’t explicitly say that a generic type can do something, then the only operation you can perform on it is to return itself.

Arguments as the sole inputs

Next, you have to remember that Haskell is a pure language. That means that given the same inputs for a function, the output will always be the same, a property known as referential transparency (it’s actually slightly more complicated than that, but it’s close).

This, for example, means that global states are out. This Java function returns a different value every time it is called:

int bar = 0;
int foo(int a) {
    return a + bar++;
}

An equivalent can’t be implemented in Haskell, at least not with the same type signature. It is simply impossible.

Examples

That brings us directly to the most straightforward example, the id function:

id :: a -> a

By now you should be able to see why there can only ever be one implementation given this type signature. The implementation is simply:

id :: a -> a
id x = x

This is because there is no way to find a value of type a within the context of the function and because there are no operations that you can perform on a value of the generic type a.

Let’s move onto a slightly more complex example. Here is the type signature:

apply :: (a -> b) -> a -> b

So apply is function that takes two arguments: the first is a function that takes an a and returns a b, and the second is a value of type a. The apply function itself must return a value of type b. How can it get its hands on this value?

The only possible way is to call the function passed to it with the value that was also passed to it, leading to this implementation:

apply :: (a -> b) -> a -> b
apply f x = f x

Those of you who have used Haskell will recognise this as the ($) operator.

By now the only implementation for the example I gave in the introduction should be apparent:

const :: a -> b -> a
const x _ = x

or, if you prefer:

const :: a -> b -> a
const x = \_ -> x

Through the magic of currying, const x returns a function that ignores its input and always produces the predefined constant result x.

Closing thoughts

And so on. The epiphany for me was the realisation that, given the constraints of the Haskell programming language, type signatures are in fact very unique.

I could look a signature like this

(a -> b) -> [a] -> [b]

and understand immediately that the only productive implementation for this signature yields the map function. Or a signature like this:

(a -> b -> b) -> b -> [a] -> b

and immediately see a fold function (foldr specifically in Haskell).

From experience I can say that it certainly takes some time to get used to reading and understanding functions in this manner. It’s a fundamental change in how you interpret code. But once you’ve made the connection and internalised this concept you don’t ever think about code in quite the same way again.

exploring scala macros: map to case class conversion

2013-11-04T00:00:00+00:00

Recently I had a go at writing some Scala macros. Scala macros are essentially an advanced version of the traditional C #defines. To someone like me who hasn’t had too much experience with C, its macros feel like a sophisticated find-and-replace tool that gets run before each compile. On the other hand, Scala macros can bring about benefits such as automatic code generation (via implicits), static type safety within strings (when interpolating), and even allow for the creation very fluent DSL interfaces.

My best understanding of Scala macros was the code generation aspect of it, so I decided to tackle a problem that has probably plagued every budding developer who’s tried to roll their own ORM in a statically typed language: persisting a case class to the database and reading it back without using reflection.

The crux of the problem is always the conversion between a type-safe case class and the database layer. In more mainstream languages like Java, there is simply no way to automatically call some function for each field based on its type without using reflection. With macros, however, the code to do this can be generated at compile time.

The Problem

Let’s reduce the problem to a very specific one: taking any arbitrary case class and producing converter functions to and from a Map[String, Any] where the keys are the names of the case class’s constructor parameters pointing to their respective values.

[Note: many of the problems I faced while writing this macro were solved by looking at this implementation on StackOverflow, hence the similarity.]

To take advantage of implicit macros (we’ll get back to them later), we’ll use a type class to provide the conversion:

trait Mappable[T] {
  def toMap(t: T): Map[String, Any]
  def fromMap(map: Map[String, Any]): T
}

Any implementation of the Mappable[T] trait can now be used to convert a type T to and from a map. For example, we can define one manually:

case class Person(name: String, age: Int)

val PersonMapper = new Mappable[Person] {
  def toMap(p: Person) = Map(
    "name" -> p.name,
    "age" -> p.age)
  def fromMap(map: Map[String, Any]) = Person(
    map("name").asInstanceOf[String]
    map("age").asInstanceOf[Int])
}

There’s a big problem with defining the mapper explicitly though: any time the case class changes the mapper must also be updated accordingly. Take, for example, the case of adding a new parameter to the Person case class:

case class Person(name: String, age: Int, height: Double)

val PersonMapper = new Mappable[Person] {
  // toMap: compiles even though it's incorrect
  def toMap(p: Person) = Map(
    "name" -> p.name,
    "age" -> p.age)

  // fromMap: fails to compile (as it should)
  def fromMap(map: Map[String, Any]) = Person(
    map("name").asInstanceOf[String]
    map("age").asInstanceOf[Int])
}

When only the case class has changed, the compiler can catch the error in the fromMap method because it’s one parameter short, but the compiler can’t catch the semantic error in the toMap method missing the new height parameter.

Using Macros

The reason for this is that explicitly defining the mapper leads to code that’s not very DRY. It introduces multiple points in the code that have to change in order for some changes to be semantically correct. Ideally, the mapper should be able to figure out what fields are needed by looking directly at the class it’s defined for rather than having each field explicitly listed in its methods.

It turns out that macros let you do this really easily. Let’s start by defining a barebones macro in the companion object of the Mappable trait:

[Note: you can clone this template repo to follow along. With the 2.11.0-M5 compiler, macros must be compiled separately from the code that uses them. With this template, the macro subproject can be used for this purpose.]

import scala.reflect.macros.Context
object Mappable {
  implicit def materializeMappable[T]: Mappable[T] =
    macro materializeMappableImpl[T]

  def materializeMappableImpl[T: c.WeakTypeTag](c: Context): c.Expr[Mappable[T]] = {
    import c.universe._
    val tpe = weakTypeOf[T]

    c.Expr[Mappable[T]] { q"""
      new Mappable[$tpe] {
        def toMap(t: $tpe) = ???
        def fromMap(map: Map[String, Any]) = ???
      }
    """ }
  }

Even to a seasoned Scala user, if you’ve never used macros before this probably looks like gobbledegook! Dependent types, quasiquotes, even a context bound thrown into the mix. Behind all the flashiness, however, it’s actually fairly straightforward. Let’s go through this one part at a time.

Implicit Function to Trigger Macro

We start off with the implicit method that triggers the macro:

implicit def materializeMappable[T]: Mappable[T] =
  macro materializeMappableImpl[T]

It’s easy to see that this method returns a Mappable corresponding to whatever type is passed when the function is called. This method doesn’t have an implementation; the macro keyword instructs the compiler to expand the corresponding macro implementation instead, in this case, materializeMappableImpl.

The reason we make this method implicit is that this allows the compiler to automatically create mappers for types as required (the aforementioned implicit macros). Without it, one would need to explicitly create a mapper before using it:

def personToMap(p: Person) = {
  val mapper = materializeMappable[Person]
  mapper.toMap(p)
}

By marking the method implicit, we give the compiler the opportunity to automatically insert this method call whenever an implicit parameter of type Mapper[T] is required. For example,

// the compiler will insert materializeMappable[T] as the implicit parameter
def mapify[T](t: T)(implicit mapper: Mappable[T]) =
  mapper.toMap(t)

We can even use context bounds to not explicitly specify the extra parameter:

def mapify[T: Mappable](t: T) =
  implicitly[Mappable[T]].toMap(t)

In this case, the mapper is implicitly inserted into the function by the compiler. We don’t have a reference to it, but it’s there, so we use the implicitly function to summon it from the nether world.

Macro Boilerplate

Let’s move on to the macro implementation. The structure of the macro function looks at first sight to be some strange incantation:

def materializeMappableImpl[T: c.WeakTypeTag](c: Context): c.Expr[Mappable[T]] = {
  import c.universe._
  // _...
}

Again, however, it’s actually fairly straightforward. Macros work with code, so we manipulate it with abstract syntax trees. The Context variable contains information the compiler would have pertaining to the current invocation of the macro (such as call site, parameters, etc.). This is passed as a parameter to the macro expansion. All the other information about the macro invocation are then passed the same way the original function is written as a dependent type of the current Context: parameters and return types as c.Exprs (essentially typed ASTs according to the docs), and type parameters as c.WeakTypeTags (see this commit for an explanation about why it must be a WeakTypeTag and here for more information about TypeTags in general).

Finally, we import everything inside the universe of the Context to bring all the common utility functions into scope.

Macro Implementation

Now we get into the nuts and bolts of the macro:

val tpe = weakTypeOf[T]

c.Expr[Mappable[T]] { q"""
  new Mappable[$tpe] {
    def toMap(t: $tpe) = ???
    def fromMap(map: Map[String, Any]) = ???
  }
""" }

To start things off, we first get the type of the case class we’re creating a mapper for out of the WeakTypeTag. This tpe variable can then be used directly within quasiquotes.

[Note: it looks like WeakTypeTags should also be directly usable within quasiquotes since [they also have a Liftable implementation](http://www.scala-lang.org/files/archive/api/2.11.0-M5/#scala.reflect.api.StandardLiftables) but I couldn't get it to work. I didn't look too closely at it though. densh has pointed out that you need a variable of type WeakTypeTag and not a type of one for this to work.]

Now, quasiquotes. I found this part the most awesome part about writing Scala macros. They’re somewhat of a replacement for the earlier reify/splice style of writing macros. They work just like interpolated strings, but instead of a string you write normal Scala code and instead of splicing string versions of variables with $variable, you splice ASTs. The most obvious distinction between the two is that reify returns an Expr, while quasiquotes return an AST which must then be wrapped into an Expr explicitly.

With that understanding, the rest of this code snippet should be easy to understand. We define an Expr of type Mappable[T] and use quasiquotes to create the AST from normal code. Note the use of the tpe variable inside the quasiquotes in place of T. We use ??? here because we’ve yet to discuss the real implementation of the Mappable instance.

Getting Fields

Our instance of Mapper needs to iterate over the fields of the case class it’s used for. We don’t want all fields though; just the ones used in the constructor are all we want.

There are many ways we can get at that information. Methods have an isCaseAccessor flag that signifies whether they are used to access the parameters in the constructor. We can also look at the primary implementation of the copy function. However, because we’ll eventually need the exact order of parameters in order to implement the fromMap method, we’ll use the primary constructor to get the list of fields we need.

To do this, we’ll inspect the tpe variable describing our case class to get a list of all its declarations. [Note: declarations are members declared directly in this class, while members include inherited ones.] One of these will be the primary constructor, so we use a pattern match with a guard to get it out. Once we have the constructor, we can extract the list of parameters in the order that we need.

This can be translated directly into code:

val declarations = tpe.declarations
val ctor = declarations.collectFirst {
  case m: MethodSymbol if m.isPrimaryConstructor => m
}.get
val params = ctor.paramss.head

paramss looks like a typo, but in fact it’s a list of lists (of parameters), hence the double ‘s’. There’s only ever one primary constructor, so in our case we’re fine taking the head of that list, but methods in general can be overloaded to take different parameter lists which is why it’s there.

Writing toMap

Now that we have the fields, let’s write the toMap method. Let’s refresh ourselves with what this method should look like by taking a look at the manual implementation from earlier:

def toMap(p: Person) = Map(
  "name" -> p.name,
  "age" -> p.age)

The implementation is just one statement! It’s just a call to Map.apply with “stuff” in it. Let’s break down what that “stuff” includes:

the name of the field as a String
a call to the -> method to create the tuple
a member access to the underlying field

What we need, then, is an AST that represents this. What better way to generate that AST than to use quasiquotes?

val toMapParams = fields.map { field =>
  val name = field.name
  val mapKey: String = name.decoded
  q"$mapKey -> t.$name"
}

That’s all we need! The mapKey variable is annotated with its type String to illustrate the fact that Strings have a built-in Liftable implementation that allows quasiquotes to convert it into the appropriate AST without us doing so explicitly (the AST would be Literal(Constant(mapKey))).

There are probably two more things in here that stand out: what does it mean to decode the name? And what’s this t variable that hasn’t been defined anywhere? (Or has it…?)

According to the docs, decoding the name “replaces all occurrences of $op_names in this name by corresponding operator symbols”. We want this because in the case a parameter has a name like content-type, we want the map to have the key content-type and not content$minustype.

The t variable is a bit more tricky. We must remember that all we’re constructing here is an AST. It is merely some small portion of code. With no context, this t variable makes no sense, but if we put it in some context where some variable t is defined, then it does make sense. If you look back at the original definition of the toMap method we used in the macro, you’ll see that the name of the variable passed into the toMap method is, in fact, named t. This is the t that we’re referring to.

Combining all this together, we can advance our macro implementation to include the toMap method:

c.Expr[Mappable[T]] { q"""
  new Mappable[$tpe] {
    def toMap(t: $tpe) = Map(..$toMapParams)
    def fromMap(map: Map[String, Any]) = ???
  }
""" }

The toMap method is implemented as described before. The t variable now has sufficient context to give the code meaning. We use ..$toMapParams to indicate that we are passing a List[T]. There is a ... variant for List[List[T]] (e.g., parameter lists for methods) which are shown on the quasiquotes doc page, but I haven’t had a chance to try them out.

If you want, you can comment out the fromMap method from the Mappable trait and the macro implementation to give toMap a try:

def mapify[T: Mappable](t: T) =
  implicitly[Mappable[T]].toMap(t)

case class Item(name: String, price: Double)
val map = mapify(Item("lunch", 15.5))
println(map("name")) // "lunch"
println(map("price")) // 15.5

Cool, huh?

Writing fromMap

The fromMap method can be written in an analogous way. Let’s take a look at what we need:

def fromMap(map: Map[String, Any]) = Person(
  map("name").asInstanceOf[String]
  map("age").asInstanceOf[Int])

There are two things we need here that we didn’t need for the implementation of toMap: the companion object for the apply method, and the type of each parameter for the cast. We can get both from the tpe variable:

val companion = tpe.typeSymbol.companionSymbol
def returnType(name: Name) = tpe.declaration(name).typeSignature

Using these and the same list of fields we had from the toMap implementation, we can generate the fromMap implementation:

val fromMapParams = fields.map { field =>
  val name = field.name
  val decoded = name.decoded
  val returnType = tpe.declaration(name).typeSignature
  q"map($decoded).asInstanceOf[$returnType]"
}

c.Expr[Mappable[T]] { q"""
  new Mappable[$tpe] {
    def toMap(t: $tpe) = Map(..$toMapParams)
    def fromMap(map: Map[String, Any]) = $companion(..$fromMapParams)
  }
""" }

Remember that decoded is a String that gets lifted into an AST by the quasiquotes. map will be the name of the variable that gets passed to the fromMap method. The factory for the case class is the apply method of the companion object, which we can call by doing a function application directly on the companion object’s symbol, just like in standard Scala.

It’s important to note that the order of the parameters that get fed into the apply method is important. This is why in the beginning we chose to retrieve the list of parameters from the primary constructor. By doing so, we’ve guaranteed ourselves that the order will indeed be correct.

And that’s it! You can try it out like this:

def materialize[T: Mappable](map: Map[String, Any]) =
  implicitly[Mappable[T]].fromMap(map)

case class Item(name: String, price: Double)
val item = materialize[Item](Map("name" -> "dinner", "price" -> 25.8))
println(item.name) // "dinner"
println(item.price) // 25.8

Wrapping It Up

This is the complete implementation of the macro. You can also find it in the complete-example branch of my macro template repo. I’ve taken the liberty to simplify the code where possible to make it short and concise.

import scala.reflect.macros.Context

trait Mappable[T] {
  def toMap(t: T): Map[String, Any]
  def fromMap(map: Map[String, Any]): T
}

object Mappable {
  implicit def materializeMappable[T]: Mappable[T] =
    macro materializeMappableImpl[T]

  def materializeMappableImpl[T: c.WeakTypeTag](c: Context): c.Expr[Mappable[T]] = {
    import c.universe._
    val tpe = weakTypeOf[T]
    val companion = tpe.typeSymbol.companionSymbol

    val fields = tpe.declarations.collectFirst {
      case m: MethodSymbol if m.isPrimaryConstructor ⇒ m
    }.get.paramss.head

    val (toMapParams, fromMapParams) = fields.map { field ⇒
      val name = field.name
      val decoded = name.decoded
      val returnType = tpe.declaration(name).typeSignature

      (q"$decoded → t.$name", q"map($decoded).asInstanceOf[$returnType]")
    }.unzip

    c.Expr[Mappable[T]] { q"""
      new Mappable[$tpe] {
        def toMap(t: $tpe): Map[String, Any] = Map(..$toMapParams)
        def fromMap(map: Map[String, Any]): $tpe = $companion(..$fromMapParams)
      }
    """ }
  }
}

I hope this introduction to Scala macros has been helpful. I’m no expert in them and most of what I’ve done here was the result of scouring the Scala docs and a lot of googling. Comments and suggestions are most welcome!

dynamically creating tests with ScalaTest

2013-05-12T00:00:00+00:00

At the Code Retreat run at Movio this weekend we watched Corey Haines do the Roman Numerals kata in Ruby. An interesting thing he did was to list all conversion in a hash and iterate over it to dynamically create the test for each conversion:

describe "Converting arabic numbers to roman numerals" do
  {
    1 => "I",
    2 => "II",
    5 => "V"
    # ...
  }.each_pair do |arabic, roman|
    it "converts #{arabic} to #{roman}" do
      expect(convert(arabic)).to eq(roman)
    end
  end
end

Later while attempting to create Conway’s Game of Life using TDD I came across some tests that were repetitive in a similar manner:

describe ("alive cells") {
  val cell = Cell(Alive)
  it ("should become Dead when there are 0 live neighbours") {
    cell.next(0) should be (Cell(Dead))
  }
  it ("should become Dead when there are 1 live neighbours") {
    cell.next(1) should be (Cell(Dead))
  }
  it ("should become Alive when there are 2 live neighbours") {
    cell.next(2) should be (Cell(Alive))
  }
  it ("should become Alive when there are 3 live neighbours") {
    cell.next(3) should be (Cell(Alive))
  }
  // ...
}

It turns out that ScalaTest also supports creating tests using the same style:

describe ("alive cells") {
  val cell = Cell(Alive)

  Seq(
    (0, Dead),
    (1, Dead),
    (2, Alive),
    (3, Alive)
    // ...
  ) foreach { case (count, state) =>
    it (s"should become $state when there are $count live neighbours") {
      cell.next(count) should be (Cell(state))
    }
  }
}

It might take some getting used to, but I think it’s a rather nice way to run different inputs for the same test. It lets you isolate each input into its own test case without any code duplication and concisely lists all your test cases and expected outputs together.

You can see how I did Corey’s Roman Numerals kata in Scala (along with other katas I have done/will do) on my GitHub.

birthday problem

2013-03-22T00:00:00+00:00

Recently at work we came across a case where we needed to generate up to 10,000 random unique numbers. We had to fit it into 23 bits, giving us roughly 8 million different numbers to choose from.

We’d all done combinatorics before, so we knew that if we were to randomly generate these numbers, the chance of there being a collision isn’t going to be as low as what our intuition tells us. But none of us were really that fluent with our math, so when we plugged our formula into Wolfram Alpha and it spit out 99.8% chance of a collision, we were sure that the problem was with our formula and not with the scenario.

I ended up testing the situation empirically in the Scala REPL and it turns out the math was right after all. Here’s the template for empirically testing the classic birthday problem:

import scala.util.Random

def sample(size: Int, limit: Int): Seq[Int] =
  Stream.continually(Random.nextInt(limit)).take(size)

def isUnique(sample: Seq[Int]): Boolean =
  sample.distinct.size == sample.size

def collisionChance(size: Int, limit: Int, times: Int): Double =
  (Stream
    .continually(sample(size, limit))
    .take(times)
    .count(isUnique)
    .toDouble / times)

assert(collisionChance(23, 365, 10000) ~= 0.5)