Debugging in Clojure: Tools
Part one covered the types of things I’m considering when talking about “debugging”, and the process by which I debug code.
This part is about writing debuggable code, and the tools I use or have seen to debug code: the REPL, println debugging and better, tracing with Spyscope and clojure.tools.trace, and other assorted good things.
Contents:
- Constructing comprehensible systems
- Easy to exercise
- Easily observed behavior
- Unit tests
- Purity and state
- Understand your assumptions
- Tools and techniques
- The REPL
- println
- Tracing with Spyscope
- Macros for debugging let forms
- Tracing with clojure.tools.trace
- Fogus’s breakpoint macro
- Other bits
Constructing comprehensible systems
So, debugging is essentially a process for gaining a more in-depth understanding about how one particular aspect of our system operates. How can Clojure development in particular help with that, and how can we make that easier when developing in Clojure?
In short: write code that is easy to exercise in the REPL, whose behavior is easily observed, and with unit test coverage.
Easy to exercise
Make it easy to construct valid data to pass to your function. You will be greatly assisted by knowing exactly what constitutes valid. Docstrings, pre- and post-conditions, assertions, schema are all helpful. Fewer arguments keep it simpler. Provide helper functions to help construct complex intermediate data structures.
The usual programming practices have their usual benefits. Write clear, well-encapsulated, cohesive-but-not-coupled, singly-responsible code (functions, in our case).
Easily observed behavior
Your functions should express a concise piece of behavior, and your code will be easiest to understand if that behavior is simple, and the observation is as simple as inspecting the return value.
If the cyclomatic complexity of the function is high, observing the function’s behavior might also require observing the control flow (yuck). If the return value of your function needs significant parsing or decoding to determine whether it operated correctly, consider whether the function has too many responsibilities.
Unit tests
I debated for a second whether tests are part of comprehensible systems, or whether they are tools. They are an integral part of any code base, so here they belong. Tests are your infallible long-term memory, a playground for experimentation, your sanity check. Easily exercised, easily verified functions are easily unit tested.
Purity and state
Writing pure functions aids the system’s comprehensibility enormously.
f(x) = f(x)
when your program starts up, or when it’s crashing, and
whether f
is running on an EC2 node or in your REPL. When your code
is a composition of pure functions, reasoning about the code is far
simpler. This is probably the single biggest contributor to why one
can effectively program in Clojure without a visual debugger: with
pure functions, the state of the system is always external to the
code.
Reality is a little messy. Applications are infrequently entirely a composition of pure functions. You have databases, user input, network and disk I/O to contend with. Push the non-pure aspects of your system to the edges, encapsulate them, and stop your abstractions from leaking too far.
So, why? You need to be facile enough with your code and tools so you can observe the conditions that cause the system to behave “unexpectedly”. If your code is structured such that it is easy to run in the REPL (and pure functions generally are), it will be easy to reproduce the faulty behavior in the REPL (and later in tests), and understand what the system is doing.
If the corpus of code under examination is too large for immediate comprehension (in reductionist terms, “emergent behavior”?), divide and conquer. By looking at the problem into smaller and smaller chunks (i.e. the layers of composed functions), you exclude possibilities of where the source of the problem is, until the answer jumps out at you.
Understand your assumptions
With Occam’s Razor in mind, be ready to question your assumptions. There’s always the possibility a bug in Clojure core is the cause of your problem. It’s not very likely, so don’t start there. The most likely problem is in your understanding of the problem, or in your expression of the solution (substitute “your” for whomever wrote the code), so the assumptions built in here are the first ones to question, to check, and to verify.
Tools and techniques
Finally, the Clojure debugging tools easily available to you in Emacs. They all are typically employed in steps 2 and 3 of the debugging algorithm, and each support one or more of the following:
- executing code
- observing the result of function calls
- observing intermediate calculations
- observing control flow
The REPL
It’s obvious, but you need to run the code to see what it does. Use the REPL to do this. In Emacs, you will likely be using cider-mode to fire up a REPL via Leiningen.
A selection of cider commands for the uninitiated:
- Use
C-c M-n
orM-x cider-repl-set-ns
to switch to a source file’s namespace. C-c C-k
orM-x cider-load-current-buffer
loads the current source file in the REPL.C-c C-e
orM-x cider-eval-last-sexp
evaluates an S-expression and prints the result in the mini buffer.C-c M-p
orM-x cider-insert-last-sexp-in-repl
copies a S-expression into the REPL and switches buffers to the REPL ready for you to hit Enter and evaluate the code.
You may have different key bindings to these.
println
Showing result values, inspecting intermediate calculation results,
shining a light into execution paths, println
can do it all. Some
uses of println I’ve seen:
Print out a number of values at the same time by throwing them into a map and printing out the map:
1 2 3 4 |
|
Print out what values intermediate symbols in a let
are bound to:
1 2 3 4 5 6 7 |
|
A variant of println is using clojure.pprint/pprint
or pr-str
to
print out more complex data structures in a more human-readable
format.
Well, this isn’t great. You have to modify the code to see what’s
going on, so you might forget to remove those modifications. You have
to duplicate code that’s already there to inspect values – this
introduces the possibility of typo error. println
doesn’t
necessarily behave how you would expect in a multithreaded context.
And yeah, it makes you itch a little too. We can do better…
Tracing with Spyscope
Spyscope is a nice improvement of println debugging. With two
simple additions to your ~/.lein/profiles.clj
it adds three reader
tags:
- print:
#spy/p
- details:
#spy/d
- trace:
#spy/t
Since they are reader tags, you can just pop one of them in front of the expression you’re trying to inspect, and it gets printed out. This improves the example from above
1 2 3 4 5 |
|
If you’re doing println debugging, I definitely recommend the upgrade to Spyscope.
Macros for debugging let forms
If you find yourself inspecting the values of the bindings within a let form, I have seen a number of macros to automate the typing.
I have Gareth Jones’s dlet
macro in my user namespace:
1 2 3 4 5 6 7 |
|
To see the binding values within the let, just replace any old let
in your code with a dlet
, and the bindings all get printed out:
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 |
|
I have also seen a similar outcome using Fogus’s evalive library:
1 2 3 4 5 6 |
|
Tracing with clojure.tools.trace
The techniques so far all involve modifying the code to inspect values. In addition to similar tools to the above, clojure.tools.trace gives you easy ways to trace the result of function calls without any code modification.
From the REPL, use the trace-vars
and trace-ns
functions to wrap
specific functions or entire namespaces with tracing calls. In this
example, I define and use a function to square a number, apply a trace
to it, then use it again. You can see the invocation arguments and
return value for each call:
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 |
|
I like clojure.tools.trace/trace-vars
a lot. I used Vinyasa to
make it really convenient to use by injecting trace-vars
into all
namespaces in the REPL, and use it thus:
1 2 3 4 |
|
Fogus’s breakpoint macro
The venerable Joy of Clojure has the source code for a complete Clojure breakpoint system. Reading this code was one of those “Clojure, wow” moments…
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 |
|
Really, it’s just a toy, but isn’t it incredible!
Other bits
ClojureScript
I saw the following ClojureScript macro for defining a traced function. I haven’t tried it, but it looks interesting:
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 |
|
Ring
Ring services seem like fertile ground for debugging and inspection problems. My next post will cover some basic Ring concepts from a problem-solving point of view.
Sources and References
In addition to the pages referenced, the following resources have been useful to me:
- http://z.caudate.me/give-your-clojure-workflow-more-flow/ and http://dev.solita.fi/2014/03/18/pimp-my-repl.html (which I think draws upon the former)
- Image credit: http://frostedgardner.blogspot.com/2010/07/summer-weekend-finds.html