Spyder: Getting closer to a viable Mathematica alternative

We still do not quite have a Mathematica killer, but we are inching closer. Sage is supposed to fill that niche, but I always found its browser-based notebook interface tedious. Spyder, on the other hand, is a nifty piece of software. Combined with a couple of other tools, it is set to become an indispensible tool for symbolic calculations, and it is also has a collection of slick features that Mathematica does not.

Preliminaries

The discussion uses Spyder 2.2.1, IPython 0.13.2, Python 2.7.5, SymPy 0.7.3, NumPy 1.7.1, and Matplotlib 1.3.0. To enable symbolic calculations and nice printing, tick Tools->Configuration->IPython console->Advanced Settings->Use symbolic math. The disadvantage of this that if you restart the IPython console, you may not get your SymPy libraries back. In this case, set up the IPython startup file accordingly, and also at this to ~/.config/ipython/profile_default/ipython_config.py:

c.InteractiveShellApp.extra_extension = sympy.interactive.ipythonprinting

This formats mathematical formulae nicely.

For decent plotting, activate Tools->Configuration->IPython console->Support for graphics (Pylab). It is also a good idea to automatically load Pylab and NumPy.

Interface

The main window of Spyder is a hybrid between Mathematica and Matlab. Most action happens in the IPython console, which numbers inputs and outputs just like Mathematica. Moving to previously issued commands is more bash-like. The IPython shell's content can be saved, but only the instructions -- internal states are not preserved. You are also augmented by an editor and a history log, so you can easily fish out the relevant bits and save them to a script. While Mathematica's notebook interface reigns king, it is much easier to write reproducible code this way.

The main window of Spyder

The main window of Spyder

The editor itself is okay. Basic Python idiosyncrasies, such as indentations and the occasional colon, are catered for, and there is also simple spelling completion. A neat shortcut is F9, which runs the current selection. Otherwise F5 will run the current script, and Ctrl+F6 will run the previous execution.

The IPython console prints mathematical formulae the way we like them:

In [2]:
diff(sin(x*y)**x,x)
Out[2]:
\left(\frac{x y \cos{\left (x y \right )}}{\sin{\left (x y \right )}} + \log{\left (\sin{\left (x y \right )} \right )}\right) \sin^{x}{\left (x y \right )}

Plots are either displayed in-line (Mathematica style), or in a window, like in Matlab:

spyder_plot

Symbolic Calculations

SymPy takes care of symbolic manipulations. It is quite efficient, its performance is comparable to the C++ library SymbolicC++. It does numerical evaluation only when it is asked to, just like Mathematica:

In [3]:
sqrt(2)*pi
Out[3]:
\sqrt{2} \pi
In [4]:
N(sqrt(2)*pi)
Out[4]:
4.44288293815837

Simplify and expand are here to solve your grade-five algebra homework:

In [5]:
from sympy import sin, cos, simplify
In [6]:
a = (x + x**2)/(x*sin(y)**2 + x*cos(y)**2)
In [7]:
a
Out[7]:
\frac{x^{2} + x}{x \sin^{2}{\left (y \right )} + x \cos^{2}{\left (y \right )}}
In [8]:
simplify(a)
Out[8]:
x + 1
In [9]:
((x + y + z)**2).expand(basic=True)
Out[9]:
x^{2} + 2 x y + 2 x z + y^{2} + 2 y z + z^{2}

Limits also work:

In [10]:
from sympy import limit
limit(1/x,x,oo)
Out[10]:
0

Sympy also has functions for more advanced applications, such as Hermitian Operators for quantum physics simulations.

Functional Programming

Mathematica is first and foremost a functional programming language, but due to its highly opportunistic nature, it also caters to procedural programmers. Python is just about that opportunistic, throwing in object-oriented models in the mix.

Mathematica's almighty Map function has a rough equivalent in the Pythonesque list comprehensions. In Mathematica, you write:

#^2 & /@ {x, y, z}

The Python variant is:

In [11]:
[ s**2 for s in [x, y, z]]
Out[11]:
\begin{bmatrix}x^{2}, & y^{2}, & z^{2}\end{bmatrix}

Python may also use a map function to the same extent.

The lambda operator was introduced for creating anonymous functions -- this is similar to Mathematica's pure function (&). While pure functions dot every Mathematica notebook, lambda functions are far clunkier to use. They come handy in some cases:

In [12]:
map(lambda x: x%3 == 0, [1, 2, 3])
Out[12]:
[False, False, True]

MapIndexed can be emulated by the enumerate function:

In [13]:
[ [a,i] for i,a in enumerate([sqrt(2), pi, x])]
Out[13]:
\begin{bmatrix}\begin{bmatrix}\sqrt{2}, & 0\end{bmatrix}, & \begin{bmatrix}\pi, & 1\end{bmatrix}, & \begin{bmatrix}x, & 2\end{bmatrix}\end{bmatrix}

Do not have high expectations, though. Functional programming in Python was an afterthought. Nest and FixedPoint would be nice to have.

Mathematica has nice parallel routines, and lets you launch parallel kernels. The solution is not spectacularly great, for instance, parallel kernels go through a different initialization than the default kernel. Yet it is nice that if you adhere to the functional programming paradigm, your code will more or less automatically run in parallel. With a bit of extra work, IPython can do something similar. If you have four cores, then launch an IPython parallel controller with four engines:

$ ipcluster start -n 4

This starts the engines and creates a couple of files in ~/.config/ipython/profile_default/security. Those files will be parsed when you start the IPython shell. Then create a parallel environment in the shell:

from IPython.parallel import Client
rc = Client()
dview = rc[:]

You are ready to use your parallel map function:

dview.map_sync(lambda x: x**10, range(32))

What Mathematica Cannot Do

Mathematica will never tell you how it arrived at a result. Given that the tools used here are open source, if you have enough time, you can always figure how a particular result was derived. A caveat: `enough time' might be a period longer than your life span.

Theoretically Spyder could run on my phone. It requires bootstrapping a Linux and starting X in the framebuffer. None of this is rocket science, but I have not tried it yet. VNC-based solutions to run X as a virtual screen are inefficient.

At the other end of the spectrum, if you have your code developed and you would like to run it on an HPC workstation, there is nothing preventing you from doing so. Say, you know your symbolic calculations are memory-hungry, then launch a cr1.8xlarge instance on Amazon Web Services with 244 GByte of memory, do some basic configuration, and launch your code. Mathematica ties your hands with licences, you cannot just upload your copy to a workstation of your choice. While the Mathematica Player is free and it comes with the full Mathematica kernel, its primary purpose is interaction, not headless execution on a cloud-based server.

IPython bridges different levels of the software stack. I can do a symbolic integral, then run a bash command:

display(integrate(sin(x),x))
!uname -a

I cannot come up with an obvious application scenario, but executing external commands in Mathematica was always troublesome.

Debugging is not a strength of Mathematica, perhaps under the assumption that good functional programmers never-ever make a mistake. Spyder is more realistic in this regard. Breakpoins, conditional breakpoints, and the usual assortment of debugging tools are at your fingertips. You may also have a chance that you get meaningful error messages, which seldom happens with Mathematica (and never with GCC).

Grievances

Most issues go back to Python itself and have nothing to do with Spyder per se. One exception is the way to break execution in the IPython console. This is annoying. The IPython interpreter is split in two tabs: the client where you issue commands, receive output and error message, and do all your work, and a `console', where the execution takes place. Breaking the execution is a stochastic process: keep pressing Control+C in the console and the client, and hope that your execution stops. Strange enough, Mathematica suffers from similar problems. While Alt+, and Alt+. normally work, these shortcuts will fail you miserably if you are plotting an intricate diagram. This only got worse in recent Mathematica versions, as Wolfram is keen on separating the kernel and and the graphics engine, leading to frequent freezes of the interface. ListPlot3D is the worst offender.

Forget multicore execution with Python. Python's Global Interpreter Lock will not let you get away with simple solutions, unless you can express the problem in functional programming terms. You must run multiple Python processes to have parallelism, and then the onus is yours to organize communication, share data, pass messages, and so on. OpenMP? Not for Python. Crazy as it is, C code is easier to parallelize than Python.

A curse of Python is the proliferation of mutually incompatible interpreters. The Mathematica kernel does not suffer from this, and if there are changes which are not backward compatible, the interface guides you through the changes required in your notebook. With Python, your only option is to cross your fingers and hope your code will not break between versions x.y.z.a.b.c and x.y.z.a.b.d.

Performance is not outstanding, but it is not terrible either. If you are among the lucky few, then your code will work with Pypy. Executing a programme with Pypy can be four to six times faster than with the default CPython interpreter. Let us not talk about memory use, though, that is bound to be abysmal with any given Python interpreter or compiler.

5 Comments

  1. János Tóth says:

    (I write a comment when I have some time "left"...)
    >Mathematica will never tell you how it arrived at a result.
    1. You can learn quite a lot from using Trace (although not easily).
    2. Using the verbal interface you may ask for the details appropriate even for learning purposes.
    3. Reading codes in earlier packages may also be helpful.
    >Debigging
    Péter, have you ever tried Workbench? I didn't but it promises to have breakpoints, step-by-step evaluatio etc. But I prefer avoiding mistakes :)

    1. Peter Wittek says:

      I did not know about Trace, although it has been around since Mathematica 2. Monitor is also handy.

      Workbench is a curious beast. Having looked at the videos, I see that it brings the same Eclipse-based software engineering tools to Mathematica that every other language from CUDA to Haskell already had. Debugging is a bit cumbersome: an external notebook interface must launch, and Workbench shares the kernel with this notebook. I am not sure what happens if multiple kernels need debugging. Syntax highlighting and version control are nice additions. Workbench would not install in a 64-bit Eclipse -- we are in 2013 -- so I cannot try it. Charging $150 for the stand-alone version is hard to justify, and I doubt if our dean would sponsor it.

  2. […] topics, most notably, reproducible academic publications. I am a fan of IPython, combined with a few other tools, it makes a great replacement for Mathematica, but I never thought of it as a way to make research […]

  3. beginner computer algebra via crypto elliptical bada55 (badass) says:

    haskell cash openmath

    hackage.haskell.org/package/cash
    Haskell
    Mar 26, 2012 - A Computer Algebra Shell for Haskell. CASH connects via SCSCP and OpenMATH to Computer Algebra systems with SCSCP (such as GAP).

    python is better for libraries in area of scientific and plotting, however
    haskell is small developer and 'scattered.'
    BUT HASKELL IS STRONGER (in my personal opinion)

    the goal is relatively simple test of the crypto badass or bada55 tests.
    assessment: likely doable, but I may do only a small piece.

    thanks. remember all lang have own strength according to preference.

  4. beginner computer algebra via crypto elliptical bada55 (badass) says:

    python and GIL and 'strange gotcha'

    perhaps Lisp, ML, OCAML or other are STRONGER, but given the 'limitation is
    the brainpower of the coder'
    python calling haskell which will nicely check and wrap the C for
    PARALLEL to the super-computer cloud or haskell cloud is the architecture I prefer.

    general strong architecture
    Coq and/or ML - proofs
    Haskelll math to logic
    Python - plotting (which is interface to human (or impure) etc.
    C some fast math
    other....

Leave a Reply

Your email address will not be published. Required fields are marked *

You may use these HTML tags and attributes: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <strike> <strong>