Version 52 of Comparing Tcl database abstraction layers

Updated 2007-10-17 18:08:00 by groks

On this page is a side-by-side comparison of the various database abstraction layers.

There is a wealth of different, somewhat similar database abstraction layers available for Tcl. None of the available solutions has achieved some kind of champion status as most developers use the native bindings to their database instead of one of these wrappers currently. -- schlenk 2005-11-15


Packages Compared

  • DIO - Apache License
  • nstcl-database - MIT/X11 License
  • tcldb [L1 ] - BSD License
  • TclODBC (also applies to SnODBC) - BSD License
  • XOSql - GPL License
  • sqlite - Public Domain (for comparison, as it's often cited for its nice interface)

Supported Databases

  • MySQL - DIO, nstcl, tcldb, tclodbc, XOSql
  • PostgreSQL - DIO, nstcl, tcldb, tclodbc, XOSql
  • SQLite - DIO, nstcl, tcldb, tclodbc, XOSql, sqlite3
  • ODBC - nstcl, tcldb, tclodbc, XOSql
  • DB2 - tclodbc, *
  • Oracle - DIO, nstcl, tcldb, tclodbc, XOSql
  • Sybase/MSSQL - nstcl, tclodbc
  • Solid - nstcl, tclodbc
  • Perl DBI - XOSql

(* DB2 probably supported by any package that supports ODBC; many other databases supported by ODBC.)

The amount of work needed to add support for a database varies. The nstcl, DIO, and XOSql group need slightly less work for a new database then tcldb, as their APIs are smaller.

OO Style

  • DIO - incrTcl
  • nstcl - none
  • tcldb - incrTcl
  • tclodbc - none, but follows obj method args style
  • XOSql - XOTcl
  • sqlite3 - none, but follows obj method args style

executing a SQL query

DIO

  $db exec $query

nstcl*

  db_dml statement1 $query

tcldb

  $db exec $query 

tclodbc

  $db $query

XOSql

  $db execute $query

sqlite3

  $db eval $query

NB: (* This is for a query modifying a value)


getting a single value from an SQL query

DIO

  set string [$db string $query]

nstcl

  set string [database_to_tcl_string $db $query]

tcldb

  set string [lindex [$db query 1row $query] 0]

tclodbc

  set string [lindex [$db $query] 0]

XOSql

  set string [lindex [[$db query $query] fetch] 0]

sqlite3

  $db onecolumn $query

Getting one row of a SQL query into an array

DIO

  $db array $query $arrayVar

nstcl*

  db_1row statement2 $query -columnVar $arrayVar

tcldb**

  set result [$db query 1row $query]
  foreach key $keys value $result {
      set $arrayVar($key) $value
  }

tclodbc

  $db read arrayVar $query

XOSql

  set rObj [$db query $query]
  set keys [$rObj columNames]
  set values [$rObj fetch]
  foreach key $keys value $values {
      set $arrayVar($key) $value
  }

sqlite3

  $db eval $query arrayVar { break }

NB:

(* nstcl errors out if 0 or more than one row is returned)

(** tcldb has no easy way to discover column names, but provides an extra class for table management)


Loop over the rows of a SQL query (set query "SELECT id, name FROM person")

DIO

  set rObj [$db exec $query]
  $rObj forall -array result {
      puts "$result(id) -> $result(name)"
  }

nstcl

  db_foreach statement3 $query {
      puts "$id -> $name"
  }

tcldb

  $db query foreach {id name} $query {
      puts "$id -> $name"
  }

tclodbc

  proc printRows {id name} {
      puts "$id -> $name"
  }
  $db eval printRows $query

XOSql

  set rObj [$db query $query]
  while {[llength [set row [$rObj fetch]]]} {
      puts "[lindex $row 0] -> [lindex $row 1]"
  }

sqlite3

  $db eval $query row {
      puts "$row(id) -> $row(name)"
  }

  $db eval $query {
      puts "$id -> $name"
  }

NB:

nstcl's style risks accidental overwriting of variables, which could lead to security problems.


Getting the first column of a SQL query as list

DIO

  set list [$db list $query]

nstcl

  set list [db_list statement4 $query]

tcldb

  set list [list]
  set llist [$db query list $query]
  foreach row $llist {
      lappend list [lindex $row 0]
  }

tclodbc

  foreach row [$db $query] {
      lappend list [lindex $row 0]
  }

XOSql

  set rObj [$db query $query]
  set list [list]
  while {[llength [set row [$rObj fetch]]]} {
      lappend list [lindex $row 0]
  }

sqlite3

  $db eval $query row {
      set col1 [lindex $row(*) 0]
      lappend list $row($col1)
  }

Getting the whole result as nested list of lists

DIO

  set llist [list]
  set rObj [$db exec $query]
  $rObj forall -list row {lappend llist $row}

nstcl

  set llist [db_list_of_lists statement5 $query]

tcldb

  set llist [$db query list $query]

tclodbc

  set llist [$db $query]

XOSql

  set llist [$db queryList $query]

sqlite3

  $db function mklist ::list
  $db eval {select mklist(a,b,c) from ...}

Get the number of affected rows for a query

DIO

  set rObj [$db exec $query]
  set numRows [$rObj numrows]

nstcl* ?

tcldb**

tclodbc

  $db statement stmt $query
  stmt execute
  set numRows [stmt rowcount]
  stmt drop

XOSql

  set rObj [$db execute $query]
  set numRows [$rObj rows]

sqlite3

  $db eval $query
  set numRows [$db changes]

NB:

(* no idea after just looking at the docs)

(** not exposed, there is a protected internal function)


Inserting a new row into the database

'DIO'

  $db insert $arrayVar -table demo

nstcl*

  set id $arrayVar(id)
  set name $arrayVar(id)
  db_dml statement {
      insert into demo (id , name) values (:id, :name)
  }

tcldb

  $db insert demo {id name} [list $arrayVar(id) $arrayVar(value)]

  $db exec {insert into demo (id,name) values ('@ID@','@NAME@')} \
      id $arrayVar(id) name $arrayVar(name) 

tclodbc

  $db {insert into demo(id,name) values(?,?)}  $id $name

XOSql

  $db insertRow demo {id name} [list '$arrayVar(id)' '$arrayVar(value)']

sqlite3

  $db {insert into demo(id,name) values($id,$name)}

NB:

(* Not sure if the bind variable feature supports arrays, the first two lines may be superfluous.)


Inserting a new row with automatic id

DIO

  $db insert $arrayVar -table demo \
      -keyfield id -autokey 1 -sequence demo_seq

nstcl*

  set name $arrayVar(name)
  db_dml statement {
      insert into demo (id, name)
      values (
          (select * FROM nextval(demo_seq)
      ), :name)
  }

tcldb***

  $db insert_id id {name} [list $arrayVar(name)]

tclodbc*

  (same as nstcl)

XOSql**

  $db rowInsertAutoId demo name [list $arrayVar(name)] id $sequencer

sqlite3*

  (same as nstcl?)

NB:

(* basically no support for automatic ids; use what the underlying database provides)

(** I couldn't figure out from the docs what exactly has to be provided by sequencer)

(*** Tcldb has support functions to create a db-specific autoincrement serial key.)


Delete a record from the database by primary key

DIO

  $db delete $key -table demo -keyfield id

nstcl

  db_dml statement {
     delete from demo where id = :id d
  }

tcldb

  $db delete demo id $id

tclodbc

  $db {delete from demo where id = ?} $id

XOSql*

  $db execute "delete from demo where id = $id"

sqlite3

  $db eval { delete from demo where id = $id }

NB:

(* not sure if any quoting is done; may be a security problem)


Transaction support

DIO

  $db exec {BEGIN TRANSACTION}
  ...
  # do some operations
  ...
  $db exec {COMMIT TRANSACTION}

nstcl*

  db_transaction {
     ...
     # do some operations
     ...
  }

tcldb

  $db transaction {
      ...
      # do some operations
      ...
  }

tclodbc

  $db set autocommit off
  ...
  # do some operations
  ...
  $db commit

XOSql

  $db execute {BEGIN TRANSACTION}
  ...
  # do some operations
  ...
  $db execute {COMMIT TRANSACTION}

sqlite3

  $db transaction ?type? {
     ...
  }

NB:

Basically neither DIO nor XOSql seem to have any real transaction support. nstcl supports optional code to eval in case of errors during a transaction to decide on commit or rollback. Tcldb controls commit/rollback based on the Tcl exit code from the code block. Sqlite3 allows specification of an optional type, which can be "deferred", "exclusive", or "immediate" (see docs for details). It also controls commit/rollback based on the exit code of the Tcl script (i.e., whether an error was thrown). You can nest Sqlites transaction blocks and only the outermost one will actually do anything. This means you can freely sprinkle transaction blocks throughout your code and it will do the Right Thing.

The examples above need to be reworked to handle errors using e.g. catch if the abstraction layer does not support something specific. It is usually expected that any error between the BEGIN and END will cause all work so far to roll back. Also, some people may be very surprised when one of their transaction blocks throws an error and leaves the handle in an unpredictable state for following queries, perhaps causing data loss, e.g. when all following inserts are added to an open transaction and never committed.


Quoting support, for dynamic queries

  • DIO* attempts autoquoting of values in queries
  • nstcl binding variables for queries with autoquoting functions to quote identifiers and values
  • tcldb binding variable for queries with autoquoting functions to quote values prepared statements parameter passing to the database
  • tclodbc binding variable for queries with autoquoting prepared statements parameter passing to the database optional type hinting
  • XOSql provides escape method for simple value quoting
  • sqlite3 autoquoting of variables within queries

NB:

* The current practice in DIO is insecure and is an SQL injection attack vector. Only values are quoted. Identifiers (table and field names) are passed to the database without any quoting. Value quoting seems broken and incomplete. It does not take SQL quoting rules into account, which may lead to data inconsistencies.

Example of SQL Injection:

  set table users
  set match {'\' OR 1==1; --}
  $obj exec "SELECT * FROM $table WHERE id = $match AND password = $passwd;"

Similar issues may be present in XOSql and nstcl but I did not take a closer look.

Sqlite3 looks similar but queries are brace-quoted and sqlite takes care of substituting variables, so while it looks like Tcl the values are all properly quoted.


NULL Handling

(How does the binding represent NULL values in retrieved data?)

  • DIO ???
  • nstcl ???
  • tcldb ???
  • tclodbc ???
  • XOSql ???
  • sqlite3 $db nullvalue "NULL"

NB:

Sqlite3 allows a special string to be used to represent NULL in query results. The default is to convert NULLs into the empty string ("").

NEM: My opinion is that the most natural way to model NULLs in Tcl would be to simply not have the column variable available in the resulting row. To make that concrete, there are a number of ways that could be accomplished: firstly, if rows were represented as dicts instead of lists, then you simply omit the missing column. Then you can do:

  proc null? {row col} { dict exists $row $col }

Alternatively, several of the extensions allow iterating over the result set using an array variable to hold each row. Again, you could simply then omit the column of missing rows (sqlite3 syntax):

  $db eval $query row {
      if {![info exists row(mycol)]} {
          # NULL
      }
  }

This is obviously tricker when returning all results at once, where a nested list-of-lists representation is natural, but I think I could live without that (or changed to a list-of-dicts solution).


TODO

  • need to mention how to get a handle (should be first section)
  • do any of these packages support handle pooling?
  • finish section on NULL values, input/output
  • binary data, input/output -- do any of these packages support this?
  • need to make sure samples being compared are reasonable, e.g. not encouraging SQL injection attacks, missed transaction rollbacks etc.
  • performance comparison
  • compare other RDBMS abstraction layers: dbConnect, tcl dbi, Oratcl ?
  • compare with non-RDBMS packages ?

(Possibly non-RDBMS packages are out of scope.)


jcw - Terrific overview.

A bit secondary, but perhaps also useful would be comparisons with non-SQL databases (e.g., Metakit, OOMK, and Ratcl, though it may be a bit early for the last). I admit that this strays somewhat from the term "abstraction", but knowing how several specific bindings solve problems which are really very similar may be of use in evaluating the trade-offs made in the other approaches.

schlenk - I did/do not use TclODBC so if someone else feels it should be added, feel free to do so. On the current level of examples (basically simple things), a direct comparison with non-SQL databases like Metakit could be easily done. If that is the intent, this page should be refactored into one page per example, where the example could be described and then implemented with multiple different DB interfaces, and a general overview just listing interfaces, supported databases, SQL support, license model etc. with links to the individual examples.

jcw - FWIW, I've set up a tentative comparison for Ratcl at [L2 ].

tjk - Thank you for this excellent review. I would very much like to use a db extension to interface with MySQL. To date I have been reluctant to commit to an extension because of lack of good comparative data (currently I use pure tcl). My real fear, when making a selection, is selecting a package with missing capabilities so a section that highlights known missing features would be a nice addition.

schlenk The problem with comparing missing capabilities is there are so many unique features for individual database access libraries that a database abstraction layer either has to carry lots of emulation code around or target a functional overlap between all the supported database layers. I for example like the way tcldb abstracts some of the differences between databases away, others may prefer a light weight abstraction layer that abstracts sending queries and doing the connection stuff, but does not help in writing portable SQL queries. So the only way to do it: Make a list of capabilities you want to use/expect from one of these db layers, and then this page can probably be extended with further comparisons for the capabilities not yet listed/compared.

LV 2007 Oct 17 Anyone want to look over TDIF to see how it compares in the above categories? And are there others that could be compared? Perhaps tcldbi? Or some of the other packages listed on interacting with databases?


Category Database