RSS
热门关键字:  数据挖掘  数据仓库  人工智能  搜索引擎  数据挖掘导论

Which programming language should I use?

来源: 作者:unkonwn 时间:2004-12-10 点击:

Some language comparisons

And the winner is...

I ended up choosing Ocaml (see why Ocaml?) for this one particular project which involved statistical natural language processing. (Update 29 Nov 2005: word on the street is that Python is definitely the language of choice these days...)

However, since then, I have gone back to my old friend matlab (see below), which for rapid prototyping of computer vision and statistics is hard to beat...

数据挖掘实验室

Comparison of Matlab, R/S/Splus, Gauss, etc.

  • Comparison of mathematical programs for data analysis, Stefan Steinhaus, tech report, 2000.
    This is a very detailed comparison of features and speed of several interactive scientific programming environments, e.g. Matlab, Mathematica, Splus.
  • User comparisons of several interactive langauges
  • Matlab vs R discussion, April 2004. Note: you can call matlab from R and vice versa.
  • Short but sensible comparison of Splus and Matlab, 2003.
  • Econometric programming environments: Gauss, Ox and S-PLUS, Francisco Cribari-Neto. J. of Applied Econometrics, 12(1):77-89, 1997

    数据挖掘研究院


    Ox can not be used interactively, and has a C-style syntax (it even requires users to pre-declare variables!). Its only advantage is speed. S-Plus has tons of features and good documentation, but is slow. Gauss is somewhere in between.
  • MATLAB as an econometric programming environment, Francisco Cribari-Neto and Mark J. Jensen. J. of Applied Econometrics, 12(6):735-432, 1997.
    The basic conclusion is that Matlab has excellent graphics and sparse-matrix facilities, but is slower than Gauss/Ox (especially on code with loops), and has few statistical routines built-in (one must buy the stats toolbox).
  • R: Yet another econometric programming environment, Francisco Cribari-Neto and S. Zarkos. J. of Applied Econometrics, 14(3):319-329, 1999.
    The basic conclusion is that R is much faster than Splus on code with loops, but a little bit slower on vectorized code. (Gauss/ Ox is much faster than both; in my experience, R and Matlab have about the same speed.) However, R has much better memory management than Splus, and R is free. Otherwise, R/S/Splus are essentially the same.
  • Scilab, an open source alternative to Matlab.
  • Octave, an open source version of matlab.
  • Lush, Yann Le Cun′s lisp-like Matlab replacement. It seems to meet many of the desiderata above (although it does not work on windows), and has proven adequate for real time computer vision and large-scale machine learning experiments.
  • PVwave, described by John Fisher as "Matlab on steroids". It is designed for data analysis and visualization.
  • R, an open-source version of S. Click here for a list of pros and cons for rewriting BNT in R. Click here for a new project to implement a graphical models library in R.
最新评论共有 0 位网友发表了评论
发表评论
评论内容:不能超过250字,需审核,请自觉遵守互联网相关政策法规。
匿名?