Software
and datasets
The
AutoClass Project
: AutoClass takes a database
of cases described by
a combination of real
and discrete valued attributes,
and automatically finds
the natural classes in
that data.
Autocoder Demo
(text classification)
C4.5 r8
: R. Quinlan's program
for top down induction
of decision trees
CBA
Data mining tool :
CBA is a data mining tool
developed at School of
Computing, National University
of Singapore. CBA originally
stands for Classification
Based on Associations.
However, it turns out
that it is much more powerful
than simply producing
an accurate classifier
for prediction. It can
also be used for mining
various forms of association
rules, and for text categorization
or classification.
Con-x
Connectionist Backprop
Language and Simulator
: Con-x (pronounced "kun
ex") is a neural network
scripting language and
environment, designed
to be used by serious
backprop researchers,
as well as a teaching
tool for use in introductory
AI courses
DELVE
- Data for Evaluating
Learning in Valid Experiments
: Delve is a standardised
environment designed to
evaluate the performance
of methods that learn
relationships based primarily
on empirical data. Delve
makes it possible for
users to compare their
learning methods with
other methods on many
datasets. The Delve learning
methods and evaluation
procedures are well documented,
such that meaningful comparisons
can be made.
FastICA package
for MATLAB : the FastICA
package is a public-domain
MATLAB program that implements
the fast fixed-point
algorithm for independent
component analysis and
projection pursuit. It
features an easy-to-use
graphical user interface,
and a computationally
powerful algorithm.
FFOIL r2
: R. Quinlan's program
for inductive logic programming
FOIL r6
LIBSVM by Chih-Chung
Chang and Chih-Jen Lin
: LIBSVM is an integrated
tool for support vector
classification, (C-SVC,
nu-SVC
), regression (epsilon-SVR,
nu-SVR)
and distribution estimation
(one-class
SVM ). It supports
multi-class classification.
The basic algorithm is
a simplification of both
SMO
by Platt
and SVMLight
by Joachims. It is also a simplification
of the modification2
of SMO by Keerthi
et al.
MLC++
Home Page (SGI) :
MLC++ is a library of
C++ classes for supervised
machine learning. MLC++
was initially developed
at Stanford University
and is now distributed
by SGI.
Microsoft Belief
Network Tools : an
application developed
by the Decision Theory
Adaptive Systems Group
within Microsoft Research.
It allows the creation,
assessment and evaluation
of Bayesian belief networks.
Software
by Radford Neal Available
On-Line : Flexible
Bayesian modeling and
Markov chain sampling,
Low Density Parity Check
(LDPC) codes, arithmetic
coding for data compression.
Neural Network
Toolbox for MATLAB
: this toolbox provides
a complete set of functions
and a graphical user interface
for the design, implementation,
visualization, and simulation
of neural networks. It
supports the most commonly
used supervised and unsupervised
network architectures
and a comprehensive set
of training and learning
functions
Neural Networks
at your Fingertips
: simulator for Adaline,
Backprop, Hopfield nets,
Bidirectional Associative
Memories, Boltzman Machine,
Counterpropagation, Self-organizing
maps, Adaptive Resonance
Theory
NeuroForecaster
GENETICA : full 32-bit
implementation for Windows
for general-purpose business
and financial forecasting.
Performs time-series analysis,
cross-sectional classification
and indicator analysis.
NEURON
: NEURON is a simulation
environment for developing
and exercising models
of neurons and networks
of neurons. It is particularly
well-suited to problems
where cable properties
of cells play an important
role, possibly including
extracellular potential
close to the membrane),
and where cell membrane
properties are complex,
involving many ion-specific
channels, ion accumulation,
and second messengers.
It evolved from a long
collaboration between
Michael Hines and John
W. Moore at the Department
of Neurobiology, Duke
University.
The
NICO ANN Toolkit Home
Page : the NICO Toolkit
is an artificial neural
network toolkit designed
and optimized for speech
technology applications.
It is easy to construct
neural networks with both
recurrent connections
and/or time-delay windows
to capture temporal features.
The network topology is
very flexible -- any number
of layers is allowed,
and layers can be arbitrarily
connected. Powerful tools
for sparse connectivity
are also included. Tools
for extracting input-features
from the speech signal
are also part of the toolkit,
as well as tools for computing
target values from many
common phonetic label-file
formats.
The NN learning
algorithm benchmarking
page : proper benchmarking
of (neural network and
other) learning architectures
is a prerequisite for
orderly progress in this
field. In many published
papers deficiencies can
be observed in the benchmarking
that is performed. A workshop
about NN benchmarking
at NIPS*95 addressed the
status quo of benchmarking,
common errors and how
to avoid them, currently
existing benchmark collections,
and, most prominently,
a new benchmarking facility
including a results database.
This page contains pointers
to written versions or
slides of most of the
talks given at the workshop
plus some related material.
The page is intended to
be a repository for such
information to be used
as a reference by researchers
in the field.
The NNCTRL
Toolkit. Neural networks
for control : the
NNCTRL toolkit is a set
of tools for design and
simulation of control
systems based on neural
networks. The toolkit
is an add-on to the NNSYSID
toolbox, which is
a toolbox for system identification
with neural networks.
Version 2 requires MATLAB
5.3 or higher. For MATLAB
4.2-MATLAB 5.2 it is possible
to use the old Version
1. The toolkit contains:
Control by feedback linearization.
Direct inverse control.
Internal model control.
Optimal control. Control
using instantaneous linearization
(includes approximate
pole placement, approximate
minimum variance and approximate
GPC control). Nonlinear
Generalized Predictive
Control. Nonlinear Feedforward
Control.
PDP++ Home Page
: the PDP++ software is
a neural-network simulation
system written in C++.
It represents the next
generation of the PDP
software originally released
with the McClelland and
Rumelhart "Explorations
in Parallel Distributed
Processing Handbook",
MIT Press, 1987. It is
easy enough for novice
users, but very powerful
and flexible for research
use.
Old PDP package
The Perceptron
: a simple simulator for
the Perceptron learning
rule
PlaNet5.7
Pygmalion
R:
the Comprehensive R Archive
Network : R is `GNU
S', a freely available
language and environment
for statistical computing
and graphics which provides
a wide variety of statistical
and graphical techniques:
linear and nonlinear modeling,
statistical tests, time
series analysis, classification,
clustering, etc.
RuleQuest
Research Data Mining Tools
(C5.0, Magnum Opus)
SNNS- Stuttgart
Neural Network Simulator
Spike-neuralog
SOM_PAK,
LVQ_PAK
SOM Toolbox for
Matlab
SUBDUE
Knowledge Discovery in
Structural Databases
StatLog: Evaluation
- Characterization of
Classification Algorithms
: this work was supported
by Esprit Project 5170
StatLog (1991-94). This
project was concerned
with comparative studies
of different machine learning,
neural and statistical
classification algorithms.
About 20 different algorithms
were evaluated on more
than 20 different datasets.
The tests carried out
under this project produced
many interesting results.
Site contains datasets
and algorithms
SVM-Light Support Vector
Machine : SVMlight
is an implementation of
Support Vector Machines
(SVMs) in C written by
T. Joachims
TiMBL:
Tilburg Memory Based Learner
: TiMBL is a program implementing
several Memory-Based Learning
techniques. TiMBL stores
a representation of the
training set explicitly
in memory, and classifies
new cases by extrapolation
from the most similar
stored cases. Several
metrics and algorithms
are implemented in TiMBL;
among others: Information
Gain weighting for dealing
with features of differing
importance (IB1-IG), and
the Modified Value Difference
metric for making graded
guesses of the match between
two different symbolic
values. TiMBL is optimized
for fast classification
by using several indexing
techniques and heuristic
approximations (such as
IGTREE and TRIBL).
Tlearn software page
UC
Irvine KDD Archive
UC Irvine
Machine Learning Datasets
Repository
UC Irvine Machine
Learning Programs
WEKA
Machine Learning Project
: several standard ML
techniques into a software
"workbench" called WEKA,
for Waikato Environment
for Knowledge Analysis
|