This is the last snapshot before I graduated from Caltech.
AdaBoost_ERP added; it is mentioned in my paper Multiclass Boosting
with Repartitioning.
Cross-validation classes CrossVal,
vFoldCrossVal, and HoldoutCrossVal added.
They can also be used as learning models. See test/testsvm.cpp
for a demonstration.
Ordinal_BLE added; it was developed in the early stage of
the paper Ordinal Regression by
Extended Binary Classification.
It is outdated and will probably be rewritten in
the future to keep up with the paper.
SVM changes: support vectors are copied out from LIBSVM
(which can save some memory);
Kernel and thus SVM can be saved/loaded.
DataFeeder can add flipping noise (set_train_noise()) and can be reset (reset()).
Conversion between some boosting models is now possible;
(multiclass) data sets may be loaded after an ECOC table is set.
MultiClass_ECOC and
AdaBoost_ECOC. See test/multi.cpp
for an example with one-vs-all.
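For illustration, the sketch below builds the one-vs-all coding table as a plain matrix of +1/-1 entries; it is conceptual (the function name one_vs_all_table is made up for this example) and does not use the actual MultiClass_ECOC/AdaBoost_ECOC interface, for which test/multi.cpp is the reference.

    // Conceptual sketch: the one-vs-all ECOC table for nclass classes.
    // Row c is the codeword of class c; column k is the k-th binary problem,
    // which treats class k as +1 and every other class as -1.
    #include <vector>

    std::vector< std::vector<int> > one_vs_all_table (unsigned nclass) {
        std::vector< std::vector<int> > tab(nclass, std::vector<int>(nclass, -1));
        for (unsigned c = 0; c < nclass; ++c)
            tab[c][c] = +1;
        return tab;
    }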
The margin()/signed_margin()
functions scattered in different classes were consolidated into four functions in
LearnModel: margin(), margin_of(),
min_margin(), and margin_norm().
The first three give unnormalized margins and the last is the
normalization term.
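As a usage sketch (the header name and the exact signatures are assumptions, not the documented interface), the normalized minimal margin of the training set would then be the unnormalized minimal margin divided by the normalization term:

    #include "learnmodel.h"    // header name assumed

    // Sketch only: assumes min_margin() and margin_norm() are const members
    // taking no arguments, and that lm has already been trained.
    double normalized_min_margin (const lemga::LearnModel& lm) {
        return lm.min_margin() / lm.margin_norm();
    }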
DataFeeder added. It is handy
when data splitting/normalization is needed.
SVM needs a small modification.
MultiClass_ECOC added.
Fixes: SVM::w_norm(),
invalid cache in Boosting::initialize(),
better Boosting::get_output() when no cache is used,
and a typo in RBF::matrix.
Perceptron added. It implements several
perceptron learning algorithms mentioned in my paper
Perceptron Learning with Random
Coordinate Descent.
LPBoost added.
Hsuan-Tien Lin
contributed the code which uses
GLPK.
Kernel class was added since kernels
can be used for algorithms other than SVM.
SVM: Can be cloned. More internal information can be obtained, such as the 2-norm of the weight vector, the support vectors, and the coefficients.
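A small sketch of what such queries could look like; w_norm() appears elsewhere in this changelog, while the header name, the constness, and the exact declaration of clone() are assumptions:

    #include "svm.h"    // header name assumed

    void inspect (const lemga::SVM& s) {
        double wn = s.w_norm();                                   // 2-norm of the weight vector
        lemga::SVM* dup = dynamic_cast<lemga::SVM*>(s.clone());   // polymorphic copy; cast assumed
        // support vectors and coefficients would be queried through the
        // version-specific accessors (names not fixed here); wn is the margin scale
        delete dup;
    }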
Boosting, including
AdaBoost and CGBoost.
load_data() can auto-detect the input dimension.
randn().
c_error() and r_error() are now in
LearnModel.
Pulse bug (introduced in 0.1 beta) fixed:
Pulse could fail to choose the optimal hypothesis under
some conditions.
MgnBoost (Breiman's arc-gv) added. Test code
is added to test/adabst.cpp.
boosting::margin() gives the margin of an
individual training example, or the minimal margin of the training set.
Stump. Incomplete.
I haven't tested the new code with Visual C++.NET.
Aggregation renamed to Aggregating.
CGBoost added. CGBoost is better
than AdaBoost in optimizing cost functions. For details please refer
to the CGBoost
technical report (note that small modifications are required
in _conjugate_gradient (optimize.h) in order to
set β=0 for the first several iterations).
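Below is a conceptual sketch of that modification, not the actual _conjugate_gradient code; the Polak-Ribiere form of β and the names (cg_direction, warmup) are illustrative assumptions. Forcing β=0 during the warm-up turns those iterations into plain gradient steps.

    #include <cstddef>
    #include <vector>

    // Update the search direction d in place from the current and previous
    // gradients; beta is zeroed for the first `warmup` iterations.
    void cg_direction (std::vector<double>& d,
                       const std::vector<double>& g,
                       const std::vector<double>& g_prev,
                       unsigned iter, unsigned warmup)
    {
        double num = 0, den = 0;
        for (std::size_t i = 0; i < g.size(); ++i) {
            num += g[i] * (g[i] - g_prev[i]);     // Polak-Ribiere numerator
            den += g_prev[i] * g_prev[i];
        }
        const double beta = (iter < warmup || den == 0) ? 0 : num / den;
        for (std::size_t i = 0; i < g.size(); ++i)
            d[i] = beta * d[i] - g[i];            // d = -g + beta * d_prev
    }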
SVM and test code testsvm added.
LIBSVM,
modified to support weighted training examples, is used for actual
work. Currently only SVM classification with RBF kernel is supported.
Serialization/unserialization has not been implemented yet.
lemga::cost (cost.h) added.
It is an attempt to separate the cost functions from the learning/optimization
methods, and a temporary solution before functors are used.
Pulse and Stump improved.
Training a pulse function now takes O(n) time.
Pulse parameters.
dataset::replace
and member Boosting::min_err.
save() and load() replaced by a much
better serialization/unserialization implementation. Operator >>
is used for saving models, and << for loading. create(istream)
can create an unknown-type object from an input stream. (Thus the base
model in class Aggregation is no longer needed when loading
models.)
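A minimal sketch of how the new serialization might be used; the operand order (model on the left, stream on the right), the header name, and the qualification of create() are assumptions based on the description above.

    #include <fstream>
    #include "learnmodel.h"    // header name assumed

    void roundtrip (lemga::LearnModel& lm) {
        std::ofstream ofs("model.lm");
        lm >> ofs;                         // operator>> saves the model (operand order assumed)
        ofs.close();

        std::ifstream ifs("model.lm");
        lemga::LearnModel* copy =
            (lemga::LearnModel*) lemga::create(ifs);   // type is read from the stream itself
        // ... so the base model of an Aggregation need not be known here ...
        delete copy;
        // an existing object could instead be loaded in place with  lm << ifs
    }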
load_data() accepts an input stream instead of a FILE*
handle.
Pulse added. It is a multi-transition
phase (step) function. The best hypothesis with number of transitions
equal to or less than a given limit is returned. When the limit is 1,
pulse is almost the same as stump (the only difference is that pulse
may return a hypothesis with no transitions at all). The code has been
tuned so that it is even faster than stump when the limit is 1.
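To make the description concrete, here is a conceptual evaluation of such a multi-transition step function; it is an illustration only (pulse_output is not the Pulse interface):

    #include <cstddef>
    #include <vector>

    // Output of a pulse hypothesis on one feature value x: start at `sign`
    // (+1 or -1) and flip at every transition point that x has passed.
    int pulse_output (double x, const std::vector<double>& transitions, int sign = 1)
    {
        for (std::size_t i = 0; i < transitions.size(); ++i)
            if (x >= transitions[i]) sign = -sign;
        return sign;
    }
    // With an empty transition list the hypothesis is constant; with exactly
    // one transition it behaves like a stump.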
_line_search will early stop if
a non-descending direction is met. This change affects conjugate
gradient, boosting in the functional space, and the training of neural
networks. For example, convex boosting now returns a very large number
as the cost when empty, to avoid a non-descending direction at the first step.
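A conceptual sketch of the check involved, not the actual _line_search in optimize.h (the name is_descending is illustrative): a direction d is non-descending when the directional derivative g·d is non-negative, and the search stops early in that case.

    #include <cstddef>
    #include <vector>

    // True if d is a descent direction for the current gradient g.
    bool is_descending (const std::vector<double>& g, const std::vector<double>& d)
    {
        double dot = 0;
        for (std::size_t i = 0; i < g.size(); ++i)
            dot += g[i] * d[i];
        return dot < 0;    // non-negative g.d means no descent: stop the line search early
    }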
REGISTER_CREATOR simplifies the object creator
registration.
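For example, a single line in the model's source file (the exact argument form is an assumption) registers the creator so that the class can be re-created by name when a model file is loaded:

    // Sketch only; argument form assumed.
    REGISTER_CREATOR(Stump);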
Gradient descent with weight decay (_gd_weightdecay) added.
id() returns a const string instead of
char*.
A demo program (AdaBoost with Pulse)
and a model file checker (showlm) added.
copy() renamed to clone().
Bagging bag(n_in, n_out) simply becomes Bagging bag.
Boosting (via BoostWgt and
_boost_gd) reimplemented so that conjugate gradient
and some variants of gradient descent are possible (in the functional
space).
Object creator registration added (_register_creator);
constructors accepting istream as argument added (interface
only); version() was renamed to id() and no
longer contains version information.
vectorop became lemga::op.
It serves only for generic optimization in Lemga; only a small set of
operations is needed for optimization.
_shared_ptr.
create() added as a virtual constructor.
AdaBoost
removed.
Cascade class added.
The code rewriting is almost done and I've tested Lemga in one project (alphaBoost) with GCC 2.96, 3.0.1, and 3.2.1. Models and algorithms currently coded are: