Design & Learning on VW
VW is a very popular machine learning tools, with many fancy features: feature hashing, online learning and even support distributed running of the application.
I’ve always been very interested in the internal details about this tool, I want to learn more about this tool. Recently, I started to reading the source code of this tool.
This post is about the interesting design of this tool.
Feature Representation
For every machine learning tools, the most important stuff is the representation of features it accepts.
For VW, the feature still represented in <key, value> pair.
IO & Data Parsing
In order to handle the IO, VW use a custom class to represent the opening files.
One interesting thing about data parsing: the structure of sample line follow a LL-parser?
Feature Combination
VW has the concept of feature namespace. I think this is a essential feature for large scale machine learning, when we have multiple source of features. One of the usage is ngram of features between different namespace.
VW support general interaction, which involving multiple namespace. But widely used options are just quadratic & cubic feature combination.
Another interesting about VW: only 256 feature space available in total, don’t know why :)
Considering there is feature combination, if generate all the feature offline and store the combination in file, it will be huge. So the feature processing is online fashion.
Learner
One very interesting design is: learner is composable.
Learning in VW is just a set of functions following same interface.
struct func_data
{ using fn = void(*)(void* data);
void* data;
base_learner* base;
fn func;
};
inline func_data tuple_dbf(void* data, base_learner* base, void (*func)(void*))
{ func_data foo;
foo.data = data;
foo.base = base;
foo.func = func;
return foo;
}
struct learn_data
{ using fn = void(*)(void* data, base_learner& base, void* ex);
using multi_fn = void(*)(void* data, base_learner& base, void* ex, size_t count, size_t step, polyprediction*pred, bool finalize_predictions);
void* data;
base_learner* base;
fn learn_f;
fn predict_f;
fn update_f;
multi_fn multipredict_f;
};
VW use a struct of function pointers to represent all the functionality of a learner. This is also a very interesting design.
So basically, no too much classes in VW.
In the struct learn_data, you can find a base learner. In this way, complex learned can be composed using basic simple learner. This is fascinating.
That’s all the learning!!!
Written with StackEdit.
没有评论:
发表评论