Core Concepts
All You Need to Know
You do not need to be an expert in AI research to develop or deploy ML models. NatML focuses on making ML deployment as painless as possible for interactive media developers. Before jumping in, It is crucial to understand a few core concepts, and how they interact with one another:
An ML model is a 'black box' which consumes one or more input features and predicts one or more output features. For example, a vision classifier receives an image feature and produces a probability distribution feature, telling you how likely the image is one label or another.
Under the hood, an ML model is a computation graph. This graph is what empowers ML models with the ability to learn and make predictions on new data.
At NatML, we prefer to refer to ML models as graphs. It's pedantic, sure, but avoids any ambiguity.
These are models that run predictions on the local device. They are exposed with the
MLEdgeModel
class.NatML supports working with CoreML (
.mlmodel
), ONNX (.onnx
), and TensorFlow Lite (.tflite
) edge graphs.A feature is any data that can be consumed or produced by an
MLModel
. For example, you will use a lot of image features when working with vision models. NatML has built-in support for common features that might be used with ML models, including Texture2D
and WebCamTexture
instances:// Some common features include:
float[] arrayFeature = ...;
Texture2D imageFeature = ...;
WebCamTexture webCamFeature = ...;
AudioClip audioFeature = ...;
Every
MLFeature
has a corresponding MLFeatureType
. This type describes the feature and data that is contained within it. Similarly, every MLModel
has a set of input and output feature types, describing what data the model can consume and produce, respectively.Predictors are lightweight primitives that use one or more models to make predictions on features. They are self-contained units that know how to transform inputs into a format that a model expects. But more importantly, they are able to transform outputs of a model into a usable format. For example, you might have a predictor that uses the MobileNet ML model to classify images:
// Create the MobileNet v2 predictor
var predictor = await MobileNetv2Predictor.Create(model);
Whereas a raw classification model outputs a probability distribution, the classification predictor can transform this distribution into a form which is much more usable by developers. It simply returns a class label (
string
) along with a classification score (float
):var (label, confidence) = predictor.Predict(...);
Debug.Log($"Model predicted {label} with confidence {confidence}");
Last modified 3mo ago