truefoundry.ml
module truefoundry.ml
Global Variables
- TRACKING_HOST_GLOBAL
function get_client
get_client
Initializes and returns the truefoundry client.
Args:
disable_analytics (bool, optional): To turn off usage analytics collection, pass True. By default, this is set to False.
Returns:
MlFoundry: An instance of the MlFoundry client class.
Examples:
- Get client
import truefoundry.ml as tfm
client = tfm.get_client()
class MlFoundry
MlFoundry
function create_data_directory
create_data_directory
Creates a DataDirectory to upload files to.
Args:
ml_repo (str): Name of the ML Repo in which you want to create the data_directory.
name (str): Name of the DataDirectory to be created.
description (str): Description of the dataset.
metadata (Dict[str, Any]): Metadata about the data_directory, in dictionary form.
Returns:
DataDirectory: An instance of class DataDirectory
Examples:
import truefoundry.ml as tfm
client = tfm.get_client()
data_directory = client.create_data_directory(name="<data_directory-name>", ml_repo="<repo-name>")
print(data_directory.fqn)
function create_ml_repo
create_ml_repo
Creates an ML Repository.
Args:
name (str, optional): The name of the Repository you want to create. If not given, a name is generated automatically.
storage_integration_fqn (str): The storage integration FQN to use for the experiment for saving artifacts.
Examples:
- Create Repository
import truefoundry.ml as tfm
client = tfm.get_client()
ml_repo = client.create_ml_repo(name="my-repo", storage_integration_fqn="truefoundry:example:name:blob-storage:blob")
function create_run
create_run
Initialize a run.
In a machine learning experiment, a run represents a single experiment conducted under a project.
Args:
ml_repo (str): The name of the project under which the run will be created. ml_repo should only contain alphanumerics (a-z, A-Z, 0-9) or hyphen (-). The user must have ADMIN or WRITE access to this project.
run_name (Optional[str], optional): The name of the run. If not passed, a randomly generated name is assigned to the run. Under a project, all runs should have a unique name. If the passed run_name is already used under a project, the run_name will be de-duplicated by adding a suffix. The run name should only contain alphanumerics (a-z, A-Z, 0-9) or hyphen (-).
tags (Optional[Dict[str, Any]], optional): Optional tags to attach with this run. Tags are key-value pairs.
kwargs:
Returns:
MlFoundryRun: An instance of the MlFoundryRun class, which represents a run.
Examples:
- Create a run under current user.
import truefoundry.ml as tfm
client = tfm.get_client()
tags = {"model_type": "svm"}
run = client.create_run(
ml_repo="my-classification-project", run_name="svm-with-rbf-kernel", tags=tags
)
run.end()
- Creating a run using context manager.
import truefoundry.ml as tfm
client = tfm.get_client()
with client.create_run(
    ml_repo="my-classification-project", run_name="svm-with-rbf-kernel"
) as run:
    # ...
    # Model training code
    ...
# `run` will be automatically marked as `FINISHED` or `FAILED`.
- Create a run in a project owned by a different user.
import truefoundry.ml as tfm
client = tfm.get_client()
tags = {"model_type": "svm"}
run = client.create_run(
ml_repo="my-classification-project",
run_name="svm-with-rbf-kernel",
tags=tags,
)
run.end()
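The naming rules described for ml_repo and run_name can be checked locally before calling create_run. The sketch below is only an illustration: `is_valid_name` and `dedupe_name` are hypothetical helpers, not part of truefoundry.ml, and the numeric suffix format is an assumption, since the reference only states that a suffix is added.

```python
import re

# Allowed characters per the reference: a-z, A-Z, 0-9 and hyphen.
_NAME_PATTERN = re.compile(r"[A-Za-z0-9-]+")


def is_valid_name(name: str) -> bool:
    """Return True if `name` uses only alphanumerics and hyphens."""
    return _NAME_PATTERN.fullmatch(name) is not None


def dedupe_name(name: str, existing: set) -> str:
    """Hypothetical de-duplication: append -1, -2, ... until the name is unused.

    The suffix actually chosen by the service is not documented here.
    """
    if name not in existing:
        return name
    counter = 1
    while f"{name}-{counter}" in existing:
        counter += 1
    return f"{name}-{counter}"


print(is_valid_name("svm-with-rbf-kernel"))
print(is_valid_name("svm with rbf"))
print(dedupe_name("svm", {"svm", "svm-1"}))
```

Validating names on the client side gives an early, readable error instead of a failed request.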
function get_all_runs
get_all_runs
Returns the names and IDs of all runs under a project.
The user must have READ access to the project.
Args:
ml_repo (str): Name of the project.
Returns:
pd.DataFrame: A DataFrame with two columns, run_id and run_name.
Examples:
- Get all the runs from an ml_repo
import truefoundry.ml as tfm
client = tfm.get_client()
runs = client.get_all_runs(ml_repo='my-repo')
function get_artifact_version
get_artifact_version
Get the artifact version to download contents or load it in memory
Args:
ml_repo (str): ML Repo to which the artifact is logged
artifact_name (str): Artifact Name
artifact_type (str): The type of artifact to fetch (acceptable values: "artifact", "model", "plot", "image")
version (str | int): Artifact Version to fetch (default is the latest version)
Returns:
ArtifactVersion: An ArtifactVersion instance of the artifact
Examples:
import tempfile
import truefoundry.ml as tfm
client = tfm.get_client()
model_version = client.get_artifact_version(ml_repo="ml-repo-name", name="artifact-name", version=1)
# load the model into memory
clf = model_version.load()
# download the model to disk
temp = tempfile.TemporaryDirectory()
download_info = model_version.download(path=temp.name)
print(download_info)
function get_artifact_version_by_fqn
get_artifact_version_by_fqn
Get the artifact version to download contents
Args:
fqn (str): Fully qualified name of the artifact version.
Returns:
ArtifactVersion: An ArtifactVersion instance of the artifact
Examples:
import tempfile
import truefoundry.ml as tfm
client = tfm.get_client()
artifact_version = client.get_artifact_version_by_fqn(
fqn="artifact:truefoundry/my-classification-project/sklearn-artifact:1"
)
# download the artifact to disk
temp = tempfile.TemporaryDirectory()
download_info = artifact_version.download(path=temp.name)
print(download_info)
function get_data_directory
get_data_directory
Get an existing data_directory by name.
Args:
ml_repo (str): Name of the project that the data directory belongs to.
name (str): The name of the data directory.
Returns:
DataDirectory: An instance of class DataDirectory
Examples:
import truefoundry.ml as tfm
client = tfm.get_client()
data_directory = client.get_data_directory(ml_repo='my-repo', name="<data-directory-name>")
with open("artifact.txt", "w") as f:
    f.write("hello-world")
data_directory.add_files(
    artifact_paths=[tfm.DataDirectoryPath('artifact.txt', 'a/b/')]
)
# print the paths of files and folders in the data_directory
for file in data_directory.list_files():
    print(file.path)
function get_data_directory_by_fqn
get_data_directory_by_fqn
Get the DataDirectory by DataDirectory FQN
Args:
fqn (str): Fully qualified name of the data directory.
Returns:
DataDirectory: An instance of class DataDirectory
Examples:
import truefoundry.ml as tfm
client = tfm.get_client()
data_directory = client.get_data_directory_by_fqn(fqn="<data-dir-fqn>")
with open("artifact.txt", "w") as f:
    f.write("hello-world")
data_directory.add_files(
    artifact_paths=[tfm.DataDirectoryPath('artifact.txt', 'a/b/')]
)
# print the paths of files and folders in the data_directory
for file in data_directory.list_files():
    print(file.path)
function get_data_directory_by_id
get_data_directory_by_id
Get the DataDirectory From the DataDirectory ID
Args:
id (uuid.UUID): ID of the data_directory.
Returns:
DataDirectory: An instance of class DataDirectory
Examples:
import truefoundry.ml as tfm
client = tfm.get_client()
data_directory = client.get_data_directory_by_id(id="<data_directory-id>")
with open("artifact.txt", "w") as f:
    f.write("hello-world")
data_directory.add_files(
    artifact_paths=[tfm.DataDirectoryPath('artifact.txt', 'a/b/')]
)
# print the paths of files and folders in the data_directory
for file in data_directory.list_files():
    print(file.path)
function get_model_version
get_model_version
Get the model version to download contents or load it in memory
Args:
ml_repo (str): ML Repo to which the model is logged
name (str): Model Name
version (str | int): Model Version to fetch (default is the latest version)
Returns:
ModelVersion: The ModelVersion instance of the model.
Examples:
- Sklearn
# See `truefoundry.ml.mlfoundry_api.MlFoundry.log_model` examples to understand model logging
import os
import tempfile
import joblib
import truefoundry.ml as tfm
client = tfm.get_client()
model_version = client.get_model_version(
ml_repo="my-classification-project",
name="my-sklearn-model",
version=1
)
# Download the model to disk
temp = tempfile.TemporaryDirectory()
download_info = model_version.download(path=temp.name)
print(download_info.model_dir, download_info.model_filename)
# Deserialize and Load
model = joblib.load(
os.path.join(download_info.model_dir, download_info.model_filename)
)
- Huggingface Transformers
# See `truefoundry.ml.mlfoundry_api.MlFoundry.log_model` examples to understand model logging
import tempfile
import torch
from transformers import pipeline
import truefoundry.ml as tfm
client = tfm.get_client()
model_version = client.get_model_version(
ml_repo="my-llm-project",
name="my-transformers-model",
version=1
)
# Download the model to disk
temp = tempfile.TemporaryDirectory()
download_info = model_version.download(path=temp.name)
print(download_info.model_dir)
# Deserialize and Load
pln = pipeline("text-generation", model=download_info.model_dir, torch_dtype=torch.float16)
function get_model_version_by_fqn
get_model_version_by_fqn
Get the model version to download contents or load it in memory
Args:
fqn (str): Fully qualified name of the model version.
Returns:
ModelVersion: The ModelVersion instance of the model.
Examples:
- Sklearn
# See `truefoundry.ml.mlfoundry_api.MlFoundry.log_model` examples to understand model logging
import os
import tempfile
import joblib
import truefoundry.ml as tfm
client = tfm.get_client()
model_version = client.get_model_version_by_fqn(
fqn="model:truefoundry/my-classification-project/my-sklearn-model:1"
)
# Download the model to disk
temp = tempfile.TemporaryDirectory()
download_info = model_version.download(path=temp.name)
print(download_info.model_dir, download_info.model_filename)
# Deserialize and Load
model = joblib.load(
os.path.join(download_info.model_dir, download_info.model_filename)
)
- Huggingface Transformers
# See `truefoundry.ml.mlfoundry_api.MlFoundry.log_model` examples to understand model logging
import tempfile
import torch
from transformers import pipeline
import truefoundry.ml as tfm
client = tfm.get_client()
model_version = client.get_model_version_by_fqn(
fqn="model:truefoundry/my-llm-project/my-transformers-model:1"
)
# Download the model to disk
temp = tempfile.TemporaryDirectory()
download_info = model_version.download(path=temp.name)
print(download_info.model_dir)
# Deserialize and Load
pln = pipeline("text-generation", model=download_info.model_dir, torch_dtype=torch.float16)
function get_run_by_fqn
get_run_by_fqn
Get an existing run by fqn.
fqn stands for Fully Qualified Name. A run fqn has the following pattern: tenant_name/ml_repo/run_name
If a run svm is under the project cat-classifier in the truefoundry tenant, the fqn will be truefoundry/cat-classifier/svm.
Args:
run_fqn (str): fqn of an existing run.
Returns:
MlFoundryRun: An instance of the MlFoundryRun class, which represents a run.
Examples:
- get run by run fqn
import truefoundry.ml as tfm
client = tfm.get_client()
run = client.get_run_by_fqn(run_fqn='truefoundry/my-repo/svm')
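To make the tenant_name/ml_repo/run_name pattern concrete, here is a minimal sketch that splits and rebuilds a run fqn. `RunFqn`, `parse_run_fqn`, and `build_run_fqn` are hypothetical names introduced for illustration; they are not part of truefoundry.ml, and the validation shown is an assumption rather than the library's own check.

```python
from typing import NamedTuple


class RunFqn(NamedTuple):
    """The three components of a run fqn, in the documented order."""
    tenant_name: str
    ml_repo: str
    run_name: str


def parse_run_fqn(fqn: str) -> RunFqn:
    """Split 'tenant_name/ml_repo/run_name' into its parts."""
    parts = fqn.split("/")
    if len(parts) != 3 or not all(parts):
        raise ValueError(f"expected tenant_name/ml_repo/run_name, got {fqn!r}")
    return RunFqn(*parts)


def build_run_fqn(tenant_name: str, ml_repo: str, run_name: str) -> str:
    """Join the three parts back into a run fqn."""
    return "/".join((tenant_name, ml_repo, run_name))


parsed = parse_run_fqn("truefoundry/cat-classifier/svm")
print(parsed.ml_repo, parsed.run_name)
```

Because ml_repo and run names are restricted to alphanumerics and hyphens, splitting on "/" is unambiguous for names that follow those rules.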
function get_run_by_id
get_run_by_id
Get an existing run by the run_id.
Args:
run_id (str): run_id or fqn of an existing run.
Returns:
MlFoundryRun: An instance of the MlFoundryRun class, which represents a run.
Examples:
- Get run by the run id
import truefoundry.ml as tfm
client = tfm.get_client()
run = client.get_run_by_id(run_id='a8f6dafd70aa4baf9437a33c52d7ee90')
function get_run_by_name
get_run_by_name
Get an existing run by run_name.
Args:
ml_repo (str): Name of the ml_repo that the run belongs to.
run_name (str): The name of the run to fetch.
Returns:
MlFoundryRun: An instance of the MlFoundryRun class, which represents a run.
Examples:
- get run by name
import truefoundry.ml as tfm
client = tfm.get_client()
run = client.get_run_by_name(run_name='svm', ml_repo='my-repo')
function get_tracking_uri
get_tracking_uri
Get the current tracking URI.
Returns:
The tracking URI.
Examples:
import truefoundry.ml as tfm
client = tfm.get_client()
tracking_uri = client.get_tracking_uri()
print("Current tracking uri: {}".format(tracking_uri))
function list_artifact_versions
list_artifact_versions
Get all the versions of an artifact to download contents or load them in memory
Args:
ml_repo (str): Repository in which the artifact is stored.
name (str): Name of the artifact whose versions are required.
artifact_type (ArtifactType): Type of artifact you want, for example model, image, etc.
Returns:
Iterator[ArtifactVersion]: An iterator that yields non-deleted artifact versions of an artifact under a given ml_repo, sorted by version number in descending order.
Examples:
import truefoundry.ml as tfm
client = tfm.get_client()
artifact_versions = client.list_artifact_versions(ml_repo="my-repo", name="artifact-name")
for artifact_version in artifact_versions:
    print(artifact_version)
function list_artifact_versions_by_fqn
list_artifact_versions_by_fqn
List versions for a given artifact
Args:
artifact_fqn: FQN of the Artifact to list versions for. An artifact_fqn looks like {artifact_type}:{org}/{user}/{project}/{artifact_name} or {artifact_type}:{user}/{project}/{artifact_name}, where artifact_type can be one of ("model", "image", "plot").
Returns:
Iterator[ArtifactVersion]: An iterator that yields non-deleted artifact versions under the given artifact_fqn, sorted by version number in descending order.
Yields:
ArtifactVersion: An instance of truefoundry.ml.ArtifactVersion
Examples:
import truefoundry.ml as tfm
tfm.login(tracking_uri="https://your.truefoundry.site.com")
client = tfm.get_client()
artifact_fqn = "artifact:org/my-project/my-artifact"
for av in client.list_artifact_versions_by_fqn(artifact_fqn=artifact_fqn):
    print(av.name, av.version, av.description)
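The two artifact_fqn shapes described in the Args can be told apart by counting path components. The following is a minimal sketch under that assumption; `ArtifactFqn` and `parse_artifact_fqn` are hypothetical helpers, not library API, and the field names simply mirror the placeholders in the documented patterns.

```python
from typing import NamedTuple, Optional


class ArtifactFqn(NamedTuple):
    """Components of an artifact fqn; `org` is None for the short form."""
    artifact_type: str
    org: Optional[str]
    user: str
    project: str
    artifact_name: str


def parse_artifact_fqn(fqn: str) -> ArtifactFqn:
    """Parse '{type}:{org}/{user}/{project}/{name}' or '{type}:{user}/{project}/{name}'."""
    artifact_type, sep, path = fqn.partition(":")
    if not sep or not artifact_type:
        raise ValueError(f"missing '<artifact_type>:' prefix in {fqn!r}")
    parts = path.split("/")
    if len(parts) not in (3, 4) or not all(parts):
        raise ValueError(f"expected 3 or 4 path components in {fqn!r}")
    if len(parts) == 3:
        user, project, artifact_name = parts
        return ArtifactFqn(artifact_type, None, user, project, artifact_name)
    org, user, project, artifact_name = parts
    return ArtifactFqn(artifact_type, org, user, project, artifact_name)


print(parse_artifact_fqn("artifact:org/my-project/my-artifact"))
```

This sketch handles artifact FQNs only; version FQNs carry an extra `:<version>` suffix and would need separate handling.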
function list_data_directories
list_data_directories
Get the list of data directories in an ml_repo
Args:
ml_repo (str): Name of the ML Repository.
max_results (int): Maximum number of data directories to list.
offset (int): Number of DataDirectory instances to skip before returning results.
Returns:
DataDirectory: An instance of class DataDirectory
Examples:
import truefoundry.ml as tfm
client = tfm.get_client()
data_directories = client.list_data_directories(ml_repo="<ml-repo-name>")
for data_directory in data_directories:
    print(data_directory.name)
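The interaction of max_results and offset described in the Args is the usual skip-then-limit pagination. The sketch below illustrates those semantics on a plain list; `paginate` is a hypothetical helper and the directory names are made-up sample data, so this shows the idea rather than how the service implements it.

```python
from itertools import islice
from typing import Iterable, Iterator, Optional, TypeVar

T = TypeVar("T")


def paginate(
    items: Iterable[T], max_results: Optional[int] = None, offset: int = 0
) -> Iterator[T]:
    """Skip `offset` items, then yield at most `max_results` items."""
    stop = None if max_results is None else offset + max_results
    return islice(items, offset, stop)


# Made-up names standing in for the data directories of an ML Repo.
names = [f"data-dir-{i}" for i in range(1, 8)]
print(list(paginate(names, max_results=3, offset=2)))
```

Requesting successive pages then amounts to increasing offset by max_results on each call.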
function list_ml_repos
list_ml_repos
Returns a list of names of ML Repos accessible by the current user.
Returns:
List[str]: A list of names of ML Repos
function list_model_versions
list_model_versions
Get all the versions of a model to download contents or load them in memory
Args:
ml_repo (str): Repository in which the model is stored.
name (str): Name of the model whose versions are required.
Returns:
Iterator[ModelVersion]: An iterator that yields non-deleted model versions of a model under a given ml_repo, sorted by version number in descending order.
Examples:
import truefoundry.ml as tfm
client = tfm.get_client()
model_versions = client.list_model_versions(ml_repo="my-repo", name="svm")
for model_version in model_versions:
    print(model_version)
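Because versions are yielded in descending order of version number, the first item is the latest version. The sketch below reproduces that ordering on stand-in objects; `VersionStub`, `newest_first`, and `latest` are hypothetical and only mimic the `name` and `version` attributes used in the examples on this page.

```python
from dataclasses import dataclass
from typing import Iterable, Iterator, Optional


@dataclass(frozen=True)
class VersionStub:
    """Hypothetical stand-in for a ModelVersion; only the fields used here."""
    name: str
    version: int


def newest_first(versions: Iterable[VersionStub]) -> Iterator[VersionStub]:
    """Yield versions sorted by version number, highest first."""
    return iter(sorted(versions, key=lambda v: v.version, reverse=True))


def latest(versions: Iterable[VersionStub]) -> Optional[VersionStub]:
    """Return the highest version, or None when there are no versions."""
    return next(newest_first(versions), None)


stubs = [VersionStub("svm", 2), VersionStub("svm", 10), VersionStub("svm", 1)]
print([v.version for v in newest_first(stubs)])
print(latest(stubs))
```

Sorting on the integer version avoids the lexicographic trap where "10" would sort before "2".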
function list_model_versions_by_fqn
list_model_versions_by_fqn
List versions for a given model
Args:
model_fqn: FQN of the Model to list versions for. A model_fqn looks like model:{org}/{user}/{project}/{artifact_name} or model:{user}/{project}/{artifact_name}
Returns:
Iterator[ModelVersion]: An iterator that yields non-deleted model versions under the given model_fqn, sorted by version number in descending order.
Yields:
ModelVersion: An instance of truefoundry.ml.ModelVersion
Examples:
import truefoundry.ml as tfm
tfm.login(tracking_uri="https://your.truefoundry.site.com")
client = tfm.get_client()
model_fqn = "model:org/my-project/my-model"
for mv in client.list_model_versions_by_fqn(model_fqn=model_fqn):
    print(mv.name, mv.version, mv.description)
function log_artifact
log_artifact
Logs an artifact for the current ml_repo.
An artifact is a list of local files and directories. This function packs the files and directories listed in artifact_paths and uploads them to the remote storage linked to the ml_repo.
Args:
ml_repo (str): Name of the ML Repo to which an artifact is to be logged.
name (str): Name of the Artifact. If an artifact with this name already exists under the current ml_repo, the logged artifact will be added as a new version under that name. If no artifact exists with the given name, the given artifact will be logged as version 1.
artifact_paths (List[truefoundry.ml.ArtifactPath], optional): A list of pairs of (source path, destination path) to add files and folders to the artifact version contents. The first member of the pair should be a file or directory path and the second member should be the path inside the artifact contents to upload to.
E.g.
client.log_artifact(
    ml_repo="sample-repo",
    name="xyz",
    artifact_paths=[
        tfm.ArtifactPath("foo.txt", "foo/bar/foo.txt"),
        tfm.ArtifactPath("tokenizer/", "foo/tokenizer/"),
        tfm.ArtifactPath('bar.text'),
        ('bar.txt', ),
        ('foo.txt', 'a/foo.txt')
    ]
)
would result in
.
└── foo/
    ├── bar/
    │   └── foo.txt
    └── tokenizer/
        └── # contents of tokenizer/ directory will be uploaded here
description (Optional[str], optional): Arbitrary text up to 1024 characters to store as description. This field can be updated at any time after logging. Defaults to None.
metadata (Optional[Dict[str, Any]], optional): Arbitrary JSON-serializable dictionary to store metadata. For example, you can use this to store metrics, params, notes. This field can be updated at any time after logging. Defaults to None.
Returns:
truefoundry.ml.ArtifactVersion: An instance of ArtifactVersion that can be used to download the files, or update attributes like description, metadata.
Examples:
import os
import truefoundry.ml as tfm
with open("artifact.txt", "w") as f:
    f.write("hello-world")
client = tfm.get_client()
ml_repo = "sample-repo"
client.create_ml_repo(name=ml_repo)
client.log_artifact(
ml_repo=ml_repo,
name="hello-world-file",
artifact_paths=[tfm.ArtifactPath('artifact.txt', 'a/b/')]
)
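To reason about where each (source path, destination path) pair from artifact_paths ends up inside the artifact, the mapping can be sketched without touching the service. `planned_destination` is a hypothetical helper, and its three rules are assumptions chosen to match the examples on this page; the library's actual resolution logic may differ.

```python
import posixpath
from typing import Optional


def planned_destination(src: str, dest: Optional[str] = None) -> str:
    """Return the path a source would occupy inside the artifact.

    Assumed rules (not confirmed by the library reference):
    - no destination: place the source at the artifact root under its own name
    - destination ending in "/": treat the destination as a directory
    - otherwise: use the destination as the exact path
    """
    is_dir = src.endswith("/")
    base = posixpath.basename(src.rstrip("/"))
    if dest is None:
        return base + ("/" if is_dir else "")
    if dest.endswith("/"):
        # A directory source maps onto the destination directory itself.
        return dest if is_dir else posixpath.join(dest, base)
    return dest


pairs = [
    ("foo.txt", "foo/bar/foo.txt"),
    ("tokenizer/", "foo/tokenizer/"),
    ("bar.txt", None),
    ("artifact.txt", "a/b/"),
]
for src, dest in pairs:
    print(src, "->", planned_destination(src, dest))
```

Printing the planned layout before uploading makes it easy to spot a missing trailing slash, which under these assumed rules changes a directory destination into an exact file path.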
function log_model
log_model
Serialize and log a versioned model under the current ml_repo. Each logged model generates a new version associated with the given name and linked to the current run. Multiple versions of the model can be logged as separate versions under the same name.
Args:
ml_repo (str): Name of the ML Repo to which the model is to be logged.
name (str): Name of the model. If a model with this name already exists under the current ML Repo, the logged model will be added as a new version under that name. If no models exist with the given name, the given model will be logged as version 1.
model_file_or_folder (str): Path to either a single file or a folder containing model files. This folder is usually created using serialization methods of libraries or frameworks, e.g. joblib.dump, model.save_pretrained(...), torch.save(...), model.save(...).
framework (Union[enums.ModelFramework, str]): Model Framework, e.g. pytorch, sklearn, tensorflow. The full list of supported frameworks can be found in truefoundry.ml.ModelFramework. Can also be None when model is None.
description (Optional[str], optional): Arbitrary text up to 1024 characters to store as description. This field can be updated at any time after logging. Defaults to None.
metadata (Optional[Dict[str, Any]], optional): Arbitrary JSON-serializable dictionary to store metadata. For example, you can use this to store metrics, params, notes. This field can be updated at any time after logging. Defaults to None.
Returns:
truefoundry.ml.ModelVersion: An instance of ModelVersion that can be used to download the files, load the model, or update attributes like description, metadata, schema.
Examples:
- Sklearn
import truefoundry.ml as tfm
from truefoundry.ml import ModelFramework
import joblib
import numpy as np
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC
X = np.array([[-1, -1], [-2, -1], [1, 1], [2, 1]])
y = np.array([1, 1, 2, 2])
clf = make_pipeline(StandardScaler(), SVC(gamma='auto'))
clf.fit(X, y)
joblib.dump(clf, "sklearn-pipeline.joblib")
client = tfm.get_client()
client.create_ml_repo( # This is only required once
name="my-classification-project",
# This controls which bucket is used.
# You can get this from Integrations > Blob Storage. `None` picks the default
storage_integration_fqn=None
)
model_version = client.log_model(
ml_repo="my-classification-project",
name="my-sklearn-model",
model_file_or_folder="sklearn-pipeline.joblib",
framework=ModelFramework.SKLEARN,
metadata={"accuracy": 0.99, "f1": 0.80},
step=1, # step number, useful when using iterative algorithms like SGD
)
print(model_version.fqn)
- Huggingface Transformers
import truefoundry.ml as tfm
from truefoundry.ml import ModelFramework
import torch
from transformers import AutoTokenizer, AutoConfig, pipeline, AutoModelForCausalLM
pln = pipeline(
"text-generation",
model="EleutherAI/pythia-70m",
tokenizer="EleutherAI/pythia-70m",
torch_dtype=torch.float16
)
pln.model.save_pretrained("my-transformers-model")
pln.tokenizer.save_pretrained("my-transformers-model")
client = tfm.get_client()
client.create_ml_repo( # This is only required once
name="my-llm-project",
# This controls which bucket is used.
# You can get this from Integrations > Blob Storage. `None` picks the default
storage_integration_fqn=None
)
model_version = client.log_model(
ml_repo="my-llm-project",
name="my-transformers-model",
model_file_or_folder="my-transformers-model/",
framework=ModelFramework.TRANSFORMERS
)
print(model_version.fqn)
function search_runs
search_runs
Returns an iterator that yields one MlFoundryRun on each next call. All runs under the project that match the filter string and the run_view_type are returned. The user must have READ access to the project.
Args:
ml_repo (str): Name of the project.
filter_string (str, optional): Filter query string, defaults to searching all runs. An identifier is required on the LHS of a search expression; it signifies the entity to compare against. An identifier has two parts separated by a period: the type of the entity and the name of the entity. The type of the entity is metrics, params, attributes, or tags. The entity name can contain alphanumeric characters and special characters. You can search using two run attributes: status and artifact_uri. Both attributes have string values. When a metric, parameter, or tag name contains a special character like hyphen, space, period, and so on, enclose the entity name in double quotes or backticks, e.g. params."model-type" or params.`model-type`.
run_view_type (str, optional): One of "ACTIVE_ONLY", "DELETED_ONLY", or "ALL".
order_by (List[str], optional): List of columns to order by (e.g., "metrics.rmse"). Currently supported values are metric.key, parameter.key, tag.key, attribute.key. The order_by column can contain an optional DESC or ASC value. The default is ASC. The default ordering is to sort by start_time DESC.
job_run_name (str): Name of the job run; only the runs associated with it are returned.
max_results (int): Maximum number of runs to yield.
Returns:
Iterator[MlFoundryRun]: MlFoundryRun instances matching the search query.
Examples:
import truefoundry.ml as tfm
client = tfm.get_client()
with client.create_run(ml_repo="my-project", run_name="run-1") as run1:
    run1.log_metrics(metric_dict={"accuracy": 0.74, "loss": 0.6})
    run1.log_params({"model": "LogisticRegression", "lambda": "0.001"})
with client.create_run(ml_repo="my-project", run_name="run-2") as run2:
    run2.log_metrics(metric_dict={"accuracy": 0.8, "loss": 0.4})
    run2.log_params({"model": "SVM"})
# Search for the subset of runs with logged accuracy metric greater than 0.75
filter_string = "metrics.accuracy > 0.75"
runs = client.search_runs(ml_repo="my-project", filter_string=filter_string)
# Search for the subset of runs with logged accuracy metric greater than 0.7
filter_string = "metrics.accuracy > 0.7"
runs = client.search_runs(ml_repo="my-project", filter_string=filter_string)
# Search for the subset of runs with logged accuracy metric greater than 0.7 and model="LogisticRegression"
filter_string = "metrics.accuracy > 0.7 and params.model = 'LogisticRegression'"
runs = client.search_runs(ml_repo="my-project", filter_string=filter_string)
# Search for the subset of runs with logged accuracy metric greater than 0.7 and
# order by accuracy in Descending order
filter_string = "metrics.accuracy > 0.7"
order_by = ["metric.accuracy DESC"]
runs = client.search_runs(
ml_repo="my-project", filter_string=filter_string, order_by=order_by
)
filter_string = "metrics.accuracy > 0.7"
runs = client.search_runs(
ml_repo="transformers", order_by=order_by, job_run_name='job_run_name', filter_string=filter_string
)
order_by = ["metric.accuracy DESC"]
runs = client.search_runs(
ml_repo="my-project", filter_string=filter_string, order_by=order_by, max_results=10
)
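The identifier and quoting rules for filter_string are easy to get wrong by hand, so a small builder can help. This is a minimal sketch: `identifier` and `clause` are hypothetical helpers, not part of truefoundry.ml, and treating underscores as safe unquoted characters is an assumption based on the artifact_uri attribute name.

```python
import re

# Entity types named in the filter_string description.
_ENTITY_TYPES = {"metrics", "params", "attributes", "tags"}
# Names made only of these characters are assumed to need no quoting.
_SIMPLE_NAME = re.compile(r"[A-Za-z0-9_]+")


def identifier(entity_type: str, name: str) -> str:
    """Build '<type>.<name>', double-quoting names with special characters."""
    if entity_type not in _ENTITY_TYPES:
        raise ValueError(f"unknown entity type: {entity_type!r}")
    if _SIMPLE_NAME.fullmatch(name):
        return f"{entity_type}.{name}"
    return f'{entity_type}."{name}"'


def clause(entity_type: str, name: str, op: str, value) -> str:
    """Build one comparison; string values are wrapped in single quotes."""
    rendered = f"'{value}'" if isinstance(value, str) else str(value)
    return f"{identifier(entity_type, name)} {op} {rendered}"


filter_string = " and ".join(
    [
        clause("metrics", "accuracy", ">", 0.7),
        clause("params", "model", "=", "LogisticRegression"),
    ]
)
print(filter_string)
print(identifier("params", "model-type"))
```

The resulting string can be passed as filter_string to search_runs. The sketch does not escape quotes inside values, so values containing a single quote would need extra handling.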