Define a custom init_run()
function#
You can set up a custom run initialization function by wrapping neptune.init_run()
. This way, you can automatically populate required fields and tags each time a run is created. It also ensures that the names of namespaces and fields are the same across all runs, making it easier to find and compare them.
This approach can be especially helpful when multiple people collaborate on the same project. It also frees up developers from having to add these fields to their code and remembering the field names.
Below is an example of a custom function and its usage in the model-training script.
Show init_run()
parameters list
See in API reference: neptune.init_run()
Name | Type | Default | Description |
---|---|---|---|
project |
str , optional |
None |
Name of a project in the form workspace-name/project-name . If None , the value of the NEPTUNE_PROJECT environment variable is used. |
api_token |
str , optional |
None |
Your Neptune API token (or a service account's API token). If None , the value of the NEPTUNE_API_TOKEN environment variable is used.To keep your token secure, avoid placing it in source code. Instead, save it as an environment variable. |
with_id |
str , optional |
None |
The Neptune identifier of an existing run to resume, such as "CLS-11". The identifier is stored in the object's sys/id field. If omitted or None is passed, a new tracked run is created. |
custom_run_id |
str , optional |
None |
A unique identifier that can be used to log metadata to a single run from multiple locations. Max length: 36 characters. If None and the NEPTUNE_CUSTOM_RUN_ID environment variable is set, Neptune will use that as the custom_run_id value. For details, see Set custom run ID. |
mode |
str , optional |
async |
Connection mode in which the logging will work. Possible values are async , sync , offline , read-only , and debug .If you leave it out, the value of the |
name |
str , optional |
Neptune ID | Custom name for the run. You can use it as a human-readable ID and add it as a column in the experiments table (sys/name ). If left empty, once the run is synchronized with the server, Neptune sets the auto-generated identifier (sys/id ) as the name. |
description |
str , optional |
"" |
Editable description of the run. You can add it as a column in the experiments table (sys/description ). |
tags |
list , optional |
[] |
Must be a list of str which represent the tags for the run. You can edit them after run is created, either in the run information or experiments table. |
source_files |
list or str , optional |
None |
List of source files to be uploaded. Must be list of If Unix style pathname pattern expansion is supported. For example, you can pass |
capture_stdout |
Boolean , optional |
True |
Whether to log the standard output stream. Is logged in the monitoring namespace. |
capture_stderr |
Boolean , optional |
True |
Whether to log the standard error stream. Is logged in the monitoring namespace. |
capture_hardware_metrics |
Boolean , optional |
True |
Whether to track hardware consumption (CPU, GPU, memory utilization). Logged in the monitoring namespace. |
fail_on_exception |
Boolean , optional |
True |
If an uncaught exception occurs, whether to set run's Failed state to True . |
monitoring_namespace |
str , optional |
"monitoring" |
Namespace inside which all monitoring logs will be stored. |
flush_period |
float , optional |
5 (seconds) |
In asynchronous (default) connection mode, how often Neptune should trigger disk flushing. |
proxies |
dict , optional |
None |
Argument passed to HTTP calls made via the Requests library. For details on proxies, see the Requests documentation. |
capture_traceback |
Boolean , optional |
True |
In case of an exception, whether to log the traceback of the run. |
git_ref |
GitRef or Boolean |
None |
GitRef object containing information about the Git repository path.If To specify a different location, set to To turn off Git tracking for the run, set to |
dependencies |
str , optional |
None |
Tracks environment requirements. If you pass "infer" to this argument, Neptune logs dependencies installed in the current environment. You can also pass a path to your dependency file directly. If left empty, no dependency file is uploaded. |
async_lag_callback |
NeptuneObjectCallback , optional |
None |
Custom callback function which is called if the lag between a queued operation and its synchronization with the server exceeds the duration defined by async_lag_threshold . The callback should take a Run object as the argument and can contain any custom code, such as calling stop() on the object.Note: Instead of using this argument, you can use Neptune's default callback by setting the |
async_lag_threshold |
float , optional |
1800.0 (seconds) |
Duration between the queueing and synchronization of an operation. If a lag callback (default callback enabled via environment variable or custom callback passed to the async_lag_callback argument) is enabled, the callback is called when this duration is exceeded. |
async_no_progress_callback |
NeptuneObjectCallback , optional |
None |
Custom callback function which is called if there has been no synchronization progress whatsoever for the duration defined by async_no_progress_threshold . The callback should take a Run object as the argument and can contain any custom code, such as calling stop() on the object.Note: Instead of using this argument, you can use Neptune's default callback by setting the |
async_no_progress_threshold |
float , optional |
300.0 (seconds) |
For how long there has been no synchronization progress. If a no-progress callback (default callback enabled via environment variable or custom callback passed to the async_no_progress_callback argument) is enabled, the callback is called when this duration is exceeded. |
from datetime import datetime
import neptune
def custom_init_run(
objective: str = "baseline",
fields: dict = None,
tags: list = None,
**kwargs,
) -> neptune.Run:
"""Creates a Neptune run and populates it with predefined fields and metadata.
Parameters:
objective: Objective of the experiment.
fields: A dictionary with key-value pairs corresponding to
run fields and their values.
tags: Tags to be assigned to the Neptune run.
**kwargs: Additional keyword arguments passed to `neptune.init_run()`.
Returns:
A Neptune run object. You can access it for logging of further metadata.
"""
custom_name = f"{datetime.today().strftime('%Y%m%d')}-{objective}"
run = neptune.init_run(
name=custom_name, # (1)!
tags=tags, # (2)!
**kwargs,
)
# Define mandatory fields and assign them to the run
fields.update(
{"mandatory_field": "value"},
)
run["prepopulated_fields"] = fields
return run
-
Sets a custom name, which you can use as a human-friendly ID.
To display it in the app, add
sys/name
as a column.You can also edit the name in the run information view ( → Run information).
-
Tags applied this way are stored in the
sys/tags
field and can later be modified in the app.
# Create a new run with necessary fields already populated
custom_run = custom_init_run(
objective="high_outliers",
tags=["tag1", "tag2", "tag3"],
fields={"sample_metric": 42, "sample_text": "lorem ipsum"},
)
# You can use "custom_run" as you would use a regular Neptune run object
custom_run["namespace/subnamespace/field"] = "some metadata"
The resulting run structure would be:
run root
|-- namespace
|-- subnamespace
|-- field (String): some metadata
|-- prepopulated_fields
|-- mandatory_field (String): value
|-- sample_metric (Int): 42
|-- sample_text (String): lorem ipsum
|-- sys
|-- name (String): 20240116-high_outliers
|-- tags (StringSet): {tag1, tag2, tag3}
You can learn more about Neptune field types in the API reference: Field types and methods →
For more ideas, check out the following:
- What Neptune logs automatically – unless disabled, system information and metrics are logged by default in the background.
- What you can log and display – a comprehensive overview of different metadata types that you can log.
- Best practices
- Onboarding guide for teams