DedicatedResources class final
A description of resources that are dedicated to a DeployedModel or DeployedIndex, and that need a higher degree of manual configuration.
- Inheritance
-
- Object
- ProtoMessage
- DedicatedResources
Constructors
-
DedicatedResources({required MachineSpec? machineSpec, required int minReplicaCount, int maxReplicaCount = 0, int requiredReplicaCount = 0, int initialReplicaCount = 0, List<AutoscalingMetricSpec> autoscalingMetricSpecs = const [], bool spot = false, FlexStart? flexStart, DedicatedResources_ScaleToZeroSpec? scaleToZeroSpec}) -
DedicatedResources.fromJson(Map<String, dynamic> json) -
factory
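The named-parameter call shape of the default constructor can be sketched as follows. This is a minimal, self-contained illustration rather than the package's own code: the two classes are hypothetical stand-ins that mirror only part of the signature listed above, the `machineType` field on `MachineSpec` is an assumption (this page does not document it), and `exampleResources` is a helper invented for the example.

```dart
// Hypothetical stand-ins: they copy only a subset of the documented
// constructor parameters so that this sketch runs on its own.
class MachineSpec {
  // Assumed field name; MachineSpec is not documented on this page.
  final String machineType;
  const MachineSpec({this.machineType = ''});
}

class DedicatedResources {
  final MachineSpec? machineSpec;
  final int minReplicaCount;
  final int maxReplicaCount;
  final bool spot;

  const DedicatedResources({
    required this.machineSpec,
    required this.minReplicaCount,
    this.maxReplicaCount = 0,
    this.spot = false,
  });
}

// Illustrative helper: one machine type, scaling between 1 and 3 replicas.
DedicatedResources exampleResources() => const DedicatedResources(
      machineSpec: MachineSpec(machineType: 'n1-standard-4'),
      minReplicaCount: 1,
      maxReplicaCount: 3,
    );

void main() {
  final resources = exampleResources();
  print('min=${resources.minReplicaCount} '
      'max=${resources.maxReplicaCount} spot=${resources.spot}');
}
```

Only `machineSpec` and `minReplicaCount` are marked `required`; every other parameter falls back to the default shown in the signature.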
Properties
- autoscalingMetricSpecs → List<AutoscalingMetricSpec>
-
Immutable. The metric specifications that override the target value of a
resource utilization metric (CPU utilization, accelerator duty cycle, and
so on). The target value defaults to 60 if not set. At most one entry is
allowed per metric.
final
- flexStart → FlexStart?
-
Optional. Immutable. If set, uses DWS (Dynamic Workload Scheduler)
resources to schedule the deployment workload. Reference:
https://cloud.google.com/blog/products/compute/introducing-dynamic-workload-scheduler
final
- hashCode → int
-
The hash code for this object.
no setter inherited
- initialReplicaCount → int
-
Immutable. The number of replicas initially deployed when scaling the
workload up from zero, or when creating the workload if
min_replica_count = 0. When min_replica_count
final
- machineSpec → MachineSpec?
-
Required. Immutable. The specification of a single machine being used.
final
- maxReplicaCount → int
-
Immutable. The maximum number of replicas that may be deployed when
traffic increases. If the requested value is too large, the
deployment will error, but if deployment succeeds then the ability to scale
to that many replicas is guaranteed (barring service outages). If traffic
increases beyond what the maximum number of replicas can handle, a portion
of the traffic will be dropped. If this value is not provided,
min_replica_count is used as the default value.
final
- minReplicaCount → int
-
Required. Immutable. The minimum number of machine replicas that will
always be deployed. This value must be greater than or equal to 1.
final
- qualifiedName → String
-
The fully qualified name of this message, for example
google.protobuf.Duration or google.rpc.ErrorInfo.
final inherited
- requiredReplicaCount → int
-
Optional. Number of required available replicas for the deployment to
succeed. This field is only needed when partial deployment/mutation is
desired. If set, the deploy/mutate operation will succeed once
available_replica_count reaches required_replica_count, and the rest of
the replicas will be retried. If not set, the default
required_replica_count will be min_replica_count.
final
- runtimeType → Type
-
A representation of the runtime type of the object.
no setter inherited
- scaleToZeroSpec → DedicatedResources_ScaleToZeroSpec?
-
Optional. Specification for scale-to-zero feature.
final
- spot → bool
-
Optional. If true, schedule the deployment workload on spot
VMs.
final
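The fallback rules described for maxReplicaCount and requiredReplicaCount can be sketched as two small functions. This is an illustration under a stated assumption, not the service's implementation: it treats the constructor default of 0 as "not provided", which this page does not say explicitly, and both function names are hypothetical.

```dart
/// Replica ceiling to assume: falls back to [minReplicaCount] when
/// [maxReplicaCount] is 0 (assumed here to mean "not provided").
int effectiveMaxReplicaCount(int minReplicaCount, int maxReplicaCount) =>
    maxReplicaCount == 0 ? minReplicaCount : maxReplicaCount;

/// Replicas that must be available for a deployment to succeed: falls back
/// to [minReplicaCount] when [requiredReplicaCount] is 0 (assumed "not set").
int effectiveRequiredReplicaCount(
        int minReplicaCount, int requiredReplicaCount) =>
    requiredReplicaCount == 0 ? minReplicaCount : requiredReplicaCount;

void main() {
  print(effectiveMaxReplicaCount(2, 0)); // 2: falls back to the minimum
  print(effectiveMaxReplicaCount(2, 5)); // 5: explicit value wins
  print(effectiveRequiredReplicaCount(2, 0)); // 2: falls back to the minimum
  print(effectiveRequiredReplicaCount(4, 3)); // 3: partial deployment allowed
}
```

Under these rules, a configuration that sets only minReplicaCount behaves as a fixed-size deployment, since both the ceiling and the required count resolve to the minimum.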
Methods
-
noSuchMethod(Invocation invocation) → dynamic -
Invoked when a nonexistent method or property is accessed.
inherited
-
toJson() → Object -
override
-
toString() → String -
A string representation of this object.
override
Operators
-
operator ==(Object other) → bool -
The equality operator.
inherited
Constants
- fullyQualifiedName → const String