blob: a082d939f1c2f50f1a1816aa6550f10ccbe39ae1 [file] [log] [blame]
Sai Cheemalapati4ba8c232017-06-06 18:46:08 -04001<html><body>
2<style>
3
4body, h1, h2, h3, div, span, p, pre, a {
5 margin: 0;
6 padding: 0;
7 border: 0;
8 font-weight: inherit;
9 font-style: inherit;
10 font-size: 100%;
11 font-family: inherit;
12 vertical-align: baseline;
13}
14
15body {
16 font-size: 13px;
17 padding: 1em;
18}
19
20h1 {
21 font-size: 26px;
22 margin-bottom: 1em;
23}
24
25h2 {
26 font-size: 24px;
27 margin-bottom: 1em;
28}
29
30h3 {
31 font-size: 20px;
32 margin-bottom: 1em;
33 margin-top: 1em;
34}
35
36pre, code {
37 line-height: 1.5;
38 font-family: Monaco, 'DejaVu Sans Mono', 'Bitstream Vera Sans Mono', 'Lucida Console', monospace;
39}
40
41pre {
42 margin-top: 0.5em;
43}
44
45h1, h2, h3, p {
46 font-family: Arial, sans serif;
47}
48
49h1, h2, h3 {
50 border-bottom: solid #CCC 1px;
51}
52
53.toc_element {
54 margin-top: 0.5em;
55}
56
57.firstline {
58 margin-left: 2 em;
59}
60
61.method {
62 margin-top: 1em;
63 border: solid 1px #CCC;
64 padding: 1em;
65 background: #EEE;
66}
67
68.details {
69 font-weight: bold;
70 font-size: 14px;
71}
72
73</style>
74
Bu Sun Kim715bd7f2019-06-14 16:50:42 -070075<h1><a href="dataflow_v1b3.html">Dataflow API</a> . <a href="dataflow_v1b3.projects.html">projects</a> . <a href="dataflow_v1b3.projects.locations.html">locations</a> . <a href="dataflow_v1b3.projects.locations.templates.html">templates</a></h1>
Sai Cheemalapati4ba8c232017-06-06 18:46:08 -040076<h2>Instance Methods</h2>
77<p class="toc_element">
Dan O'Mearadd494642020-05-01 07:42:23 -070078 <code><a href="#create">create(projectId, location, body=None, x__xgafv=None)</a></code></p>
Sai Cheemalapati4ba8c232017-06-06 18:46:08 -040079<p class="firstline">Creates a Cloud Dataflow job from a template.</p>
80<p class="toc_element">
Bu Sun Kim65020912020-05-20 12:08:20 -070081 <code><a href="#get">get(projectId, location, view=None, gcsPath=None, x__xgafv=None)</a></code></p>
Sai Cheemalapati4ba8c232017-06-06 18:46:08 -040082<p class="firstline">Get the template associated with a template.</p>
83<p class="toc_element">
Bu Sun Kim4ed7d3f2020-05-27 12:20:54 -070084 <code><a href="#launch">launch(projectId, location, body=None, validateOnly=None, gcsPath=None, dynamicTemplate_gcsPath=None, dynamicTemplate_stagingLocation=None, x__xgafv=None)</a></code></p>
Sai Cheemalapati4ba8c232017-06-06 18:46:08 -040085<p class="firstline">Launch a template.</p>
86<h3>Method Details</h3>
87<div class="method">
Dan O'Mearadd494642020-05-01 07:42:23 -070088 <code class="details" id="create">create(projectId, location, body=None, x__xgafv=None)</code>
Sai Cheemalapati4ba8c232017-06-06 18:46:08 -040089 <pre>Creates a Cloud Dataflow job from a template.
90
91Args:
92 projectId: string, Required. The ID of the Cloud Platform project that the job belongs to. (required)
Bu Sun Kim715bd7f2019-06-14 16:50:42 -070093 location: string, The [regional endpoint]
94(https://cloud.google.com/dataflow/docs/concepts/regional-endpoints) to
95which to direct the request. (required)
Dan O'Mearadd494642020-05-01 07:42:23 -070096 body: object, The request body.
Sai Cheemalapati4ba8c232017-06-06 18:46:08 -040097 The object takes the form of:
98
99{ # A request to create a Cloud Dataflow job from a template.
Bu Sun Kim4ed7d3f2020-05-27 12:20:54 -0700100 &quot;jobName&quot;: &quot;A String&quot;, # Required. The job name to use for the created job.
101 &quot;gcsPath&quot;: &quot;A String&quot;, # Required. A Cloud Storage path to the template from which to
102 # create the job.
103 # Must be a valid Cloud Storage URL, beginning with `gs://`.
Bu Sun Kim65020912020-05-20 12:08:20 -0700104 &quot;environment&quot;: { # The environment values to set at runtime. # The runtime environment for the job.
Bu Sun Kim4ed7d3f2020-05-27 12:20:54 -0700105 &quot;machineType&quot;: &quot;A String&quot;, # The machine type to use for the job. Defaults to the value from the
106 # template if not specified.
107 &quot;subnetwork&quot;: &quot;A String&quot;, # Subnetwork to which VMs will be assigned, if desired. Expected to be of
108 # the form &quot;regions/REGION/subnetworks/SUBNETWORK&quot;.
109 &quot;ipConfiguration&quot;: &quot;A String&quot;, # Configuration for VM IPs.
110 &quot;kmsKeyName&quot;: &quot;A String&quot;, # Optional. Name for the Cloud KMS key for the job.
111 # Key format is:
112 # projects/&lt;project&gt;/locations/&lt;location&gt;/keyRings/&lt;keyring&gt;/cryptoKeys/&lt;key&gt;
113 &quot;tempLocation&quot;: &quot;A String&quot;, # The Cloud Storage path to use for temporary files.
114 # Must be a valid Cloud Storage URL, beginning with `gs://`.
115 &quot;bypassTempDirValidation&quot;: True or False, # Whether to bypass the safety checks for the job&#x27;s temporary directory.
116 # Use with caution.
117 &quot;network&quot;: &quot;A String&quot;, # Network to which VMs will be assigned. If empty or unspecified,
118 # the service will use the network &quot;default&quot;.
Bu Sun Kim65020912020-05-20 12:08:20 -0700119 &quot;workerRegion&quot;: &quot;A String&quot;, # The Compute Engine region
Dan O'Mearadd494642020-05-01 07:42:23 -0700120 # (https://cloud.google.com/compute/docs/regions-zones/regions-zones) in
Bu Sun Kim65020912020-05-20 12:08:20 -0700121 # which worker processing should occur, e.g. &quot;us-west1&quot;. Mutually exclusive
Dan O'Mearadd494642020-05-01 07:42:23 -0700122 # with worker_zone. If neither worker_region nor worker_zone is specified,
Bu Sun Kim65020912020-05-20 12:08:20 -0700123 # default to the control plane&#x27;s region.
Bu Sun Kim65020912020-05-20 12:08:20 -0700124 &quot;zone&quot;: &quot;A String&quot;, # The Compute Engine [availability
Sai Cheemalapati4ba8c232017-06-06 18:46:08 -0400125 # zone](https://cloud.google.com/compute/docs/regions-zones/regions-zones)
126 # for launching worker instances to run your pipeline.
Dan O'Mearadd494642020-05-01 07:42:23 -0700127 # In the future, worker_zone will take precedence.
Bu Sun Kim4ed7d3f2020-05-27 12:20:54 -0700128 &quot;numWorkers&quot;: 42, # The initial number of Google Compute Engine instnaces for the job.
Bu Sun Kim65020912020-05-20 12:08:20 -0700129 &quot;workerZone&quot;: &quot;A String&quot;, # The Compute Engine zone
130 # (https://cloud.google.com/compute/docs/regions-zones/regions-zones) in
131 # which worker processing should occur, e.g. &quot;us-west1-a&quot;. Mutually exclusive
132 # with worker_region. If neither worker_region nor worker_zone is specified,
133 # a zone in the control plane&#x27;s region is chosen based on available capacity.
134 # If both `worker_zone` and `zone` are set, `worker_zone` takes precedence.
135 &quot;additionalUserLabels&quot;: { # Additional user labels to be specified for the job.
Bu Sun Kim715bd7f2019-06-14 16:50:42 -0700136 # Keys and values should follow the restrictions specified in the [labeling
137 # restrictions](https://cloud.google.com/compute/docs/labeling-resources#restrictions)
138 # page.
Bu Sun Kim65020912020-05-20 12:08:20 -0700139 &quot;a_key&quot;: &quot;A String&quot;,
Bu Sun Kim715bd7f2019-06-14 16:50:42 -0700140 },
Bu Sun Kim65020912020-05-20 12:08:20 -0700141 &quot;additionalExperiments&quot;: [ # Additional experiment flags for the job.
142 &quot;A String&quot;,
Bu Sun Kim715bd7f2019-06-14 16:50:42 -0700143 ],
Bu Sun Kim65020912020-05-20 12:08:20 -0700144 &quot;maxWorkers&quot;: 42, # The maximum number of Google Compute Engine instances to be made
145 # available to your pipeline during execution, from 1 to 1000.
146 &quot;serviceAccountEmail&quot;: &quot;A String&quot;, # The email address of the service account to run the job as.
Sai Cheemalapati4ba8c232017-06-06 18:46:08 -0400147 },
Bu Sun Kim65020912020-05-20 12:08:20 -0700148 &quot;location&quot;: &quot;A String&quot;, # The [regional endpoint]
Bu Sun Kim715bd7f2019-06-14 16:50:42 -0700149 # (https://cloud.google.com/dataflow/docs/concepts/regional-endpoints) to
150 # which to direct the request.
Bu Sun Kim65020912020-05-20 12:08:20 -0700151 &quot;parameters&quot;: { # The runtime parameters to pass to the job.
152 &quot;a_key&quot;: &quot;A String&quot;,
Sai Cheemalapati4ba8c232017-06-06 18:46:08 -0400153 },
Sai Cheemalapati4ba8c232017-06-06 18:46:08 -0400154 }
155
156 x__xgafv: string, V1 error format.
157 Allowed values
158 1 - v1 error format
159 2 - v2 error format
160
161Returns:
162 An object of the form:
163
164 { # Defines a job to be run by the Cloud Dataflow service.
Bu Sun Kim65020912020-05-20 12:08:20 -0700165 &quot;clientRequestId&quot;: &quot;A String&quot;, # The client&#x27;s unique identifier of the job, re-used across retried attempts.
166 # If this field is set, the service will ensure its uniqueness.
167 # The request to create a job will fail if the service has knowledge of a
168 # previously submitted job with the same client&#x27;s ID and job name.
169 # The caller may use this field to ensure idempotence of job
170 # creation across retried attempts to create a job.
171 # By default, the field is empty and, in that case, the service ignores it.
172 &quot;id&quot;: &quot;A String&quot;, # The unique ID of this job.
Bu Sun Kim715bd7f2019-06-14 16:50:42 -0700173 #
174 # This field is set by the Cloud Dataflow service when the Job is
175 # created, and is immutable for the life of the job.
Bu Sun Kim65020912020-05-20 12:08:20 -0700176 &quot;currentStateTime&quot;: &quot;A String&quot;, # The timestamp associated with the current state.
177 &quot;transformNameMapping&quot;: { # The map of transform name prefixes of the job to be replaced to the
Bu Sun Kim715bd7f2019-06-14 16:50:42 -0700178 # corresponding name prefixes of the new job.
Bu Sun Kim65020912020-05-20 12:08:20 -0700179 &quot;a_key&quot;: &quot;A String&quot;,
Bu Sun Kim715bd7f2019-06-14 16:50:42 -0700180 },
Bu Sun Kim65020912020-05-20 12:08:20 -0700181 &quot;environment&quot;: { # Describes the environment in which a Dataflow Job runs. # The environment for the job.
Bu Sun Kim65020912020-05-20 12:08:20 -0700182 &quot;internalExperiments&quot;: { # Experimental settings.
183 &quot;a_key&quot;: &quot;&quot;, # Properties of the object. Contains field @type with type URL.
184 },
185 &quot;workerRegion&quot;: &quot;A String&quot;, # The Compute Engine region
186 # (https://cloud.google.com/compute/docs/regions-zones/regions-zones) in
187 # which worker processing should occur, e.g. &quot;us-west1&quot;. Mutually exclusive
188 # with worker_zone. If neither worker_region nor worker_zone is specified,
189 # default to the control plane&#x27;s region.
190 &quot;serviceKmsKeyName&quot;: &quot;A String&quot;, # If set, contains the Cloud KMS key identifier used to encrypt data
191 # at rest, AKA a Customer Managed Encryption Key (CMEK).
192 #
193 # Format:
194 # projects/PROJECT_ID/locations/LOCATION/keyRings/KEY_RING/cryptoKeys/KEY
195 &quot;userAgent&quot;: { # A description of the process that generated the request.
196 &quot;a_key&quot;: &quot;&quot;, # Properties of the object.
197 },
198 &quot;workerZone&quot;: &quot;A String&quot;, # The Compute Engine zone
199 # (https://cloud.google.com/compute/docs/regions-zones/regions-zones) in
200 # which worker processing should occur, e.g. &quot;us-west1-a&quot;. Mutually exclusive
201 # with worker_region. If neither worker_region nor worker_zone is specified,
202 # a zone in the control plane&#x27;s region is chosen based on available capacity.
203 &quot;clusterManagerApiService&quot;: &quot;A String&quot;, # The type of cluster manager API to use. If unknown or
Dan O'Mearadd494642020-05-01 07:42:23 -0700204 # unspecified, the service will attempt to choose a reasonable
205 # default. This should be in the form of the API service name,
Bu Sun Kim65020912020-05-20 12:08:20 -0700206 # e.g. &quot;compute.googleapis.com&quot;.
207 &quot;tempStoragePrefix&quot;: &quot;A String&quot;, # The prefix of the resources the system should use for temporary
208 # storage. The system will append the suffix &quot;/temp-{JOBNAME} to
Bu Sun Kim715bd7f2019-06-14 16:50:42 -0700209 # this resource prefix, where {JOBNAME} is the value of the
210 # job_name field. The resulting bucket and object prefix is used
211 # as the prefix of the resources used to store temporary data
212 # needed during the job execution. NOTE: This will override the
213 # value in taskrunner_settings.
214 # The supported resource type is:
Sai Cheemalapati4ba8c232017-06-06 18:46:08 -0400215 #
Bu Sun Kim715bd7f2019-06-14 16:50:42 -0700216 # Google Cloud Storage:
Sai Cheemalapati4ba8c232017-06-06 18:46:08 -0400217 #
Bu Sun Kim715bd7f2019-06-14 16:50:42 -0700218 # storage.googleapis.com/{bucket}/{object}
219 # bucket.storage.googleapis.com/{object}
Bu Sun Kim65020912020-05-20 12:08:20 -0700220 &quot;experiments&quot;: [ # The list of experiments to enable.
221 &quot;A String&quot;,
222 ],
223 &quot;version&quot;: { # A structure describing which components and their versions of the service
224 # are required in order to run the job.
225 &quot;a_key&quot;: &quot;&quot;, # Properties of the object.
226 },
227 &quot;serviceAccountEmail&quot;: &quot;A String&quot;, # Identity to run virtual machines as. Defaults to the default account.
Bu Sun Kim4ed7d3f2020-05-27 12:20:54 -0700228 &quot;sdkPipelineOptions&quot;: { # The Cloud Dataflow SDK pipeline options specified by the user. These
229 # options are passed through the service and are used to recreate the
230 # SDK pipeline options on the worker in a language agnostic and platform
231 # independent way.
232 &quot;a_key&quot;: &quot;&quot;, # Properties of the object.
233 },
234 &quot;flexResourceSchedulingGoal&quot;: &quot;A String&quot;, # Which Flexible Resource Scheduling mode to run in.
235 &quot;workerPools&quot;: [ # The worker pools. At least one &quot;harness&quot; worker pool must be
236 # specified in order for the job to have workers.
237 { # Describes one particular pool of Cloud Dataflow workers to be
238 # instantiated by the Cloud Dataflow service in order to perform the
239 # computations required by a job. Note that a workflow job may use
240 # multiple pools, in order to match the various computational
241 # requirements of the various stages of the job.
242 &quot;numThreadsPerWorker&quot;: 42, # The number of threads per worker harness. If empty or unspecified, the
243 # service will choose a number of threads (according to the number of cores
244 # on the selected machine type for batch, or 1 by convention for streaming).
245 &quot;numWorkers&quot;: 42, # Number of Google Compute Engine workers in this pool needed to
246 # execute the job. If zero or unspecified, the service will
247 # attempt to choose a reasonable default.
248 &quot;zone&quot;: &quot;A String&quot;, # Zone to run the worker pools in. If empty or unspecified, the service
249 # will attempt to choose a reasonable default.
250 &quot;diskSourceImage&quot;: &quot;A String&quot;, # Fully qualified source image for disks.
251 &quot;packages&quot;: [ # Packages to be installed on workers.
252 { # The packages that must be installed in order for a worker to run the
253 # steps of the Cloud Dataflow job that will be assigned to its worker
254 # pool.
255 #
256 # This is the mechanism by which the Cloud Dataflow SDK causes code to
257 # be loaded onto the workers. For example, the Cloud Dataflow Java SDK
258 # might use this to install jars containing the user&#x27;s code and all of the
259 # various dependencies (libraries, data files, etc.) required in order
260 # for that code to run.
261 &quot;name&quot;: &quot;A String&quot;, # The name of the package.
262 &quot;location&quot;: &quot;A String&quot;, # The resource to read the package from. The supported resource type is:
263 #
264 # Google Cloud Storage:
265 #
266 # storage.googleapis.com/{bucket}
267 # bucket.storage.googleapis.com/
268 },
269 ],
270 &quot;teardownPolicy&quot;: &quot;A String&quot;, # Sets the policy for determining when to turndown worker pool.
271 # Allowed values are: `TEARDOWN_ALWAYS`, `TEARDOWN_ON_SUCCESS`, and
272 # `TEARDOWN_NEVER`.
273 # `TEARDOWN_ALWAYS` means workers are always torn down regardless of whether
274 # the job succeeds. `TEARDOWN_ON_SUCCESS` means workers are torn down
275 # if the job succeeds. `TEARDOWN_NEVER` means the workers are never torn
276 # down.
277 #
278 # If the workers are not torn down by the service, they will
279 # continue to run and use Google Compute Engine VM resources in the
280 # user&#x27;s project until they are explicitly terminated by the user.
281 # Because of this, Google recommends using the `TEARDOWN_ALWAYS`
282 # policy except for small, manually supervised test jobs.
283 #
284 # If unknown or unspecified, the service will attempt to choose a reasonable
285 # default.
286 &quot;onHostMaintenance&quot;: &quot;A String&quot;, # The action to take on host maintenance, as defined by the Google
287 # Compute Engine API.
288 &quot;poolArgs&quot;: { # Extra arguments for this worker pool.
289 &quot;a_key&quot;: &quot;&quot;, # Properties of the object. Contains field @type with type URL.
290 },
291 &quot;diskSizeGb&quot;: 42, # Size of root disk for VMs, in GB. If zero or unspecified, the service will
292 # attempt to choose a reasonable default.
293 &quot;workerHarnessContainerImage&quot;: &quot;A String&quot;, # Required. Docker container image that executes the Cloud Dataflow worker
294 # harness, residing in Google Container Registry.
295 #
296 # Deprecated for the Fn API path. Use sdk_harness_container_images instead.
297 &quot;diskType&quot;: &quot;A String&quot;, # Type of root disk for VMs. If empty or unspecified, the service will
298 # attempt to choose a reasonable default.
299 &quot;machineType&quot;: &quot;A String&quot;, # Machine type (e.g. &quot;n1-standard-1&quot;). If empty or unspecified, the
300 # service will attempt to choose a reasonable default.
301 &quot;kind&quot;: &quot;A String&quot;, # The kind of the worker pool; currently only `harness` and `shuffle`
302 # are supported.
303 &quot;sdkHarnessContainerImages&quot;: [ # Set of SDK harness containers needed to execute this pipeline. This will
304 # only be set in the Fn API path. For non-cross-language pipelines this
305 # should have only one entry. Cross-language pipelines will have two or more
306 # entries.
307 { # Defines a SDK harness container for executing Dataflow pipelines.
308 &quot;containerImage&quot;: &quot;A String&quot;, # A docker container image that resides in Google Container Registry.
309 &quot;useSingleCorePerContainer&quot;: True or False, # If true, recommends the Dataflow service to use only one core per SDK
310 # container instance with this image. If false (or unset) recommends using
311 # more than one core per SDK container instance with this image for
312 # efficiency. Note that Dataflow service may choose to override this property
313 # if needed.
314 },
315 ],
316 &quot;dataDisks&quot;: [ # Data disks that are used by a VM in this workflow.
317 { # Describes the data disk used by a workflow job.
318 &quot;diskType&quot;: &quot;A String&quot;, # Disk storage type, as defined by Google Compute Engine. This
319 # must be a disk type appropriate to the project and zone in which
320 # the workers will run. If unknown or unspecified, the service
321 # will attempt to choose a reasonable default.
322 #
323 # For example, the standard persistent disk type is a resource name
324 # typically ending in &quot;pd-standard&quot;. If SSD persistent disks are
325 # available, the resource name typically ends with &quot;pd-ssd&quot;. The
326 # actual valid values are defined the Google Compute Engine API,
327 # not by the Cloud Dataflow API; consult the Google Compute Engine
328 # documentation for more information about determining the set of
329 # available disk types for a particular project and zone.
330 #
331 # Google Compute Engine Disk types are local to a particular
332 # project in a particular zone, and so the resource name will
333 # typically look something like this:
334 #
335 # compute.googleapis.com/projects/project-id/zones/zone/diskTypes/pd-standard
336 &quot;sizeGb&quot;: 42, # Size of disk in GB. If zero or unspecified, the service will
337 # attempt to choose a reasonable default.
338 &quot;mountPoint&quot;: &quot;A String&quot;, # Directory in a VM where disk is mounted.
339 },
340 ],
341 &quot;subnetwork&quot;: &quot;A String&quot;, # Subnetwork to which VMs will be assigned, if desired. Expected to be of
342 # the form &quot;regions/REGION/subnetworks/SUBNETWORK&quot;.
343 &quot;ipConfiguration&quot;: &quot;A String&quot;, # Configuration for VM IPs.
344 &quot;taskrunnerSettings&quot;: { # Taskrunner configuration settings. # Settings passed through to Google Compute Engine workers when
345 # using the standard Dataflow task runner. Users should ignore
346 # this field.
347 &quot;alsologtostderr&quot;: True or False, # Whether to also send taskrunner log info to stderr.
348 &quot;taskGroup&quot;: &quot;A String&quot;, # The UNIX group ID on the worker VM to use for tasks launched by
349 # taskrunner; e.g. &quot;wheel&quot;.
350 &quot;harnessCommand&quot;: &quot;A String&quot;, # The command to launch the worker harness.
351 &quot;logDir&quot;: &quot;A String&quot;, # The directory on the VM to store logs.
352 &quot;oauthScopes&quot;: [ # The OAuth2 scopes to be requested by the taskrunner in order to
353 # access the Cloud Dataflow API.
354 &quot;A String&quot;,
355 ],
356 &quot;dataflowApiVersion&quot;: &quot;A String&quot;, # The API version of endpoint, e.g. &quot;v1b3&quot;
357 &quot;logUploadLocation&quot;: &quot;A String&quot;, # Indicates where to put logs. If this is not specified, the logs
358 # will not be uploaded.
359 #
360 # The supported resource type is:
361 #
362 # Google Cloud Storage:
363 # storage.googleapis.com/{bucket}/{object}
364 # bucket.storage.googleapis.com/{object}
365 &quot;streamingWorkerMainClass&quot;: &quot;A String&quot;, # The streaming worker main class name.
366 &quot;workflowFileName&quot;: &quot;A String&quot;, # The file to store the workflow in.
367 &quot;languageHint&quot;: &quot;A String&quot;, # The suggested backend language.
368 &quot;commandlinesFileName&quot;: &quot;A String&quot;, # The file to store preprocessing commands in.
369 &quot;baseTaskDir&quot;: &quot;A String&quot;, # The location on the worker for task-specific subdirectories.
370 &quot;tempStoragePrefix&quot;: &quot;A String&quot;, # The prefix of the resources the taskrunner should use for
371 # temporary storage.
372 #
373 # The supported resource type is:
374 #
375 # Google Cloud Storage:
376 # storage.googleapis.com/{bucket}/{object}
377 # bucket.storage.googleapis.com/{object}
378 &quot;baseUrl&quot;: &quot;A String&quot;, # The base URL for the taskrunner to use when accessing Google Cloud APIs.
379 #
380 # When workers access Google Cloud APIs, they logically do so via
381 # relative URLs. If this field is specified, it supplies the base
382 # URL to use for resolving these relative URLs. The normative
383 # algorithm used is defined by RFC 1808, &quot;Relative Uniform Resource
384 # Locators&quot;.
385 #
386 # If not specified, the default value is &quot;http://www.googleapis.com/&quot;
387 &quot;logToSerialconsole&quot;: True or False, # Whether to send taskrunner log info to Google Compute Engine VM serial
388 # console.
389 &quot;continueOnException&quot;: True or False, # Whether to continue taskrunner if an exception is hit.
390 &quot;parallelWorkerSettings&quot;: { # Provides data to pass through to the worker harness. # The settings to pass to the parallel worker harness.
391 &quot;tempStoragePrefix&quot;: &quot;A String&quot;, # The prefix of the resources the system should use for temporary
392 # storage.
393 #
394 # The supported resource type is:
395 #
396 # Google Cloud Storage:
397 #
398 # storage.googleapis.com/{bucket}/{object}
399 # bucket.storage.googleapis.com/{object}
400 &quot;reportingEnabled&quot;: True or False, # Whether to send work progress updates to the service.
401 &quot;baseUrl&quot;: &quot;A String&quot;, # The base URL for accessing Google Cloud APIs.
402 #
403 # When workers access Google Cloud APIs, they logically do so via
404 # relative URLs. If this field is specified, it supplies the base
405 # URL to use for resolving these relative URLs. The normative
406 # algorithm used is defined by RFC 1808, &quot;Relative Uniform Resource
407 # Locators&quot;.
408 #
409 # If not specified, the default value is &quot;http://www.googleapis.com/&quot;
410 &quot;servicePath&quot;: &quot;A String&quot;, # The Cloud Dataflow service path relative to the root URL, for example,
411 # &quot;dataflow/v1b3/projects&quot;.
412 &quot;shuffleServicePath&quot;: &quot;A String&quot;, # The Shuffle service path relative to the root URL, for example,
413 # &quot;shuffle/v1beta1&quot;.
414 &quot;workerId&quot;: &quot;A String&quot;, # The ID of the worker running this pipeline.
415 },
416 &quot;taskUser&quot;: &quot;A String&quot;, # The UNIX user ID on the worker VM to use for tasks launched by
417 # taskrunner; e.g. &quot;root&quot;.
418 &quot;vmId&quot;: &quot;A String&quot;, # The ID string of the VM.
419 },
420 &quot;autoscalingSettings&quot;: { # Settings for WorkerPool autoscaling. # Settings for autoscaling of this WorkerPool.
421 &quot;algorithm&quot;: &quot;A String&quot;, # The algorithm to use for autoscaling.
422 &quot;maxNumWorkers&quot;: 42, # The maximum number of workers to cap scaling at.
423 },
424 &quot;metadata&quot;: { # Metadata to set on the Google Compute Engine VMs.
425 &quot;a_key&quot;: &quot;A String&quot;,
426 },
427 &quot;defaultPackageSet&quot;: &quot;A String&quot;, # The default package set to install. This allows the service to
428 # select a default set of packages which are useful to worker
429 # harnesses written in a particular language.
430 &quot;network&quot;: &quot;A String&quot;, # Network to which VMs will be assigned. If empty or unspecified,
431 # the service will use the network &quot;default&quot;.
432 },
433 ],
434 &quot;dataset&quot;: &quot;A String&quot;, # The dataset for the current project where various workflow
435 # related tables are stored.
436 #
437 # The supported resource type is:
438 #
439 # Google BigQuery:
440 # bigquery.googleapis.com/{dataset}
Bu Sun Kim715bd7f2019-06-14 16:50:42 -0700441 },
Bu Sun Kim65020912020-05-20 12:08:20 -0700442 &quot;stageStates&quot;: [ # This field may be mutated by the Cloud Dataflow service;
443 # callers cannot mutate it.
444 { # A message describing the state of a particular execution stage.
Bu Sun Kim65020912020-05-20 12:08:20 -0700445 &quot;currentStateTime&quot;: &quot;A String&quot;, # The time at which the stage transitioned to this state.
446 &quot;executionStageState&quot;: &quot;A String&quot;, # Executions stage states allow the same set of values as JobState.
Bu Sun Kim4ed7d3f2020-05-27 12:20:54 -0700447 &quot;executionStageName&quot;: &quot;A String&quot;, # The name of the execution stage.
Bu Sun Kim65020912020-05-20 12:08:20 -0700448 },
449 ],
450 &quot;jobMetadata&quot;: { # Metadata available primarily for filtering jobs. Will be included in the # This field is populated by the Dataflow service to support filtering jobs
451 # by the metadata values provided here. Populated for ListJobs and all GetJob
452 # views SUMMARY and higher.
453 # ListJob response and Job SUMMARY view.
Bu Sun Kim65020912020-05-20 12:08:20 -0700454 &quot;datastoreDetails&quot;: [ # Identification of a Datastore source used in the Dataflow job.
455 { # Metadata for a Datastore connector used by the job.
Bu Sun Kim65020912020-05-20 12:08:20 -0700456 &quot;namespace&quot;: &quot;A String&quot;, # Namespace used in the connection.
Bu Sun Kim4ed7d3f2020-05-27 12:20:54 -0700457 &quot;projectId&quot;: &quot;A String&quot;, # ProjectId accessed in the connection.
Bu Sun Kim65020912020-05-20 12:08:20 -0700458 },
459 ],
460 &quot;sdkVersion&quot;: { # The version of the SDK used to run the job. # The SDK version used to run the job.
Bu Sun Kim65020912020-05-20 12:08:20 -0700461 &quot;version&quot;: &quot;A String&quot;, # The version of the SDK used to run the job.
Bu Sun Kim4ed7d3f2020-05-27 12:20:54 -0700462 &quot;sdkSupportStatus&quot;: &quot;A String&quot;, # The support status for this SDK version.
463 &quot;versionDisplayName&quot;: &quot;A String&quot;, # A readable string describing the version of the SDK.
Bu Sun Kim65020912020-05-20 12:08:20 -0700464 },
465 &quot;bigqueryDetails&quot;: [ # Identification of a BigQuery source used in the Dataflow job.
466 { # Metadata for a BigQuery connector used by the job.
467 &quot;table&quot;: &quot;A String&quot;, # Table accessed in the connection.
468 &quot;dataset&quot;: &quot;A String&quot;, # Dataset accessed in the connection.
Bu Sun Kim65020912020-05-20 12:08:20 -0700469 &quot;query&quot;: &quot;A String&quot;, # Query used to access data in the connection.
Bu Sun Kim4ed7d3f2020-05-27 12:20:54 -0700470 &quot;projectId&quot;: &quot;A String&quot;, # Project accessed in the connection.
Bu Sun Kim65020912020-05-20 12:08:20 -0700471 },
472 ],
473 &quot;fileDetails&quot;: [ # Identification of a File source used in the Dataflow job.
474 { # Metadata for a File connector used by the job.
475 &quot;filePattern&quot;: &quot;A String&quot;, # File Pattern used to access files by the connector.
476 },
477 ],
478 &quot;pubsubDetails&quot;: [ # Identification of a PubSub source used in the Dataflow job.
479 { # Metadata for a PubSub connector used by the job.
Bu Sun Kim65020912020-05-20 12:08:20 -0700480 &quot;topic&quot;: &quot;A String&quot;, # Topic accessed in the connection.
Bu Sun Kim4ed7d3f2020-05-27 12:20:54 -0700481 &quot;subscription&quot;: &quot;A String&quot;, # Subscription used in the connection.
482 },
483 ],
484 &quot;bigTableDetails&quot;: [ # Identification of a BigTable source used in the Dataflow job.
485 { # Metadata for a BigTable connector used by the job.
486 &quot;projectId&quot;: &quot;A String&quot;, # ProjectId accessed in the connection.
487 &quot;instanceId&quot;: &quot;A String&quot;, # InstanceId accessed in the connection.
488 &quot;tableId&quot;: &quot;A String&quot;, # TableId accessed in the connection.
489 },
490 ],
491 &quot;spannerDetails&quot;: [ # Identification of a Spanner source used in the Dataflow job.
492 { # Metadata for a Spanner connector used by the job.
493 &quot;instanceId&quot;: &quot;A String&quot;, # InstanceId accessed in the connection.
494 &quot;projectId&quot;: &quot;A String&quot;, # ProjectId accessed in the connection.
495 &quot;databaseId&quot;: &quot;A String&quot;, # DatabaseId accessed in the connection.
Bu Sun Kim65020912020-05-20 12:08:20 -0700496 },
497 ],
498 },
Bu Sun Kim4ed7d3f2020-05-27 12:20:54 -0700499 &quot;type&quot;: &quot;A String&quot;, # The type of Cloud Dataflow job.
500 &quot;projectId&quot;: &quot;A String&quot;, # The ID of the Cloud Platform project that the job belongs to.
Bu Sun Kim65020912020-05-20 12:08:20 -0700501 &quot;createdFromSnapshotId&quot;: &quot;A String&quot;, # If this is specified, the job&#x27;s initial state is populated from the given
502 # snapshot.
Bu Sun Kim65020912020-05-20 12:08:20 -0700503 &quot;pipelineDescription&quot;: { # A descriptive representation of submitted pipeline as well as the executed # Preliminary field: The format of this data may change at any time.
504 # A description of the user pipeline and stages through which it is executed.
505 # Created by Cloud Dataflow service. Only retrieved with
506 # JOB_VIEW_DESCRIPTION or JOB_VIEW_ALL.
507 # form. This data is provided by the Dataflow service for ease of visualizing
508 # the pipeline and interpreting Dataflow provided metrics.
509 &quot;executionPipelineStage&quot;: [ # Description of each stage of execution of the pipeline.
510 { # Description of the composing transforms, names/ids, and input/outputs of a
511 # stage of execution. Some composing transforms and sources may have been
512 # generated by the Dataflow service during execution planning.
Bu Sun Kim4ed7d3f2020-05-27 12:20:54 -0700513 &quot;outputSource&quot;: [ # Output sources for this stage.
514 { # Description of an input or output of an execution stage.
515 &quot;sizeBytes&quot;: &quot;A String&quot;, # Size of the source, if measurable.
516 &quot;name&quot;: &quot;A String&quot;, # Dataflow service generated name for this source.
517 &quot;userName&quot;: &quot;A String&quot;, # Human-readable name for this source; may be user or system generated.
518 &quot;originalTransformOrCollection&quot;: &quot;A String&quot;, # User name for the original user transform or collection with which this
519 # source is most closely associated.
520 },
521 ],
522 &quot;name&quot;: &quot;A String&quot;, # Dataflow service generated name for this stage.
523 &quot;inputSource&quot;: [ # Input sources for this stage.
524 { # Description of an input or output of an execution stage.
525 &quot;sizeBytes&quot;: &quot;A String&quot;, # Size of the source, if measurable.
526 &quot;name&quot;: &quot;A String&quot;, # Dataflow service generated name for this source.
527 &quot;userName&quot;: &quot;A String&quot;, # Human-readable name for this source; may be user or system generated.
528 &quot;originalTransformOrCollection&quot;: &quot;A String&quot;, # User name for the original user transform or collection with which this
529 # source is most closely associated.
530 },
531 ],
Bu Sun Kim65020912020-05-20 12:08:20 -0700532 &quot;id&quot;: &quot;A String&quot;, # Dataflow service generated id for this stage.
533 &quot;componentTransform&quot;: [ # Transforms that comprise this execution stage.
534 { # Description of a transform executed as part of an execution stage.
535 &quot;originalTransform&quot;: &quot;A String&quot;, # User name for the original user transform with which this transform is
536 # most closely associated.
537 &quot;name&quot;: &quot;A String&quot;, # Dataflow service generated name for this source.
538 &quot;userName&quot;: &quot;A String&quot;, # Human-readable name for this transform; may be user or system generated.
539 },
540 ],
541 &quot;componentSource&quot;: [ # Collections produced and consumed by component transforms of this stage.
542 { # Description of an interstitial value between transforms in an execution
543 # stage.
544 &quot;name&quot;: &quot;A String&quot;, # Dataflow service generated name for this source.
545 &quot;userName&quot;: &quot;A String&quot;, # Human-readable name for this transform; may be user or system generated.
546 &quot;originalTransformOrCollection&quot;: &quot;A String&quot;, # User name for the original user transform or collection with which this
547 # source is most closely associated.
548 },
549 ],
550 &quot;kind&quot;: &quot;A String&quot;, # Type of tranform this stage is executing.
Bu Sun Kim65020912020-05-20 12:08:20 -0700551 },
552 ],
553 &quot;originalPipelineTransform&quot;: [ # Description of each transform in the pipeline and collections between them.
554 { # Description of the type, names/ids, and input/outputs for a transform.
555 &quot;kind&quot;: &quot;A String&quot;, # Type of transform.
556 &quot;inputCollectionName&quot;: [ # User names for all collection inputs to this transform.
557 &quot;A String&quot;,
558 ],
559 &quot;name&quot;: &quot;A String&quot;, # User provided name for this transform instance.
560 &quot;id&quot;: &quot;A String&quot;, # SDK generated id of this transform instance.
561 &quot;displayData&quot;: [ # Transform-specific display data.
562 { # Data provided with a pipeline or transform to provide descriptive info.
Bu Sun Kim65020912020-05-20 12:08:20 -0700563 &quot;durationValue&quot;: &quot;A String&quot;, # Contains value if the data is of duration type.
Bu Sun Kim4ed7d3f2020-05-27 12:20:54 -0700564 &quot;int64Value&quot;: &quot;A String&quot;, # Contains value if the data is of int64 type.
Bu Sun Kim65020912020-05-20 12:08:20 -0700565 &quot;namespace&quot;: &quot;A String&quot;, # The namespace for the key. This is usually a class name or programming
566 # language namespace (i.e. python module) which defines the display data.
567 # This allows a dax monitoring system to specially handle the data
568 # and perform custom rendering.
569 &quot;floatValue&quot;: 3.14, # Contains value if the data is of float type.
570 &quot;key&quot;: &quot;A String&quot;, # The key identifying the display data.
571 # This is intended to be used as a label for the display data
572 # when viewed in a dax monitoring system.
573 &quot;shortStrValue&quot;: &quot;A String&quot;, # A possible additional shorter value to display.
574 # For example a java_class_name_value of com.mypackage.MyDoFn
575 # will be stored with MyDoFn as the short_str_value and
576 # com.mypackage.MyDoFn as the java_class_name value.
577 # short_str_value can be displayed and java_class_name_value
578 # will be displayed as a tooltip.
579 &quot;url&quot;: &quot;A String&quot;, # An optional full URL.
580 &quot;label&quot;: &quot;A String&quot;, # An optional label to display in a dax UI for the element.
Bu Sun Kim4ed7d3f2020-05-27 12:20:54 -0700581 &quot;timestampValue&quot;: &quot;A String&quot;, # Contains value if the data is of timestamp type.
582 &quot;boolValue&quot;: True or False, # Contains value if the data is of a boolean type.
583 &quot;javaClassValue&quot;: &quot;A String&quot;, # Contains value if the data is of java class type.
584 &quot;strValue&quot;: &quot;A String&quot;, # Contains value if the data is of string type.
Bu Sun Kim65020912020-05-20 12:08:20 -0700585 },
586 ],
587 &quot;outputCollectionName&quot;: [ # User names for all collection outputs to this transform.
588 &quot;A String&quot;,
589 ],
590 },
591 ],
592 &quot;displayData&quot;: [ # Pipeline level display data.
593 { # Data provided with a pipeline or transform to provide descriptive info.
Bu Sun Kim65020912020-05-20 12:08:20 -0700594 &quot;durationValue&quot;: &quot;A String&quot;, # Contains value if the data is of duration type.
Bu Sun Kim4ed7d3f2020-05-27 12:20:54 -0700595 &quot;int64Value&quot;: &quot;A String&quot;, # Contains value if the data is of int64 type.
Bu Sun Kim65020912020-05-20 12:08:20 -0700596 &quot;namespace&quot;: &quot;A String&quot;, # The namespace for the key. This is usually a class name or programming
597 # language namespace (i.e. python module) which defines the display data.
598 # This allows a dax monitoring system to specially handle the data
599 # and perform custom rendering.
600 &quot;floatValue&quot;: 3.14, # Contains value if the data is of float type.
601 &quot;key&quot;: &quot;A String&quot;, # The key identifying the display data.
602 # This is intended to be used as a label for the display data
603 # when viewed in a dax monitoring system.
604 &quot;shortStrValue&quot;: &quot;A String&quot;, # A possible additional shorter value to display.
605 # For example a java_class_name_value of com.mypackage.MyDoFn
606 # will be stored with MyDoFn as the short_str_value and
607 # com.mypackage.MyDoFn as the java_class_name value.
608 # short_str_value can be displayed and java_class_name_value
609 # will be displayed as a tooltip.
610 &quot;url&quot;: &quot;A String&quot;, # An optional full URL.
611 &quot;label&quot;: &quot;A String&quot;, # An optional label to display in a dax UI for the element.
Bu Sun Kim4ed7d3f2020-05-27 12:20:54 -0700612 &quot;timestampValue&quot;: &quot;A String&quot;, # Contains value if the data is of timestamp type.
613 &quot;boolValue&quot;: True or False, # Contains value if the data is of a boolean type.
614 &quot;javaClassValue&quot;: &quot;A String&quot;, # Contains value if the data is of java class type.
615 &quot;strValue&quot;: &quot;A String&quot;, # Contains value if the data is of string type.
Bu Sun Kim65020912020-05-20 12:08:20 -0700616 },
617 ],
618 },
619 &quot;replaceJobId&quot;: &quot;A String&quot;, # If this job is an update of an existing job, this field is the job ID
620 # of the job it replaced.
621 #
622 # When sending a `CreateJobRequest`, you can update a job by specifying it
623 # here. The job named here is stopped, and its intermediate state is
624 # transferred to this job.
625 &quot;tempFiles&quot;: [ # A set of files the system should be aware of that are used
Bu Sun Kim715bd7f2019-06-14 16:50:42 -0700626 # for temporary storage. These temporary files will be
627 # removed on job completion.
628 # No duplicates are allowed.
629 # No file patterns are supported.
630 #
631 # The supported files are:
632 #
633 # Google Cloud Storage:
634 #
635 # storage.googleapis.com/{bucket}/{object}
636 # bucket.storage.googleapis.com/{object}
Bu Sun Kim65020912020-05-20 12:08:20 -0700637 &quot;A String&quot;,
Bu Sun Kim715bd7f2019-06-14 16:50:42 -0700638 ],
Bu Sun Kim65020912020-05-20 12:08:20 -0700639 &quot;name&quot;: &quot;A String&quot;, # The user-specified Cloud Dataflow job name.
Bu Sun Kim715bd7f2019-06-14 16:50:42 -0700640 #
641 # Only one Job with a given name may exist in a project at any
642 # given time. If a caller attempts to create a Job with the same
643 # name as an already-existing Job, the attempt returns the
644 # existing Job.
645 #
646 # The name must match the regular expression
647 # `[a-z]([-a-z0-9]{0,38}[a-z0-9])?`
Bu Sun Kim65020912020-05-20 12:08:20 -0700648 &quot;steps&quot;: [ # Exactly one of step or steps_location should be specified.
Bu Sun Kim715bd7f2019-06-14 16:50:42 -0700649 #
650 # The top-level steps that constitute the entire job.
651 { # Defines a particular step within a Cloud Dataflow job.
Sai Cheemalapati4ba8c232017-06-06 18:46:08 -0400652 #
Bu Sun Kim715bd7f2019-06-14 16:50:42 -0700653 # A job consists of multiple steps, each of which performs some
654 # specific operation as part of the overall job. Data is typically
655 # passed from one step to another as part of the job.
656 #
Bu Sun Kim65020912020-05-20 12:08:20 -0700657 # Here&#x27;s an example of a sequence of steps which together implement a
Bu Sun Kim715bd7f2019-06-14 16:50:42 -0700658 # Map-Reduce job:
659 #
660 # * Read a collection of data from some source, parsing the
Bu Sun Kim65020912020-05-20 12:08:20 -0700661 # collection&#x27;s elements.
Bu Sun Kim715bd7f2019-06-14 16:50:42 -0700662 #
663 # * Validate the elements.
664 #
665 # * Apply a user-defined function to map each element to some value
666 # and extract an element-specific key value.
667 #
668 # * Group elements with the same key into a single element with
669 # that key, transforming a multiply-keyed collection into a
670 # uniquely-keyed collection.
671 #
672 # * Write the elements out to some data sink.
673 #
674 # Note that the Cloud Dataflow service may be used to run many different
675 # types of jobs, not just Map-Reduce.
Bu Sun Kim65020912020-05-20 12:08:20 -0700676 &quot;name&quot;: &quot;A String&quot;, # The name that identifies the step. This must be unique for each
Dan O'Mearadd494642020-05-01 07:42:23 -0700677 # step with respect to all other steps in the Cloud Dataflow job.
Bu Sun Kim65020912020-05-20 12:08:20 -0700678 &quot;kind&quot;: &quot;A String&quot;, # The kind of step in the Cloud Dataflow job.
679 &quot;properties&quot;: { # Named properties associated with the step. Each kind of
Bu Sun Kim715bd7f2019-06-14 16:50:42 -0700680 # predefined step has its own required set of properties.
681 # Must be provided on Create. Only retrieved with JOB_VIEW_ALL.
Bu Sun Kim65020912020-05-20 12:08:20 -0700682 &quot;a_key&quot;: &quot;&quot;, # Properties of the object.
Bu Sun Kim715bd7f2019-06-14 16:50:42 -0700683 },
Bu Sun Kim715bd7f2019-06-14 16:50:42 -0700684 },
685 ],
Bu Sun Kim65020912020-05-20 12:08:20 -0700686 &quot;replacedByJobId&quot;: &quot;A String&quot;, # If another job is an update of this job (and thus, this job is in
687 # `JOB_STATE_UPDATED`), this field contains the ID of that job.
688 &quot;executionInfo&quot;: { # Additional information about how a Cloud Dataflow job will be executed that # Deprecated.
689 # isn&#x27;t contained in the submitted job.
690 &quot;stages&quot;: { # A mapping from each stage to the information about that stage.
691 &quot;a_key&quot;: { # Contains information about how a particular
692 # google.dataflow.v1beta3.Step will be executed.
693 &quot;stepName&quot;: [ # The steps associated with the execution stage.
694 # Note that stages may have several steps, and that a given step
695 # might be run by more than one stage.
696 &quot;A String&quot;,
697 ],
698 },
699 },
700 },
701 &quot;currentState&quot;: &quot;A String&quot;, # The current state of the job.
Bu Sun Kim715bd7f2019-06-14 16:50:42 -0700702 #
703 # Jobs are created in the `JOB_STATE_STOPPED` state unless otherwise
704 # specified.
705 #
706 # A job in the `JOB_STATE_RUNNING` state may asynchronously enter a
707 # terminal state. After a job has reached a terminal state, no
708 # further state updates may be made.
709 #
710 # This field may be mutated by the Cloud Dataflow service;
711 # callers cannot mutate it.
Bu Sun Kim65020912020-05-20 12:08:20 -0700712 &quot;location&quot;: &quot;A String&quot;, # The [regional endpoint]
713 # (https://cloud.google.com/dataflow/docs/concepts/regional-endpoints) that
714 # contains this job.
715 &quot;startTime&quot;: &quot;A String&quot;, # The timestamp when the job was started (transitioned to JOB_STATE_PENDING).
716 # Flexible resource scheduling jobs are started with some delay after job
717 # creation, so start_time is unset before start and is updated when the
718 # job is started by the Cloud Dataflow service. For other jobs, start_time
719 # always equals to create_time and is immutable and set by the Cloud Dataflow
720 # service.
721 &quot;stepsLocation&quot;: &quot;A String&quot;, # The GCS location where the steps are stored.
722 &quot;labels&quot;: { # User-defined labels for this job.
723 #
724 # The labels map can contain no more than 64 entries. Entries of the labels
725 # map are UTF8 strings that comply with the following restrictions:
726 #
727 # * Keys must conform to regexp: \p{Ll}\p{Lo}{0,62}
728 # * Values must conform to regexp: [\p{Ll}\p{Lo}\p{N}_-]{0,63}
729 # * Both keys and values are additionally constrained to be &lt;= 128 bytes in
730 # size.
731 &quot;a_key&quot;: &quot;A String&quot;,
Bu Sun Kim715bd7f2019-06-14 16:50:42 -0700732 },
Bu Sun Kim65020912020-05-20 12:08:20 -0700733 &quot;createTime&quot;: &quot;A String&quot;, # The timestamp when the job was initially created. Immutable and set by the
734 # Cloud Dataflow service.
735 &quot;requestedState&quot;: &quot;A String&quot;, # The job&#x27;s requested state.
736 #
737 # `UpdateJob` may be used to switch between the `JOB_STATE_STOPPED` and
738 # `JOB_STATE_RUNNING` states, by setting requested_state. `UpdateJob` may
739 # also be used to directly set a job&#x27;s requested state to
740 # `JOB_STATE_CANCELLED` or `JOB_STATE_DONE`, irrevocably terminating the
741 # job if it has not already reached a terminal state.
Bu Sun Kim715bd7f2019-06-14 16:50:42 -0700742 }</pre>
743</div>
744
745<div class="method">
Bu Sun Kim65020912020-05-20 12:08:20 -0700746 <code class="details" id="get">get(projectId, location, view=None, gcsPath=None, x__xgafv=None)</code>
Bu Sun Kim715bd7f2019-06-14 16:50:42 -0700747 <pre>Get the template associated with a template.
748
749Args:
750 projectId: string, Required. The ID of the Cloud Platform project that the job belongs to. (required)
751 location: string, The [regional endpoint]
752(https://cloud.google.com/dataflow/docs/concepts/regional-endpoints) to
753which to direct the request. (required)
Bu Sun Kim65020912020-05-20 12:08:20 -0700754 view: string, The view to retrieve. Defaults to METADATA_ONLY.
Bu Sun Kim715bd7f2019-06-14 16:50:42 -0700755 gcsPath: string, Required. A Cloud Storage path to the template from which to
756create the job.
Bu Sun Kim65020912020-05-20 12:08:20 -0700757Must be valid Cloud Storage URL, beginning with &#x27;gs://&#x27;.
Bu Sun Kim715bd7f2019-06-14 16:50:42 -0700758 x__xgafv: string, V1 error format.
759 Allowed values
760 1 - v1 error format
761 2 - v2 error format
Bu Sun Kim715bd7f2019-06-14 16:50:42 -0700762
763Returns:
764 An object of the form:
765
766 { # The response to a GetTemplate request.
Bu Sun Kim65020912020-05-20 12:08:20 -0700767 &quot;metadata&quot;: { # Metadata describing a template. # The template metadata describing the template name, available
Bu Sun Kim715bd7f2019-06-14 16:50:42 -0700768 # parameters, etc.
Bu Sun Kim65020912020-05-20 12:08:20 -0700769 &quot;parameters&quot;: [ # The parameters for the template.
Bu Sun Kim715bd7f2019-06-14 16:50:42 -0700770 { # Metadata for a specific parameter.
Bu Sun Kim65020912020-05-20 12:08:20 -0700771 &quot;label&quot;: &quot;A String&quot;, # Required. The label to display for the parameter.
772 &quot;paramType&quot;: &quot;A String&quot;, # Optional. The type of the parameter.
Dan O'Mearadd494642020-05-01 07:42:23 -0700773 # Used for selecting input picker.
Bu Sun Kim65020912020-05-20 12:08:20 -0700774 &quot;helpText&quot;: &quot;A String&quot;, # Required. The help text to display for the parameter.
775 &quot;name&quot;: &quot;A String&quot;, # Required. The name of the parameter.
776 &quot;regexes&quot;: [ # Optional. Regexes that the parameter must match.
777 &quot;A String&quot;,
778 ],
779 &quot;isOptional&quot;: True or False, # Optional. Whether the parameter is optional. Defaults to false.
Bu Sun Kim715bd7f2019-06-14 16:50:42 -0700780 },
781 ],
Bu Sun Kim4ed7d3f2020-05-27 12:20:54 -0700782 &quot;name&quot;: &quot;A String&quot;, # Required. The name of the template.
Bu Sun Kim65020912020-05-20 12:08:20 -0700783 &quot;description&quot;: &quot;A String&quot;, # Optional. A description of the template.
784 },
785 &quot;runtimeMetadata&quot;: { # RuntimeMetadata describing a runtime environment. # Describes the runtime metadata with SDKInfo and available parameters.
786 &quot;sdkInfo&quot;: { # SDK Information. # SDK Info for the template.
787 &quot;language&quot;: &quot;A String&quot;, # Required. The SDK Language.
788 &quot;version&quot;: &quot;A String&quot;, # Optional. The SDK version.
789 },
790 &quot;parameters&quot;: [ # The parameters for the template.
791 { # Metadata for a specific parameter.
792 &quot;label&quot;: &quot;A String&quot;, # Required. The label to display for the parameter.
793 &quot;paramType&quot;: &quot;A String&quot;, # Optional. The type of the parameter.
794 # Used for selecting input picker.
795 &quot;helpText&quot;: &quot;A String&quot;, # Required. The help text to display for the parameter.
796 &quot;name&quot;: &quot;A String&quot;, # Required. The name of the parameter.
797 &quot;regexes&quot;: [ # Optional. Regexes that the parameter must match.
798 &quot;A String&quot;,
799 ],
800 &quot;isOptional&quot;: True or False, # Optional. Whether the parameter is optional. Defaults to false.
801 },
802 ],
Bu Sun Kim715bd7f2019-06-14 16:50:42 -0700803 },
Bu Sun Kim4ed7d3f2020-05-27 12:20:54 -0700804 &quot;templateType&quot;: &quot;A String&quot;, # Template Type.
805 &quot;status&quot;: { # The `Status` type defines a logical error model that is suitable for # The status of the get template request. Any problems with the
806 # request will be indicated in the error_details.
807 # different programming environments, including REST APIs and RPC APIs. It is
808 # used by [gRPC](https://github.com/grpc). Each `Status` message contains
809 # three pieces of data: error code, error message, and error details.
810 #
811 # You can find out more about this error model and how to work with it in the
812 # [API Design Guide](https://cloud.google.com/apis/design/errors).
813 &quot;code&quot;: 42, # The status code, which should be an enum value of google.rpc.Code.
814 &quot;message&quot;: &quot;A String&quot;, # A developer-facing error message, which should be in English. Any
815 # user-facing error message should be localized and sent in the
816 # google.rpc.Status.details field, or localized by the client.
817 &quot;details&quot;: [ # A list of messages that carry the error details. There is a common set of
818 # message types for APIs to use.
819 {
820 &quot;a_key&quot;: &quot;&quot;, # Properties of the object. Contains field @type with type URL.
821 },
822 ],
823 },
Bu Sun Kim715bd7f2019-06-14 16:50:42 -0700824 }</pre>
825</div>
826
827<div class="method">
Bu Sun Kim4ed7d3f2020-05-27 12:20:54 -0700828 <code class="details" id="launch">launch(projectId, location, body=None, validateOnly=None, gcsPath=None, dynamicTemplate_gcsPath=None, dynamicTemplate_stagingLocation=None, x__xgafv=None)</code>
Bu Sun Kim715bd7f2019-06-14 16:50:42 -0700829 <pre>Launch a template.
830
831Args:
832 projectId: string, Required. The ID of the Cloud Platform project that the job belongs to. (required)
833 location: string, The [regional endpoint]
834(https://cloud.google.com/dataflow/docs/concepts/regional-endpoints) to
835which to direct the request. (required)
Dan O'Mearadd494642020-05-01 07:42:23 -0700836 body: object, The request body.
Bu Sun Kim715bd7f2019-06-14 16:50:42 -0700837 The object takes the form of:
838
839{ # Parameters to provide to the template being launched.
Bu Sun Kim4ed7d3f2020-05-27 12:20:54 -0700840 &quot;parameters&quot;: { # The runtime parameters to pass to the job.
841 &quot;a_key&quot;: &quot;A String&quot;,
842 },
843 &quot;jobName&quot;: &quot;A String&quot;, # Required. The job name to use for the created job.
Bu Sun Kim65020912020-05-20 12:08:20 -0700844 &quot;transformNameMapping&quot;: { # Only applicable when updating a pipeline. Map of transform name prefixes of
845 # the job to be replaced to the corresponding name prefixes of the new job.
846 &quot;a_key&quot;: &quot;A String&quot;,
847 },
848 &quot;environment&quot;: { # The environment values to set at runtime. # The runtime environment for the job.
Bu Sun Kim4ed7d3f2020-05-27 12:20:54 -0700849 &quot;machineType&quot;: &quot;A String&quot;, # The machine type to use for the job. Defaults to the value from the
850 # template if not specified.
851 &quot;subnetwork&quot;: &quot;A String&quot;, # Subnetwork to which VMs will be assigned, if desired. Expected to be of
852 # the form &quot;regions/REGION/subnetworks/SUBNETWORK&quot;.
853 &quot;ipConfiguration&quot;: &quot;A String&quot;, # Configuration for VM IPs.
854 &quot;kmsKeyName&quot;: &quot;A String&quot;, # Optional. Name for the Cloud KMS key for the job.
855 # Key format is:
856 # projects/&lt;project&gt;/locations/&lt;location&gt;/keyRings/&lt;keyring&gt;/cryptoKeys/&lt;key&gt;
857 &quot;tempLocation&quot;: &quot;A String&quot;, # The Cloud Storage path to use for temporary files.
858 # Must be a valid Cloud Storage URL, beginning with `gs://`.
859 &quot;bypassTempDirValidation&quot;: True or False, # Whether to bypass the safety checks for the job&#x27;s temporary directory.
860 # Use with caution.
861 &quot;network&quot;: &quot;A String&quot;, # Network to which VMs will be assigned. If empty or unspecified,
862 # the service will use the network &quot;default&quot;.
Bu Sun Kim65020912020-05-20 12:08:20 -0700863 &quot;workerRegion&quot;: &quot;A String&quot;, # The Compute Engine region
Dan O'Mearadd494642020-05-01 07:42:23 -0700864 # (https://cloud.google.com/compute/docs/regions-zones/regions-zones) in
Bu Sun Kim65020912020-05-20 12:08:20 -0700865 # which worker processing should occur, e.g. &quot;us-west1&quot;. Mutually exclusive
Dan O'Mearadd494642020-05-01 07:42:23 -0700866 # with worker_zone. If neither worker_region nor worker_zone is specified,
Bu Sun Kim65020912020-05-20 12:08:20 -0700867 # default to the control plane&#x27;s region.
Bu Sun Kim65020912020-05-20 12:08:20 -0700868 &quot;zone&quot;: &quot;A String&quot;, # The Compute Engine [availability
Bu Sun Kim715bd7f2019-06-14 16:50:42 -0700869 # zone](https://cloud.google.com/compute/docs/regions-zones/regions-zones)
870 # for launching worker instances to run your pipeline.
Dan O'Mearadd494642020-05-01 07:42:23 -0700871 # In the future, worker_zone will take precedence.
Bu Sun Kim4ed7d3f2020-05-27 12:20:54 -0700872 &quot;numWorkers&quot;: 42, # The initial number of Google Compute Engine instnaces for the job.
Bu Sun Kim65020912020-05-20 12:08:20 -0700873 &quot;workerZone&quot;: &quot;A String&quot;, # The Compute Engine zone
874 # (https://cloud.google.com/compute/docs/regions-zones/regions-zones) in
875 # which worker processing should occur, e.g. &quot;us-west1-a&quot;. Mutually exclusive
876 # with worker_region. If neither worker_region nor worker_zone is specified,
877 # a zone in the control plane&#x27;s region is chosen based on available capacity.
878 # If both `worker_zone` and `zone` are set, `worker_zone` takes precedence.
879 &quot;additionalUserLabels&quot;: { # Additional user labels to be specified for the job.
Bu Sun Kim715bd7f2019-06-14 16:50:42 -0700880 # Keys and values should follow the restrictions specified in the [labeling
881 # restrictions](https://cloud.google.com/compute/docs/labeling-resources#restrictions)
882 # page.
Bu Sun Kim65020912020-05-20 12:08:20 -0700883 &quot;a_key&quot;: &quot;A String&quot;,
Bu Sun Kim715bd7f2019-06-14 16:50:42 -0700884 },
Bu Sun Kim65020912020-05-20 12:08:20 -0700885 &quot;additionalExperiments&quot;: [ # Additional experiment flags for the job.
886 &quot;A String&quot;,
Bu Sun Kim715bd7f2019-06-14 16:50:42 -0700887 ],
Bu Sun Kim65020912020-05-20 12:08:20 -0700888 &quot;maxWorkers&quot;: 42, # The maximum number of Google Compute Engine instances to be made
889 # available to your pipeline during execution, from 1 to 1000.
890 &quot;serviceAccountEmail&quot;: &quot;A String&quot;, # The email address of the service account to run the job as.
Bu Sun Kim715bd7f2019-06-14 16:50:42 -0700891 },
Bu Sun Kim65020912020-05-20 12:08:20 -0700892 &quot;update&quot;: True or False, # If set, replace the existing pipeline with the name specified by jobName
Dan O'Mearadd494642020-05-01 07:42:23 -0700893 # with this pipeline, preserving state.
Bu Sun Kim715bd7f2019-06-14 16:50:42 -0700894 }
895
Bu Sun Kim4ed7d3f2020-05-27 12:20:54 -0700896 validateOnly: boolean, If true, the request is validated but not actually executed.
897Defaults to false.
Bu Sun Kim65020912020-05-20 12:08:20 -0700898 gcsPath: string, A Cloud Storage path to the template from which to create
899the job.
900Must be valid Cloud Storage URL, beginning with &#x27;gs://&#x27;.
Bu Sun Kim715bd7f2019-06-14 16:50:42 -0700901 dynamicTemplate_gcsPath: string, Path to dynamic template spec file on GCS.
902The file must be a Json serialized DynamicTemplateFieSpec object.
Bu Sun Kim65020912020-05-20 12:08:20 -0700903 dynamicTemplate_stagingLocation: string, Cloud Storage path for staging dependencies.
904Must be a valid Cloud Storage URL, beginning with `gs://`.
Bu Sun Kim715bd7f2019-06-14 16:50:42 -0700905 x__xgafv: string, V1 error format.
906 Allowed values
907 1 - v1 error format
908 2 - v2 error format
Bu Sun Kim715bd7f2019-06-14 16:50:42 -0700909
910Returns:
911 An object of the form:
912
913 { # Response to the request to launch a template.
Bu Sun Kim65020912020-05-20 12:08:20 -0700914 &quot;job&quot;: { # Defines a job to be run by the Cloud Dataflow service. # The job that was launched, if the request was not a dry run and
Bu Sun Kim715bd7f2019-06-14 16:50:42 -0700915 # the job was successfully launched.
Bu Sun Kim65020912020-05-20 12:08:20 -0700916 &quot;clientRequestId&quot;: &quot;A String&quot;, # The client&#x27;s unique identifier of the job, re-used across retried attempts.
917 # If this field is set, the service will ensure its uniqueness.
918 # The request to create a job will fail if the service has knowledge of a
919 # previously submitted job with the same client&#x27;s ID and job name.
920 # The caller may use this field to ensure idempotence of job
921 # creation across retried attempts to create a job.
922 # By default, the field is empty and, in that case, the service ignores it.
923 &quot;id&quot;: &quot;A String&quot;, # The unique ID of this job.
Sai Cheemalapati4ba8c232017-06-06 18:46:08 -0400924 #
925 # This field is set by the Cloud Dataflow service when the Job is
926 # created, and is immutable for the life of the job.
Bu Sun Kim65020912020-05-20 12:08:20 -0700927 &quot;currentStateTime&quot;: &quot;A String&quot;, # The timestamp associated with the current state.
928 &quot;transformNameMapping&quot;: { # The map of transform name prefixes of the job to be replaced to the
Bu Sun Kim715bd7f2019-06-14 16:50:42 -0700929 # corresponding name prefixes of the new job.
Bu Sun Kim65020912020-05-20 12:08:20 -0700930 &quot;a_key&quot;: &quot;A String&quot;,
Bu Sun Kim715bd7f2019-06-14 16:50:42 -0700931 },
Bu Sun Kim65020912020-05-20 12:08:20 -0700932 &quot;environment&quot;: { # Describes the environment in which a Dataflow Job runs. # The environment for the job.
Bu Sun Kim65020912020-05-20 12:08:20 -0700933 &quot;internalExperiments&quot;: { # Experimental settings.
934 &quot;a_key&quot;: &quot;&quot;, # Properties of the object. Contains field @type with type URL.
935 },
936 &quot;workerRegion&quot;: &quot;A String&quot;, # The Compute Engine region
937 # (https://cloud.google.com/compute/docs/regions-zones/regions-zones) in
938 # which worker processing should occur, e.g. &quot;us-west1&quot;. Mutually exclusive
939 # with worker_zone. If neither worker_region nor worker_zone is specified,
940 # default to the control plane&#x27;s region.
941 &quot;serviceKmsKeyName&quot;: &quot;A String&quot;, # If set, contains the Cloud KMS key identifier used to encrypt data
942 # at rest, AKA a Customer Managed Encryption Key (CMEK).
943 #
944 # Format:
945 # projects/PROJECT_ID/locations/LOCATION/keyRings/KEY_RING/cryptoKeys/KEY
946 &quot;userAgent&quot;: { # A description of the process that generated the request.
947 &quot;a_key&quot;: &quot;&quot;, # Properties of the object.
948 },
949 &quot;workerZone&quot;: &quot;A String&quot;, # The Compute Engine zone
950 # (https://cloud.google.com/compute/docs/regions-zones/regions-zones) in
951 # which worker processing should occur, e.g. &quot;us-west1-a&quot;. Mutually exclusive
952 # with worker_region. If neither worker_region nor worker_zone is specified,
953 # a zone in the control plane&#x27;s region is chosen based on available capacity.
954 &quot;clusterManagerApiService&quot;: &quot;A String&quot;, # The type of cluster manager API to use. If unknown or
Dan O'Mearadd494642020-05-01 07:42:23 -0700955 # unspecified, the service will attempt to choose a reasonable
956 # default. This should be in the form of the API service name,
Bu Sun Kim65020912020-05-20 12:08:20 -0700957 # e.g. &quot;compute.googleapis.com&quot;.
958 &quot;tempStoragePrefix&quot;: &quot;A String&quot;, # The prefix of the resources the system should use for temporary
959 # storage. The system will append the suffix &quot;/temp-{JOBNAME} to
Bu Sun Kim715bd7f2019-06-14 16:50:42 -0700960 # this resource prefix, where {JOBNAME} is the value of the
961 # job_name field. The resulting bucket and object prefix is used
962 # as the prefix of the resources used to store temporary data
963 # needed during the job execution. NOTE: This will override the
964 # value in taskrunner_settings.
965 # The supported resource type is:
966 #
967 # Google Cloud Storage:
968 #
969 # storage.googleapis.com/{bucket}/{object}
970 # bucket.storage.googleapis.com/{object}
Bu Sun Kim65020912020-05-20 12:08:20 -0700971 &quot;experiments&quot;: [ # The list of experiments to enable.
972 &quot;A String&quot;,
973 ],
974 &quot;version&quot;: { # A structure describing which components and their versions of the service
975 # are required in order to run the job.
976 &quot;a_key&quot;: &quot;&quot;, # Properties of the object.
977 },
978 &quot;serviceAccountEmail&quot;: &quot;A String&quot;, # Identity to run virtual machines as. Defaults to the default account.
Bu Sun Kim4ed7d3f2020-05-27 12:20:54 -0700979 &quot;sdkPipelineOptions&quot;: { # The Cloud Dataflow SDK pipeline options specified by the user. These
980 # options are passed through the service and are used to recreate the
981 # SDK pipeline options on the worker in a language agnostic and platform
982 # independent way.
983 &quot;a_key&quot;: &quot;&quot;, # Properties of the object.
984 },
985 &quot;flexResourceSchedulingGoal&quot;: &quot;A String&quot;, # Which Flexible Resource Scheduling mode to run in.
986 &quot;workerPools&quot;: [ # The worker pools. At least one &quot;harness&quot; worker pool must be
987 # specified in order for the job to have workers.
988 { # Describes one particular pool of Cloud Dataflow workers to be
989 # instantiated by the Cloud Dataflow service in order to perform the
990 # computations required by a job. Note that a workflow job may use
991 # multiple pools, in order to match the various computational
992 # requirements of the various stages of the job.
993 &quot;numThreadsPerWorker&quot;: 42, # The number of threads per worker harness. If empty or unspecified, the
994 # service will choose a number of threads (according to the number of cores
995 # on the selected machine type for batch, or 1 by convention for streaming).
996 &quot;numWorkers&quot;: 42, # Number of Google Compute Engine workers in this pool needed to
997 # execute the job. If zero or unspecified, the service will
998 # attempt to choose a reasonable default.
999 &quot;zone&quot;: &quot;A String&quot;, # Zone to run the worker pools in. If empty or unspecified, the service
1000 # will attempt to choose a reasonable default.
1001 &quot;diskSourceImage&quot;: &quot;A String&quot;, # Fully qualified source image for disks.
1002 &quot;packages&quot;: [ # Packages to be installed on workers.
1003 { # The packages that must be installed in order for a worker to run the
1004 # steps of the Cloud Dataflow job that will be assigned to its worker
1005 # pool.
1006 #
1007 # This is the mechanism by which the Cloud Dataflow SDK causes code to
1008 # be loaded onto the workers. For example, the Cloud Dataflow Java SDK
1009 # might use this to install jars containing the user&#x27;s code and all of the
1010 # various dependencies (libraries, data files, etc.) required in order
1011 # for that code to run.
1012 &quot;name&quot;: &quot;A String&quot;, # The name of the package.
1013 &quot;location&quot;: &quot;A String&quot;, # The resource to read the package from. The supported resource type is:
1014 #
1015 # Google Cloud Storage:
1016 #
1017 # storage.googleapis.com/{bucket}
1018 # bucket.storage.googleapis.com/
1019 },
1020 ],
1021 &quot;teardownPolicy&quot;: &quot;A String&quot;, # Sets the policy for determining when to turndown worker pool.
1022 # Allowed values are: `TEARDOWN_ALWAYS`, `TEARDOWN_ON_SUCCESS`, and
1023 # `TEARDOWN_NEVER`.
1024 # `TEARDOWN_ALWAYS` means workers are always torn down regardless of whether
1025 # the job succeeds. `TEARDOWN_ON_SUCCESS` means workers are torn down
1026 # if the job succeeds. `TEARDOWN_NEVER` means the workers are never torn
1027 # down.
1028 #
1029 # If the workers are not torn down by the service, they will
1030 # continue to run and use Google Compute Engine VM resources in the
1031 # user&#x27;s project until they are explicitly terminated by the user.
1032 # Because of this, Google recommends using the `TEARDOWN_ALWAYS`
1033 # policy except for small, manually supervised test jobs.
1034 #
1035 # If unknown or unspecified, the service will attempt to choose a reasonable
1036 # default.
1037 &quot;onHostMaintenance&quot;: &quot;A String&quot;, # The action to take on host maintenance, as defined by the Google
1038 # Compute Engine API.
1039 &quot;poolArgs&quot;: { # Extra arguments for this worker pool.
1040 &quot;a_key&quot;: &quot;&quot;, # Properties of the object. Contains field @type with type URL.
1041 },
1042 &quot;diskSizeGb&quot;: 42, # Size of root disk for VMs, in GB. If zero or unspecified, the service will
1043 # attempt to choose a reasonable default.
1044 &quot;workerHarnessContainerImage&quot;: &quot;A String&quot;, # Required. Docker container image that executes the Cloud Dataflow worker
1045 # harness, residing in Google Container Registry.
1046 #
1047 # Deprecated for the Fn API path. Use sdk_harness_container_images instead.
1048 &quot;diskType&quot;: &quot;A String&quot;, # Type of root disk for VMs. If empty or unspecified, the service will
1049 # attempt to choose a reasonable default.
1050 &quot;machineType&quot;: &quot;A String&quot;, # Machine type (e.g. &quot;n1-standard-1&quot;). If empty or unspecified, the
1051 # service will attempt to choose a reasonable default.
1052 &quot;kind&quot;: &quot;A String&quot;, # The kind of the worker pool; currently only `harness` and `shuffle`
1053 # are supported.
1054 &quot;sdkHarnessContainerImages&quot;: [ # Set of SDK harness containers needed to execute this pipeline. This will
1055 # only be set in the Fn API path. For non-cross-language pipelines this
1056 # should have only one entry. Cross-language pipelines will have two or more
1057 # entries.
1058 { # Defines a SDK harness container for executing Dataflow pipelines.
1059 &quot;containerImage&quot;: &quot;A String&quot;, # A docker container image that resides in Google Container Registry.
1060 &quot;useSingleCorePerContainer&quot;: True or False, # If true, recommends the Dataflow service to use only one core per SDK
1061 # container instance with this image. If false (or unset) recommends using
1062 # more than one core per SDK container instance with this image for
1063 # efficiency. Note that Dataflow service may choose to override this property
1064 # if needed.
1065 },
1066 ],
1067 &quot;dataDisks&quot;: [ # Data disks that are used by a VM in this workflow.
1068 { # Describes the data disk used by a workflow job.
1069 &quot;diskType&quot;: &quot;A String&quot;, # Disk storage type, as defined by Google Compute Engine. This
1070 # must be a disk type appropriate to the project and zone in which
1071 # the workers will run. If unknown or unspecified, the service
1072 # will attempt to choose a reasonable default.
1073 #
1074 # For example, the standard persistent disk type is a resource name
1075 # typically ending in &quot;pd-standard&quot;. If SSD persistent disks are
1076 # available, the resource name typically ends with &quot;pd-ssd&quot;. The
1077 # actual valid values are defined the Google Compute Engine API,
1078 # not by the Cloud Dataflow API; consult the Google Compute Engine
1079 # documentation for more information about determining the set of
1080 # available disk types for a particular project and zone.
1081 #
1082 # Google Compute Engine Disk types are local to a particular
1083 # project in a particular zone, and so the resource name will
1084 # typically look something like this:
1085 #
1086 # compute.googleapis.com/projects/project-id/zones/zone/diskTypes/pd-standard
1087 &quot;sizeGb&quot;: 42, # Size of disk in GB. If zero or unspecified, the service will
1088 # attempt to choose a reasonable default.
1089 &quot;mountPoint&quot;: &quot;A String&quot;, # Directory in a VM where disk is mounted.
1090 },
1091 ],
1092 &quot;subnetwork&quot;: &quot;A String&quot;, # Subnetwork to which VMs will be assigned, if desired. Expected to be of
1093 # the form &quot;regions/REGION/subnetworks/SUBNETWORK&quot;.
1094 &quot;ipConfiguration&quot;: &quot;A String&quot;, # Configuration for VM IPs.
1095 &quot;taskrunnerSettings&quot;: { # Taskrunner configuration settings. # Settings passed through to Google Compute Engine workers when
1096 # using the standard Dataflow task runner. Users should ignore
1097 # this field.
1098 &quot;alsologtostderr&quot;: True or False, # Whether to also send taskrunner log info to stderr.
1099 &quot;taskGroup&quot;: &quot;A String&quot;, # The UNIX group ID on the worker VM to use for tasks launched by
1100 # taskrunner; e.g. &quot;wheel&quot;.
1101 &quot;harnessCommand&quot;: &quot;A String&quot;, # The command to launch the worker harness.
1102 &quot;logDir&quot;: &quot;A String&quot;, # The directory on the VM to store logs.
1103 &quot;oauthScopes&quot;: [ # The OAuth2 scopes to be requested by the taskrunner in order to
1104 # access the Cloud Dataflow API.
1105 &quot;A String&quot;,
1106 ],
1107 &quot;dataflowApiVersion&quot;: &quot;A String&quot;, # The API version of endpoint, e.g. &quot;v1b3&quot;
1108 &quot;logUploadLocation&quot;: &quot;A String&quot;, # Indicates where to put logs. If this is not specified, the logs
1109 # will not be uploaded.
1110 #
1111 # The supported resource type is:
1112 #
1113 # Google Cloud Storage:
1114 # storage.googleapis.com/{bucket}/{object}
1115 # bucket.storage.googleapis.com/{object}
1116 &quot;streamingWorkerMainClass&quot;: &quot;A String&quot;, # The streaming worker main class name.
1117 &quot;workflowFileName&quot;: &quot;A String&quot;, # The file to store the workflow in.
1118 &quot;languageHint&quot;: &quot;A String&quot;, # The suggested backend language.
1119 &quot;commandlinesFileName&quot;: &quot;A String&quot;, # The file to store preprocessing commands in.
1120 &quot;baseTaskDir&quot;: &quot;A String&quot;, # The location on the worker for task-specific subdirectories.
1121 &quot;tempStoragePrefix&quot;: &quot;A String&quot;, # The prefix of the resources the taskrunner should use for
1122 # temporary storage.
1123 #
1124 # The supported resource type is:
1125 #
1126 # Google Cloud Storage:
1127 # storage.googleapis.com/{bucket}/{object}
1128 # bucket.storage.googleapis.com/{object}
1129 &quot;baseUrl&quot;: &quot;A String&quot;, # The base URL for the taskrunner to use when accessing Google Cloud APIs.
1130 #
1131 # When workers access Google Cloud APIs, they logically do so via
1132 # relative URLs. If this field is specified, it supplies the base
1133 # URL to use for resolving these relative URLs. The normative
1134 # algorithm used is defined by RFC 1808, &quot;Relative Uniform Resource
1135 # Locators&quot;.
1136 #
1137 # If not specified, the default value is &quot;http://www.googleapis.com/&quot;
1138 &quot;logToSerialconsole&quot;: True or False, # Whether to send taskrunner log info to Google Compute Engine VM serial
1139 # console.
1140 &quot;continueOnException&quot;: True or False, # Whether to continue taskrunner if an exception is hit.
1141 &quot;parallelWorkerSettings&quot;: { # Provides data to pass through to the worker harness. # The settings to pass to the parallel worker harness.
1142 &quot;tempStoragePrefix&quot;: &quot;A String&quot;, # The prefix of the resources the system should use for temporary
1143 # storage.
1144 #
1145 # The supported resource type is:
1146 #
1147 # Google Cloud Storage:
1148 #
1149 # storage.googleapis.com/{bucket}/{object}
1150 # bucket.storage.googleapis.com/{object}
1151 &quot;reportingEnabled&quot;: True or False, # Whether to send work progress updates to the service.
1152 &quot;baseUrl&quot;: &quot;A String&quot;, # The base URL for accessing Google Cloud APIs.
1153 #
1154 # When workers access Google Cloud APIs, they logically do so via
1155 # relative URLs. If this field is specified, it supplies the base
1156 # URL to use for resolving these relative URLs. The normative
1157 # algorithm used is defined by RFC 1808, &quot;Relative Uniform Resource
1158 # Locators&quot;.
1159 #
1160 # If not specified, the default value is &quot;http://www.googleapis.com/&quot;
1161 &quot;servicePath&quot;: &quot;A String&quot;, # The Cloud Dataflow service path relative to the root URL, for example,
1162 # &quot;dataflow/v1b3/projects&quot;.
1163 &quot;shuffleServicePath&quot;: &quot;A String&quot;, # The Shuffle service path relative to the root URL, for example,
1164 # &quot;shuffle/v1beta1&quot;.
1165 &quot;workerId&quot;: &quot;A String&quot;, # The ID of the worker running this pipeline.
1166 },
1167 &quot;taskUser&quot;: &quot;A String&quot;, # The UNIX user ID on the worker VM to use for tasks launched by
1168 # taskrunner; e.g. &quot;root&quot;.
1169 &quot;vmId&quot;: &quot;A String&quot;, # The ID string of the VM.
1170 },
1171 &quot;autoscalingSettings&quot;: { # Settings for WorkerPool autoscaling. # Settings for autoscaling of this WorkerPool.
1172 &quot;algorithm&quot;: &quot;A String&quot;, # The algorithm to use for autoscaling.
1173 &quot;maxNumWorkers&quot;: 42, # The maximum number of workers to cap scaling at.
1174 },
1175 &quot;metadata&quot;: { # Metadata to set on the Google Compute Engine VMs.
1176 &quot;a_key&quot;: &quot;A String&quot;,
1177 },
1178 &quot;defaultPackageSet&quot;: &quot;A String&quot;, # The default package set to install. This allows the service to
1179 # select a default set of packages which are useful to worker
1180 # harnesses written in a particular language.
1181 &quot;network&quot;: &quot;A String&quot;, # Network to which VMs will be assigned. If empty or unspecified,
1182 # the service will use the network &quot;default&quot;.
1183 },
1184 ],
1185 &quot;dataset&quot;: &quot;A String&quot;, # The dataset for the current project where various workflow
1186 # related tables are stored.
1187 #
1188 # The supported resource type is:
1189 #
1190 # Google BigQuery:
1191 # bigquery.googleapis.com/{dataset}
Sai Cheemalapati4ba8c232017-06-06 18:46:08 -04001192 },
Bu Sun Kim65020912020-05-20 12:08:20 -07001193 &quot;stageStates&quot;: [ # This field may be mutated by the Cloud Dataflow service;
1194 # callers cannot mutate it.
1195 { # A message describing the state of a particular execution stage.
Bu Sun Kim65020912020-05-20 12:08:20 -07001196 &quot;currentStateTime&quot;: &quot;A String&quot;, # The time at which the stage transitioned to this state.
1197 &quot;executionStageState&quot;: &quot;A String&quot;, # Executions stage states allow the same set of values as JobState.
Bu Sun Kim4ed7d3f2020-05-27 12:20:54 -07001198 &quot;executionStageName&quot;: &quot;A String&quot;, # The name of the execution stage.
Bu Sun Kim65020912020-05-20 12:08:20 -07001199 },
1200 ],
1201 &quot;jobMetadata&quot;: { # Metadata available primarily for filtering jobs. Will be included in the # This field is populated by the Dataflow service to support filtering jobs
1202 # by the metadata values provided here. Populated for ListJobs and all GetJob
1203 # views SUMMARY and higher.
1204 # ListJob response and Job SUMMARY view.
Bu Sun Kim65020912020-05-20 12:08:20 -07001205 &quot;datastoreDetails&quot;: [ # Identification of a Datastore source used in the Dataflow job.
1206 { # Metadata for a Datastore connector used by the job.
Bu Sun Kim65020912020-05-20 12:08:20 -07001207 &quot;namespace&quot;: &quot;A String&quot;, # Namespace used in the connection.
Bu Sun Kim4ed7d3f2020-05-27 12:20:54 -07001208 &quot;projectId&quot;: &quot;A String&quot;, # ProjectId accessed in the connection.
Bu Sun Kim65020912020-05-20 12:08:20 -07001209 },
1210 ],
1211 &quot;sdkVersion&quot;: { # The version of the SDK used to run the job. # The SDK version used to run the job.
Bu Sun Kim65020912020-05-20 12:08:20 -07001212 &quot;version&quot;: &quot;A String&quot;, # The version of the SDK used to run the job.
Bu Sun Kim4ed7d3f2020-05-27 12:20:54 -07001213 &quot;sdkSupportStatus&quot;: &quot;A String&quot;, # The support status for this SDK version.
1214 &quot;versionDisplayName&quot;: &quot;A String&quot;, # A readable string describing the version of the SDK.
Bu Sun Kim65020912020-05-20 12:08:20 -07001215 },
1216 &quot;bigqueryDetails&quot;: [ # Identification of a BigQuery source used in the Dataflow job.
1217 { # Metadata for a BigQuery connector used by the job.
1218 &quot;table&quot;: &quot;A String&quot;, # Table accessed in the connection.
1219 &quot;dataset&quot;: &quot;A String&quot;, # Dataset accessed in the connection.
Bu Sun Kim65020912020-05-20 12:08:20 -07001220 &quot;query&quot;: &quot;A String&quot;, # Query used to access data in the connection.
Bu Sun Kim4ed7d3f2020-05-27 12:20:54 -07001221 &quot;projectId&quot;: &quot;A String&quot;, # Project accessed in the connection.
Bu Sun Kim65020912020-05-20 12:08:20 -07001222 },
1223 ],
1224 &quot;fileDetails&quot;: [ # Identification of a File source used in the Dataflow job.
1225 { # Metadata for a File connector used by the job.
1226 &quot;filePattern&quot;: &quot;A String&quot;, # File Pattern used to access files by the connector.
1227 },
1228 ],
1229 &quot;pubsubDetails&quot;: [ # Identification of a PubSub source used in the Dataflow job.
1230 { # Metadata for a PubSub connector used by the job.
Bu Sun Kim65020912020-05-20 12:08:20 -07001231 &quot;topic&quot;: &quot;A String&quot;, # Topic accessed in the connection.
Bu Sun Kim4ed7d3f2020-05-27 12:20:54 -07001232 &quot;subscription&quot;: &quot;A String&quot;, # Subscription used in the connection.
1233 },
1234 ],
1235 &quot;bigTableDetails&quot;: [ # Identification of a BigTable source used in the Dataflow job.
1236 { # Metadata for a BigTable connector used by the job.
1237 &quot;projectId&quot;: &quot;A String&quot;, # ProjectId accessed in the connection.
1238 &quot;instanceId&quot;: &quot;A String&quot;, # InstanceId accessed in the connection.
1239 &quot;tableId&quot;: &quot;A String&quot;, # TableId accessed in the connection.
1240 },
1241 ],
1242 &quot;spannerDetails&quot;: [ # Identification of a Spanner source used in the Dataflow job.
1243 { # Metadata for a Spanner connector used by the job.
1244 &quot;instanceId&quot;: &quot;A String&quot;, # InstanceId accessed in the connection.
1245 &quot;projectId&quot;: &quot;A String&quot;, # ProjectId accessed in the connection.
1246 &quot;databaseId&quot;: &quot;A String&quot;, # DatabaseId accessed in the connection.
Bu Sun Kim65020912020-05-20 12:08:20 -07001247 },
1248 ],
1249 },
Bu Sun Kim4ed7d3f2020-05-27 12:20:54 -07001250 &quot;type&quot;: &quot;A String&quot;, # The type of Cloud Dataflow job.
1251 &quot;projectId&quot;: &quot;A String&quot;, # The ID of the Cloud Platform project that the job belongs to.
Bu Sun Kim65020912020-05-20 12:08:20 -07001252 &quot;createdFromSnapshotId&quot;: &quot;A String&quot;, # If this is specified, the job&#x27;s initial state is populated from the given
1253 # snapshot.
Bu Sun Kim65020912020-05-20 12:08:20 -07001254 &quot;pipelineDescription&quot;: { # A descriptive representation of submitted pipeline as well as the executed # Preliminary field: The format of this data may change at any time.
1255 # A description of the user pipeline and stages through which it is executed.
1256 # Created by Cloud Dataflow service. Only retrieved with
1257 # JOB_VIEW_DESCRIPTION or JOB_VIEW_ALL.
1258 # form. This data is provided by the Dataflow service for ease of visualizing
1259 # the pipeline and interpreting Dataflow provided metrics.
1260 &quot;executionPipelineStage&quot;: [ # Description of each stage of execution of the pipeline.
1261 { # Description of the composing transforms, names/ids, and input/outputs of a
1262 # stage of execution. Some composing transforms and sources may have been
1263 # generated by the Dataflow service during execution planning.
Bu Sun Kim4ed7d3f2020-05-27 12:20:54 -07001264 &quot;outputSource&quot;: [ # Output sources for this stage.
1265 { # Description of an input or output of an execution stage.
1266 &quot;sizeBytes&quot;: &quot;A String&quot;, # Size of the source, if measurable.
1267 &quot;name&quot;: &quot;A String&quot;, # Dataflow service generated name for this source.
1268 &quot;userName&quot;: &quot;A String&quot;, # Human-readable name for this source; may be user or system generated.
1269 &quot;originalTransformOrCollection&quot;: &quot;A String&quot;, # User name for the original user transform or collection with which this
1270 # source is most closely associated.
1271 },
1272 ],
1273 &quot;name&quot;: &quot;A String&quot;, # Dataflow service generated name for this stage.
1274 &quot;inputSource&quot;: [ # Input sources for this stage.
1275 { # Description of an input or output of an execution stage.
1276 &quot;sizeBytes&quot;: &quot;A String&quot;, # Size of the source, if measurable.
1277 &quot;name&quot;: &quot;A String&quot;, # Dataflow service generated name for this source.
1278 &quot;userName&quot;: &quot;A String&quot;, # Human-readable name for this source; may be user or system generated.
1279 &quot;originalTransformOrCollection&quot;: &quot;A String&quot;, # User name for the original user transform or collection with which this
1280 # source is most closely associated.
1281 },
1282 ],
Bu Sun Kim65020912020-05-20 12:08:20 -07001283 &quot;id&quot;: &quot;A String&quot;, # Dataflow service generated id for this stage.
1284 &quot;componentTransform&quot;: [ # Transforms that comprise this execution stage.
1285 { # Description of a transform executed as part of an execution stage.
1286 &quot;originalTransform&quot;: &quot;A String&quot;, # User name for the original user transform with which this transform is
1287 # most closely associated.
1288 &quot;name&quot;: &quot;A String&quot;, # Dataflow service generated name for this source.
1289 &quot;userName&quot;: &quot;A String&quot;, # Human-readable name for this transform; may be user or system generated.
1290 },
1291 ],
1292 &quot;componentSource&quot;: [ # Collections produced and consumed by component transforms of this stage.
1293 { # Description of an interstitial value between transforms in an execution
1294 # stage.
1295 &quot;name&quot;: &quot;A String&quot;, # Dataflow service generated name for this source.
1296 &quot;userName&quot;: &quot;A String&quot;, # Human-readable name for this transform; may be user or system generated.
1297 &quot;originalTransformOrCollection&quot;: &quot;A String&quot;, # User name for the original user transform or collection with which this
1298 # source is most closely associated.
1299 },
1300 ],
1301 &quot;kind&quot;: &quot;A String&quot;, # Type of tranform this stage is executing.
Bu Sun Kim65020912020-05-20 12:08:20 -07001302 },
1303 ],
1304 &quot;originalPipelineTransform&quot;: [ # Description of each transform in the pipeline and collections between them.
1305 { # Description of the type, names/ids, and input/outputs for a transform.
1306 &quot;kind&quot;: &quot;A String&quot;, # Type of transform.
1307 &quot;inputCollectionName&quot;: [ # User names for all collection inputs to this transform.
1308 &quot;A String&quot;,
1309 ],
1310 &quot;name&quot;: &quot;A String&quot;, # User provided name for this transform instance.
1311 &quot;id&quot;: &quot;A String&quot;, # SDK generated id of this transform instance.
1312 &quot;displayData&quot;: [ # Transform-specific display data.
1313 { # Data provided with a pipeline or transform to provide descriptive info.
Bu Sun Kim65020912020-05-20 12:08:20 -07001314 &quot;durationValue&quot;: &quot;A String&quot;, # Contains value if the data is of duration type.
Bu Sun Kim4ed7d3f2020-05-27 12:20:54 -07001315 &quot;int64Value&quot;: &quot;A String&quot;, # Contains value if the data is of int64 type.
Bu Sun Kim65020912020-05-20 12:08:20 -07001316 &quot;namespace&quot;: &quot;A String&quot;, # The namespace for the key. This is usually a class name or programming
1317 # language namespace (i.e. python module) which defines the display data.
1318 # This allows a dax monitoring system to specially handle the data
1319 # and perform custom rendering.
1320 &quot;floatValue&quot;: 3.14, # Contains value if the data is of float type.
1321 &quot;key&quot;: &quot;A String&quot;, # The key identifying the display data.
1322 # This is intended to be used as a label for the display data
1323 # when viewed in a dax monitoring system.
1324 &quot;shortStrValue&quot;: &quot;A String&quot;, # A possible additional shorter value to display.
1325 # For example a java_class_name_value of com.mypackage.MyDoFn
1326 # will be stored with MyDoFn as the short_str_value and
1327 # com.mypackage.MyDoFn as the java_class_name value.
1328 # short_str_value can be displayed and java_class_name_value
1329 # will be displayed as a tooltip.
1330 &quot;url&quot;: &quot;A String&quot;, # An optional full URL.
1331 &quot;label&quot;: &quot;A String&quot;, # An optional label to display in a dax UI for the element.
Bu Sun Kim4ed7d3f2020-05-27 12:20:54 -07001332 &quot;timestampValue&quot;: &quot;A String&quot;, # Contains value if the data is of timestamp type.
1333 &quot;boolValue&quot;: True or False, # Contains value if the data is of a boolean type.
1334 &quot;javaClassValue&quot;: &quot;A String&quot;, # Contains value if the data is of java class type.
1335 &quot;strValue&quot;: &quot;A String&quot;, # Contains value if the data is of string type.
Bu Sun Kim65020912020-05-20 12:08:20 -07001336 },
1337 ],
1338 &quot;outputCollectionName&quot;: [ # User names for all collection outputs to this transform.
1339 &quot;A String&quot;,
1340 ],
1341 },
1342 ],
1343 &quot;displayData&quot;: [ # Pipeline level display data.
1344 { # Data provided with a pipeline or transform to provide descriptive info.
Bu Sun Kim65020912020-05-20 12:08:20 -07001345 &quot;durationValue&quot;: &quot;A String&quot;, # Contains value if the data is of duration type.
Bu Sun Kim4ed7d3f2020-05-27 12:20:54 -07001346 &quot;int64Value&quot;: &quot;A String&quot;, # Contains value if the data is of int64 type.
Bu Sun Kim65020912020-05-20 12:08:20 -07001347 &quot;namespace&quot;: &quot;A String&quot;, # The namespace for the key. This is usually a class name or programming
1348 # language namespace (i.e. python module) which defines the display data.
1349 # This allows a dax monitoring system to specially handle the data
1350 # and perform custom rendering.
1351 &quot;floatValue&quot;: 3.14, # Contains value if the data is of float type.
1352 &quot;key&quot;: &quot;A String&quot;, # The key identifying the display data.
1353 # This is intended to be used as a label for the display data
1354 # when viewed in a dax monitoring system.
1355 &quot;shortStrValue&quot;: &quot;A String&quot;, # A possible additional shorter value to display.
1356 # For example a java_class_name_value of com.mypackage.MyDoFn
1357 # will be stored with MyDoFn as the short_str_value and
1358 # com.mypackage.MyDoFn as the java_class_name value.
1359 # short_str_value can be displayed and java_class_name_value
1360 # will be displayed as a tooltip.
1361 &quot;url&quot;: &quot;A String&quot;, # An optional full URL.
1362 &quot;label&quot;: &quot;A String&quot;, # An optional label to display in a dax UI for the element.
Bu Sun Kim4ed7d3f2020-05-27 12:20:54 -07001363 &quot;timestampValue&quot;: &quot;A String&quot;, # Contains value if the data is of timestamp type.
1364 &quot;boolValue&quot;: True or False, # Contains value if the data is of a boolean type.
1365 &quot;javaClassValue&quot;: &quot;A String&quot;, # Contains value if the data is of java class type.
1366 &quot;strValue&quot;: &quot;A String&quot;, # Contains value if the data is of string type.
Bu Sun Kim65020912020-05-20 12:08:20 -07001367 },
1368 ],
1369 },
1370 &quot;replaceJobId&quot;: &quot;A String&quot;, # If this job is an update of an existing job, this field is the job ID
1371 # of the job it replaced.
1372 #
1373 # When sending a `CreateJobRequest`, you can update a job by specifying it
1374 # here. The job named here is stopped, and its intermediate state is
1375 # transferred to this job.
1376 &quot;tempFiles&quot;: [ # A set of files the system should be aware of that are used
Bu Sun Kim715bd7f2019-06-14 16:50:42 -07001377 # for temporary storage. These temporary files will be
1378 # removed on job completion.
1379 # No duplicates are allowed.
1380 # No file patterns are supported.
1381 #
1382 # The supported files are:
1383 #
1384 # Google Cloud Storage:
1385 #
1386 # storage.googleapis.com/{bucket}/{object}
1387 # bucket.storage.googleapis.com/{object}
Bu Sun Kim65020912020-05-20 12:08:20 -07001388 &quot;A String&quot;,
Bu Sun Kim715bd7f2019-06-14 16:50:42 -07001389 ],
Bu Sun Kim65020912020-05-20 12:08:20 -07001390 &quot;name&quot;: &quot;A String&quot;, # The user-specified Cloud Dataflow job name.
Bu Sun Kim715bd7f2019-06-14 16:50:42 -07001391 #
1392 # Only one Job with a given name may exist in a project at any
1393 # given time. If a caller attempts to create a Job with the same
1394 # name as an already-existing Job, the attempt returns the
1395 # existing Job.
1396 #
1397 # The name must match the regular expression
1398 # `[a-z]([-a-z0-9]{0,38}[a-z0-9])?`
Bu Sun Kim65020912020-05-20 12:08:20 -07001399 &quot;steps&quot;: [ # Exactly one of step or steps_location should be specified.
Bu Sun Kim715bd7f2019-06-14 16:50:42 -07001400 #
1401 # The top-level steps that constitute the entire job.
Sai Cheemalapati4ba8c232017-06-06 18:46:08 -04001402 { # Defines a particular step within a Cloud Dataflow job.
1403 #
1404 # A job consists of multiple steps, each of which performs some
1405 # specific operation as part of the overall job. Data is typically
1406 # passed from one step to another as part of the job.
1407 #
Bu Sun Kim65020912020-05-20 12:08:20 -07001408 # Here&#x27;s an example of a sequence of steps which together implement a
Sai Cheemalapati4ba8c232017-06-06 18:46:08 -04001409 # Map-Reduce job:
1410 #
1411 # * Read a collection of data from some source, parsing the
Bu Sun Kim65020912020-05-20 12:08:20 -07001412 # collection&#x27;s elements.
Sai Cheemalapati4ba8c232017-06-06 18:46:08 -04001413 #
1414 # * Validate the elements.
1415 #
1416 # * Apply a user-defined function to map each element to some value
1417 # and extract an element-specific key value.
1418 #
1419 # * Group elements with the same key into a single element with
1420 # that key, transforming a multiply-keyed collection into a
1421 # uniquely-keyed collection.
1422 #
1423 # * Write the elements out to some data sink.
1424 #
1425 # Note that the Cloud Dataflow service may be used to run many different
1426 # types of jobs, not just Map-Reduce.
Bu Sun Kim65020912020-05-20 12:08:20 -07001427 &quot;name&quot;: &quot;A String&quot;, # The name that identifies the step. This must be unique for each
Dan O'Mearadd494642020-05-01 07:42:23 -07001428 # step with respect to all other steps in the Cloud Dataflow job.
Bu Sun Kim65020912020-05-20 12:08:20 -07001429 &quot;kind&quot;: &quot;A String&quot;, # The kind of step in the Cloud Dataflow job.
1430 &quot;properties&quot;: { # Named properties associated with the step. Each kind of
Sai Cheemalapati4ba8c232017-06-06 18:46:08 -04001431 # predefined step has its own required set of properties.
1432 # Must be provided on Create. Only retrieved with JOB_VIEW_ALL.
Bu Sun Kim65020912020-05-20 12:08:20 -07001433 &quot;a_key&quot;: &quot;&quot;, # Properties of the object.
Sai Cheemalapati4ba8c232017-06-06 18:46:08 -04001434 },
Sai Cheemalapati4ba8c232017-06-06 18:46:08 -04001435 },
1436 ],
Bu Sun Kim65020912020-05-20 12:08:20 -07001437 &quot;replacedByJobId&quot;: &quot;A String&quot;, # If another job is an update of this job (and thus, this job is in
1438 # `JOB_STATE_UPDATED`), this field contains the ID of that job.
1439 &quot;executionInfo&quot;: { # Additional information about how a Cloud Dataflow job will be executed that # Deprecated.
1440 # isn&#x27;t contained in the submitted job.
1441 &quot;stages&quot;: { # A mapping from each stage to the information about that stage.
1442 &quot;a_key&quot;: { # Contains information about how a particular
1443 # google.dataflow.v1beta3.Step will be executed.
1444 &quot;stepName&quot;: [ # The steps associated with the execution stage.
1445 # Note that stages may have several steps, and that a given step
1446 # might be run by more than one stage.
1447 &quot;A String&quot;,
1448 ],
1449 },
1450 },
1451 },
1452 &quot;currentState&quot;: &quot;A String&quot;, # The current state of the job.
Bu Sun Kim715bd7f2019-06-14 16:50:42 -07001453 #
1454 # Jobs are created in the `JOB_STATE_STOPPED` state unless otherwise
1455 # specified.
1456 #
1457 # A job in the `JOB_STATE_RUNNING` state may asynchronously enter a
1458 # terminal state. After a job has reached a terminal state, no
1459 # further state updates may be made.
1460 #
1461 # This field may be mutated by the Cloud Dataflow service;
1462 # callers cannot mutate it.
Bu Sun Kim65020912020-05-20 12:08:20 -07001463 &quot;location&quot;: &quot;A String&quot;, # The [regional endpoint]
1464 # (https://cloud.google.com/dataflow/docs/concepts/regional-endpoints) that
1465 # contains this job.
1466 &quot;startTime&quot;: &quot;A String&quot;, # The timestamp when the job was started (transitioned to JOB_STATE_PENDING).
1467 # Flexible resource scheduling jobs are started with some delay after job
1468 # creation, so start_time is unset before start and is updated when the
1469 # job is started by the Cloud Dataflow service. For other jobs, start_time
1470 # always equals to create_time and is immutable and set by the Cloud Dataflow
1471 # service.
1472 &quot;stepsLocation&quot;: &quot;A String&quot;, # The GCS location where the steps are stored.
1473 &quot;labels&quot;: { # User-defined labels for this job.
1474 #
1475 # The labels map can contain no more than 64 entries. Entries of the labels
1476 # map are UTF8 strings that comply with the following restrictions:
1477 #
1478 # * Keys must conform to regexp: \p{Ll}\p{Lo}{0,62}
1479 # * Values must conform to regexp: [\p{Ll}\p{Lo}\p{N}_-]{0,63}
1480 # * Both keys and values are additionally constrained to be &lt;= 128 bytes in
1481 # size.
1482 &quot;a_key&quot;: &quot;A String&quot;,
Sai Cheemalapati4ba8c232017-06-06 18:46:08 -04001483 },
Bu Sun Kim65020912020-05-20 12:08:20 -07001484 &quot;createTime&quot;: &quot;A String&quot;, # The timestamp when the job was initially created. Immutable and set by the
1485 # Cloud Dataflow service.
1486 &quot;requestedState&quot;: &quot;A String&quot;, # The job&#x27;s requested state.
1487 #
1488 # `UpdateJob` may be used to switch between the `JOB_STATE_STOPPED` and
1489 # `JOB_STATE_RUNNING` states, by setting requested_state. `UpdateJob` may
1490 # also be used to directly set a job&#x27;s requested state to
1491 # `JOB_STATE_CANCELLED` or `JOB_STATE_DONE`, irrevocably terminating the
1492 # job if it has not already reached a terminal state.
Sai Cheemalapati4ba8c232017-06-06 18:46:08 -04001493 },
Sai Cheemalapati4ba8c232017-06-06 18:46:08 -04001494 }</pre>
1495</div>
1496
1497</body></html>