<html><body>
<style>

body, h1, h2, h3, div, span, p, pre, a {
  margin: 0;
  padding: 0;
  border: 0;
  font-weight: inherit;
  font-style: inherit;
  font-size: 100%;
  font-family: inherit;
  vertical-align: baseline;
}

body {
  font-size: 13px;
  padding: 1em;
}

h1 {
  font-size: 26px;
  margin-bottom: 1em;
}

h2 {
  font-size: 24px;
  margin-bottom: 1em;
}

h3 {
  font-size: 20px;
  margin-bottom: 1em;
  margin-top: 1em;
}

pre, code {
  line-height: 1.5;
  font-family: Monaco, 'DejaVu Sans Mono', 'Bitstream Vera Sans Mono', 'Lucida Console', monospace;
}

pre {
  margin-top: 0.5em;
}

h1, h2, h3, p {
  font-family: Arial, sans-serif;
}

h1, h2, h3 {
  border-bottom: solid #CCC 1px;
}

.toc_element {
  margin-top: 0.5em;
}

.firstline {
  margin-left: 2em;
}

.method {
  margin-top: 1em;
  border: solid 1px #CCC;
  padding: 1em;
  background: #EEE;
}

.details {
  font-weight: bold;
  font-size: 14px;
}

</style>

<h1><a href="dataflow_v1b3.html">Dataflow API</a> . <a href="dataflow_v1b3.projects.html">projects</a> . <a href="dataflow_v1b3.projects.locations.html">locations</a> . <a href="dataflow_v1b3.projects.locations.jobs.html">jobs</a></h1>
<h2>Instance Methods</h2>
<p class="toc_element">
  <code><a href="dataflow_v1b3.projects.locations.jobs.debug.html">debug()</a></code>
</p>
<p class="firstline">Returns the debug Resource.</p>

<p class="toc_element">
  <code><a href="dataflow_v1b3.projects.locations.jobs.messages.html">messages()</a></code>
</p>
<p class="firstline">Returns the messages Resource.</p>

<p class="toc_element">
  <code><a href="dataflow_v1b3.projects.locations.jobs.snapshots.html">snapshots()</a></code>
</p>
<p class="firstline">Returns the snapshots Resource.</p>

<p class="toc_element">
  <code><a href="dataflow_v1b3.projects.locations.jobs.workItems.html">workItems()</a></code>
</p>
<p class="firstline">Returns the workItems Resource.</p>

<p class="toc_element">
  <code><a href="#create">create(projectId, location, body=None, view=None, replaceJobId=None, x__xgafv=None)</a></code></p>
<p class="firstline">Creates a Cloud Dataflow job.</p>
<p class="toc_element">
  <code><a href="#get">get(projectId, location, jobId, view=None, x__xgafv=None)</a></code></p>
<p class="firstline">Gets the state of the specified Cloud Dataflow job.</p>
<p class="toc_element">
  <code><a href="#getMetrics">getMetrics(projectId, location, jobId, startTime=None, x__xgafv=None)</a></code></p>
<p class="firstline">Request the job status.</p>
<p class="toc_element">
  <code><a href="#list">list(projectId, location, pageToken=None, view=None, pageSize=None, filter=None, x__xgafv=None)</a></code></p>
<p class="firstline">List the jobs of a project.</p>
<p class="toc_element">
  <code><a href="#list_next">list_next(previous_request, previous_response)</a></code></p>
<p class="firstline">Retrieves the next page of results.</p>
<p class="toc_element">
  <code><a href="#snapshot">snapshot(projectId, location, jobId, body=None, x__xgafv=None)</a></code></p>
<p class="firstline">Snapshot the state of a streaming job.</p>
<p class="toc_element">
  <code><a href="#update">update(projectId, location, jobId, body=None, x__xgafv=None)</a></code></p>
<p class="firstline">Updates the state of an existing Cloud Dataflow job.</p>
<h3>Method Details</h3>
<div class="method">
    <code class="details" id="create">create(projectId, location, body=None, view=None, replaceJobId=None, x__xgafv=None)</code>
  <pre>Creates a Cloud Dataflow job.

To create a job, we recommend using `projects.locations.jobs.create` with a
[regional endpoint]
(https://cloud.google.com/dataflow/docs/concepts/regional-endpoints). Using
`projects.jobs.create` is not recommended, as your job will always start
in `us-central1`.
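
A minimal sketch of the recommended regional call, assuming a `service` object
built with `googleapiclient.discovery.build("dataflow", "v1b3")` and default
credentials. The helper names `build_job_body` and `create_job`, and the
project, region, and bucket values, are illustrative and not part of the API.
The body uses only the `projectId`, `stepsLocation`, and
`environment.tempStoragePrefix` fields from the request body documented under
Args. Whether the service accepts a body this small depends on the job; in
practice the body is usually generated by an SDK.

```python
def build_job_body(project_id, steps_location, temp_storage_prefix):
    """Build a small request body for jobs.create (hypothetical helper)."""
    return {
        "projectId": project_id,
        # The body carries stepsLocation here instead of inline steps.
        "stepsLocation": steps_location,
        "environment": {"tempStoragePrefix": temp_storage_prefix},
    }


def create_job(service, project_id, location, body):
    """Call projects.locations.jobs.create so the job starts in `location`."""
    request = service.projects().locations().jobs().create(
        projectId=project_id, location=location, body=body
    )
    return request.execute()


body = build_job_body(
    "my-project",  # assumed project ID
    "gs://my-bucket/steps.json",  # assumed location of the stored steps
    "storage.googleapis.com/my-bucket/tmp",  # assumed temp storage prefix
)
```

With real credentials, `create_job(service, "my-project", "us-west1", body)`
sends the request through `projects.locations.jobs.create` and returns the
created job resource as a dict.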

Args:
  projectId: string, The ID of the Cloud Platform project that the job belongs to. (required)
  location: string, The [regional endpoint]
(https://cloud.google.com/dataflow/docs/concepts/regional-endpoints) that
contains this job. (required)
  body: object, The request body.
    The object takes the form of:

{ # Defines a job to be run by the Cloud Dataflow service.
    &quot;pipelineDescription&quot;: { # A descriptive representation of the submitted pipeline as well as its executed
        # form. This data is provided by the Dataflow service for ease of visualizing
        # the pipeline and interpreting Dataflow provided metrics.
        #
        # Preliminary field: The format of this data may change at any time.
        # A description of the user pipeline and stages through which it is executed.
        # Created by Cloud Dataflow service. Only retrieved with
        # JOB_VIEW_DESCRIPTION or JOB_VIEW_ALL.
144 &quot;displayData&quot;: [ # Pipeline level display data.
145 { # Data provided with a pipeline or transform to provide descriptive info.
146 &quot;url&quot;: &quot;A String&quot;, # An optional full URL.
147 &quot;javaClassValue&quot;: &quot;A String&quot;, # Contains value if the data is of java class type.
148 &quot;timestampValue&quot;: &quot;A String&quot;, # Contains value if the data is of timestamp type.
149 &quot;durationValue&quot;: &quot;A String&quot;, # Contains value if the data is of duration type.
150 &quot;label&quot;: &quot;A String&quot;, # An optional label to display in a dax UI for the element.
151 &quot;key&quot;: &quot;A String&quot;, # The key identifying the display data.
152 # This is intended to be used as a label for the display data
153 # when viewed in a dax monitoring system.
154 &quot;namespace&quot;: &quot;A String&quot;, # The namespace for the key. This is usually a class name or programming
              # language namespace (e.g. a Python module) which defines the display data.
156 # This allows a dax monitoring system to specially handle the data
157 # and perform custom rendering.
158 &quot;floatValue&quot;: 3.14, # Contains value if the data is of float type.
159 &quot;strValue&quot;: &quot;A String&quot;, # Contains value if the data is of string type.
160 &quot;int64Value&quot;: &quot;A String&quot;, # Contains value if the data is of int64 type.
161 &quot;boolValue&quot;: True or False, # Contains value if the data is of a boolean type.
162 &quot;shortStrValue&quot;: &quot;A String&quot;, # A possible additional shorter value to display.
163 # For example a java_class_name_value of com.mypackage.MyDoFn
164 # will be stored with MyDoFn as the short_str_value and
165 # com.mypackage.MyDoFn as the java_class_name value.
166 # short_str_value can be displayed and java_class_name_value
167 # will be displayed as a tooltip.
Bu Sun Kim4ed7d3f2020-05-27 12:20:54 -0700168 },
Bu Sun Kimd059ad82020-07-22 17:02:09 -0700169 ],
170 &quot;originalPipelineTransform&quot;: [ # Description of each transform in the pipeline and collections between them.
171 { # Description of the type, names/ids, and input/outputs for a transform.
172 &quot;outputCollectionName&quot;: [ # User names for all collection outputs to this transform.
Bu Sun Kim4ed7d3f2020-05-27 12:20:54 -0700173 &quot;A String&quot;,
174 ],
Bu Sun Kimd059ad82020-07-22 17:02:09 -0700175 &quot;displayData&quot;: [ # Transform-specific display data.
176 { # Data provided with a pipeline or transform to provide descriptive info.
177 &quot;url&quot;: &quot;A String&quot;, # An optional full URL.
178 &quot;javaClassValue&quot;: &quot;A String&quot;, # Contains value if the data is of java class type.
179 &quot;timestampValue&quot;: &quot;A String&quot;, # Contains value if the data is of timestamp type.
180 &quot;durationValue&quot;: &quot;A String&quot;, # Contains value if the data is of duration type.
181 &quot;label&quot;: &quot;A String&quot;, # An optional label to display in a dax UI for the element.
182 &quot;key&quot;: &quot;A String&quot;, # The key identifying the display data.
183 # This is intended to be used as a label for the display data
184 # when viewed in a dax monitoring system.
185 &quot;namespace&quot;: &quot;A String&quot;, # The namespace for the key. This is usually a class name or programming
                  # language namespace (e.g. a Python module) which defines the display data.
187 # This allows a dax monitoring system to specially handle the data
188 # and perform custom rendering.
189 &quot;floatValue&quot;: 3.14, # Contains value if the data is of float type.
190 &quot;strValue&quot;: &quot;A String&quot;, # Contains value if the data is of string type.
191 &quot;int64Value&quot;: &quot;A String&quot;, # Contains value if the data is of int64 type.
192 &quot;boolValue&quot;: True or False, # Contains value if the data is of a boolean type.
193 &quot;shortStrValue&quot;: &quot;A String&quot;, # A possible additional shorter value to display.
194 # For example a java_class_name_value of com.mypackage.MyDoFn
195 # will be stored with MyDoFn as the short_str_value and
196 # com.mypackage.MyDoFn as the java_class_name value.
197 # short_str_value can be displayed and java_class_name_value
198 # will be displayed as a tooltip.
199 },
200 ],
201 &quot;id&quot;: &quot;A String&quot;, # SDK generated id of this transform instance.
202 &quot;inputCollectionName&quot;: [ # User names for all collection inputs to this transform.
203 &quot;A String&quot;,
204 ],
205 &quot;name&quot;: &quot;A String&quot;, # User provided name for this transform instance.
206 &quot;kind&quot;: &quot;A String&quot;, # Type of transform.
Bu Sun Kim4ed7d3f2020-05-27 12:20:54 -0700207 },
Bu Sun Kimd059ad82020-07-22 17:02:09 -0700208 ],
209 &quot;executionPipelineStage&quot;: [ # Description of each stage of execution of the pipeline.
210 { # Description of the composing transforms, names/ids, and input/outputs of a
211 # stage of execution. Some composing transforms and sources may have been
212 # generated by the Dataflow service during execution planning.
213 &quot;componentSource&quot;: [ # Collections produced and consumed by component transforms of this stage.
214 { # Description of an interstitial value between transforms in an execution
215 # stage.
216 &quot;userName&quot;: &quot;A String&quot;, # Human-readable name for this transform; may be user or system generated.
217 &quot;name&quot;: &quot;A String&quot;, # Dataflow service generated name for this source.
218 &quot;originalTransformOrCollection&quot;: &quot;A String&quot;, # User name for the original user transform or collection with which this
219 # source is most closely associated.
220 },
221 ],
222 &quot;inputSource&quot;: [ # Input sources for this stage.
223 { # Description of an input or output of an execution stage.
224 &quot;userName&quot;: &quot;A String&quot;, # Human-readable name for this source; may be user or system generated.
225 &quot;originalTransformOrCollection&quot;: &quot;A String&quot;, # User name for the original user transform or collection with which this
226 # source is most closely associated.
227 &quot;sizeBytes&quot;: &quot;A String&quot;, # Size of the source, if measurable.
228 &quot;name&quot;: &quot;A String&quot;, # Dataflow service generated name for this source.
229 },
230 ],
231 &quot;name&quot;: &quot;A String&quot;, # Dataflow service generated name for this stage.
232 &quot;componentTransform&quot;: [ # Transforms that comprise this execution stage.
233 { # Description of a transform executed as part of an execution stage.
234 &quot;name&quot;: &quot;A String&quot;, # Dataflow service generated name for this source.
235 &quot;userName&quot;: &quot;A String&quot;, # Human-readable name for this transform; may be user or system generated.
236 &quot;originalTransform&quot;: &quot;A String&quot;, # User name for the original user transform with which this transform is
237 # most closely associated.
238 },
239 ],
240 &quot;id&quot;: &quot;A String&quot;, # Dataflow service generated id for this stage.
241 &quot;outputSource&quot;: [ # Output sources for this stage.
242 { # Description of an input or output of an execution stage.
243 &quot;userName&quot;: &quot;A String&quot;, # Human-readable name for this source; may be user or system generated.
244 &quot;originalTransformOrCollection&quot;: &quot;A String&quot;, # User name for the original user transform or collection with which this
245 # source is most closely associated.
246 &quot;sizeBytes&quot;: &quot;A String&quot;, # Size of the source, if measurable.
247 &quot;name&quot;: &quot;A String&quot;, # Dataflow service generated name for this source.
248 },
249 ],
          &quot;kind&quot;: &quot;A String&quot;, # Type of transform this stage is executing.
Bu Sun Kim4ed7d3f2020-05-27 12:20:54 -0700251 },
Bu Sun Kimd059ad82020-07-22 17:02:09 -0700252 ],
Bu Sun Kim65020912020-05-20 12:08:20 -0700253 },
Bu Sun Kimd059ad82020-07-22 17:02:09 -0700254 &quot;labels&quot;: { # User-defined labels for this job.
255 #
256 # The labels map can contain no more than 64 entries. Entries of the labels
257 # map are UTF8 strings that comply with the following restrictions:
258 #
259 # * Keys must conform to regexp: \p{Ll}\p{Lo}{0,62}
260 # * Values must conform to regexp: [\p{Ll}\p{Lo}\p{N}_-]{0,63}
261 # * Both keys and values are additionally constrained to be &lt;= 128 bytes in
262 # size.
Bu Sun Kim65020912020-05-20 12:08:20 -0700263 &quot;a_key&quot;: &quot;A String&quot;,
Jon Wayne Parrott692617a2017-01-06 09:58:29 -0800264 },
Bu Sun Kimd059ad82020-07-22 17:02:09 -0700265 &quot;projectId&quot;: &quot;A String&quot;, # The ID of the Cloud Platform project that the job belongs to.
Bu Sun Kim65020912020-05-20 12:08:20 -0700266 &quot;environment&quot;: { # Describes the environment in which a Dataflow Job runs. # The environment for the job.
Bu Sun Kimd059ad82020-07-22 17:02:09 -0700267 &quot;flexResourceSchedulingGoal&quot;: &quot;A String&quot;, # Which Flexible Resource Scheduling mode to run in.
Bu Sun Kim65020912020-05-20 12:08:20 -0700268 &quot;workerRegion&quot;: &quot;A String&quot;, # The Compute Engine region
269 # (https://cloud.google.com/compute/docs/regions-zones/regions-zones) in
270 # which worker processing should occur, e.g. &quot;us-west1&quot;. Mutually exclusive
271 # with worker_zone. If neither worker_region nor worker_zone is specified,
272 # default to the control plane&#x27;s region.
Bu Sun Kimd059ad82020-07-22 17:02:09 -0700273 &quot;userAgent&quot;: { # A description of the process that generated the request.
274 &quot;a_key&quot;: &quot;&quot;, # Properties of the object.
275 },
276 &quot;serviceAccountEmail&quot;: &quot;A String&quot;, # Identity to run virtual machines as. Defaults to the default account.
277 &quot;version&quot;: { # A structure describing which components and their versions of the service
278 # are required in order to run the job.
279 &quot;a_key&quot;: &quot;&quot;, # Properties of the object.
280 },
Bu Sun Kim65020912020-05-20 12:08:20 -0700281 &quot;serviceKmsKeyName&quot;: &quot;A String&quot;, # If set, contains the Cloud KMS key identifier used to encrypt data
282 # at rest, AKA a Customer Managed Encryption Key (CMEK).
283 #
284 # Format:
285 # projects/PROJECT_ID/locations/LOCATION/keyRings/KEY_RING/cryptoKeys/KEY
Bu Sun Kimd059ad82020-07-22 17:02:09 -0700286 &quot;experiments&quot;: [ # The list of experiments to enable.
287 &quot;A String&quot;,
288 ],
Bu Sun Kim65020912020-05-20 12:08:20 -0700289 &quot;workerZone&quot;: &quot;A String&quot;, # The Compute Engine zone
290 # (https://cloud.google.com/compute/docs/regions-zones/regions-zones) in
291 # which worker processing should occur, e.g. &quot;us-west1-a&quot;. Mutually exclusive
292 # with worker_region. If neither worker_region nor worker_zone is specified,
293 # a zone in the control plane&#x27;s region is chosen based on available capacity.
Bu Sun Kim4ed7d3f2020-05-27 12:20:54 -0700294 &quot;workerPools&quot;: [ # The worker pools. At least one &quot;harness&quot; worker pool must be
295 # specified in order for the job to have workers.
296 { # Describes one particular pool of Cloud Dataflow workers to be
297 # instantiated by the Cloud Dataflow service in order to perform the
298 # computations required by a job. Note that a workflow job may use
299 # multiple pools, in order to match the various computational
300 # requirements of the various stages of the job.
Bu Sun Kimd059ad82020-07-22 17:02:09 -0700301 &quot;onHostMaintenance&quot;: &quot;A String&quot;, # The action to take on host maintenance, as defined by the Google
302 # Compute Engine API.
303 &quot;sdkHarnessContainerImages&quot;: [ # Set of SDK harness containers needed to execute this pipeline. This will
304 # only be set in the Fn API path. For non-cross-language pipelines this
305 # should have only one entry. Cross-language pipelines will have two or more
306 # entries.
307 { # Defines a SDK harness container for executing Dataflow pipelines.
308 &quot;containerImage&quot;: &quot;A String&quot;, # A docker container image that resides in Google Container Registry.
309 &quot;useSingleCorePerContainer&quot;: True or False, # If true, recommends the Dataflow service to use only one core per SDK
310 # container instance with this image. If false (or unset) recommends using
311 # more than one core per SDK container instance with this image for
312 # efficiency. Note that Dataflow service may choose to override this property
313 # if needed.
314 },
315 ],
Bu Sun Kim4ed7d3f2020-05-27 12:20:54 -0700316 &quot;zone&quot;: &quot;A String&quot;, # Zone to run the worker pools in. If empty or unspecified, the service
317 # will attempt to choose a reasonable default.
Bu Sun Kimd059ad82020-07-22 17:02:09 -0700318 &quot;kind&quot;: &quot;A String&quot;, # The kind of the worker pool; currently only `harness` and `shuffle`
319 # are supported.
320 &quot;metadata&quot;: { # Metadata to set on the Google Compute Engine VMs.
321 &quot;a_key&quot;: &quot;A String&quot;,
322 },
Bu Sun Kim4ed7d3f2020-05-27 12:20:54 -0700323 &quot;diskSourceImage&quot;: &quot;A String&quot;, # Fully qualified source image for disks.
Bu Sun Kimd059ad82020-07-22 17:02:09 -0700324 &quot;dataDisks&quot;: [ # Data disks that are used by a VM in this workflow.
325 { # Describes the data disk used by a workflow job.
326 &quot;sizeGb&quot;: 42, # Size of disk in GB. If zero or unspecified, the service will
327 # attempt to choose a reasonable default.
328 &quot;diskType&quot;: &quot;A String&quot;, # Disk storage type, as defined by Google Compute Engine. This
329 # must be a disk type appropriate to the project and zone in which
330 # the workers will run. If unknown or unspecified, the service
331 # will attempt to choose a reasonable default.
332 #
333 # For example, the standard persistent disk type is a resource name
334 # typically ending in &quot;pd-standard&quot;. If SSD persistent disks are
335 # available, the resource name typically ends with &quot;pd-ssd&quot;. The
                  # actual valid values are defined by the Google Compute Engine API,
337 # not by the Cloud Dataflow API; consult the Google Compute Engine
338 # documentation for more information about determining the set of
339 # available disk types for a particular project and zone.
340 #
341 # Google Compute Engine Disk types are local to a particular
342 # project in a particular zone, and so the resource name will
343 # typically look something like this:
344 #
345 # compute.googleapis.com/projects/project-id/zones/zone/diskTypes/pd-standard
346 &quot;mountPoint&quot;: &quot;A String&quot;, # Directory in a VM where disk is mounted.
347 },
348 ],
Bu Sun Kim4ed7d3f2020-05-27 12:20:54 -0700349 &quot;packages&quot;: [ # Packages to be installed on workers.
350 { # The packages that must be installed in order for a worker to run the
351 # steps of the Cloud Dataflow job that will be assigned to its worker
352 # pool.
353 #
354 # This is the mechanism by which the Cloud Dataflow SDK causes code to
355 # be loaded onto the workers. For example, the Cloud Dataflow Java SDK
356 # might use this to install jars containing the user&#x27;s code and all of the
357 # various dependencies (libraries, data files, etc.) required in order
358 # for that code to run.
359 &quot;name&quot;: &quot;A String&quot;, # The name of the package.
360 &quot;location&quot;: &quot;A String&quot;, # The resource to read the package from. The supported resource type is:
361 #
362 # Google Cloud Storage:
363 #
364 # storage.googleapis.com/{bucket}
365 # bucket.storage.googleapis.com/
366 },
367 ],
          &quot;teardownPolicy&quot;: &quot;A String&quot;, # Sets the policy for determining when to turn down the worker pool.
369 # Allowed values are: `TEARDOWN_ALWAYS`, `TEARDOWN_ON_SUCCESS`, and
370 # `TEARDOWN_NEVER`.
371 # `TEARDOWN_ALWAYS` means workers are always torn down regardless of whether
372 # the job succeeds. `TEARDOWN_ON_SUCCESS` means workers are torn down
373 # if the job succeeds. `TEARDOWN_NEVER` means the workers are never torn
374 # down.
375 #
376 # If the workers are not torn down by the service, they will
377 # continue to run and use Google Compute Engine VM resources in the
378 # user&#x27;s project until they are explicitly terminated by the user.
379 # Because of this, Google recommends using the `TEARDOWN_ALWAYS`
380 # policy except for small, manually supervised test jobs.
381 #
382 # If unknown or unspecified, the service will attempt to choose a reasonable
383 # default.
Bu Sun Kimd059ad82020-07-22 17:02:09 -0700384 &quot;network&quot;: &quot;A String&quot;, # Network to which VMs will be assigned. If empty or unspecified,
385 # the service will use the network &quot;default&quot;.
386 &quot;ipConfiguration&quot;: &quot;A String&quot;, # Configuration for VM IPs.
387 &quot;diskSizeGb&quot;: 42, # Size of root disk for VMs, in GB. If zero or unspecified, the service will
388 # attempt to choose a reasonable default.
389 &quot;autoscalingSettings&quot;: { # Settings for WorkerPool autoscaling. # Settings for autoscaling of this WorkerPool.
390 &quot;maxNumWorkers&quot;: 42, # The maximum number of workers to cap scaling at.
391 &quot;algorithm&quot;: &quot;A String&quot;, # The algorithm to use for autoscaling.
392 },
Bu Sun Kim4ed7d3f2020-05-27 12:20:54 -0700393 &quot;poolArgs&quot;: { # Extra arguments for this worker pool.
394 &quot;a_key&quot;: &quot;&quot;, # Properties of the object. Contains field @type with type URL.
395 },
Bu Sun Kimd059ad82020-07-22 17:02:09 -0700396 &quot;subnetwork&quot;: &quot;A String&quot;, # Subnetwork to which VMs will be assigned, if desired. Expected to be of
397 # the form &quot;regions/REGION/subnetworks/SUBNETWORK&quot;.
398 &quot;numWorkers&quot;: 42, # Number of Google Compute Engine workers in this pool needed to
399 # execute the job. If zero or unspecified, the service will
Bu Sun Kim4ed7d3f2020-05-27 12:20:54 -0700400 # attempt to choose a reasonable default.
Bu Sun Kimd059ad82020-07-22 17:02:09 -0700401 &quot;numThreadsPerWorker&quot;: 42, # The number of threads per worker harness. If empty or unspecified, the
402 # service will choose a number of threads (according to the number of cores
403 # on the selected machine type for batch, or 1 by convention for streaming).
Bu Sun Kim4ed7d3f2020-05-27 12:20:54 -0700404 &quot;workerHarnessContainerImage&quot;: &quot;A String&quot;, # Required. Docker container image that executes the Cloud Dataflow worker
405 # harness, residing in Google Container Registry.
406 #
407 # Deprecated for the Fn API path. Use sdk_harness_container_images instead.
Bu Sun Kim4ed7d3f2020-05-27 12:20:54 -0700408 &quot;taskrunnerSettings&quot;: { # Taskrunner configuration settings. # Settings passed through to Google Compute Engine workers when
409 # using the standard Dataflow task runner. Users should ignore
410 # this field.
Bu Sun Kimd059ad82020-07-22 17:02:09 -0700411 &quot;dataflowApiVersion&quot;: &quot;A String&quot;, # The API version of endpoint, e.g. &quot;v1b3&quot;
Bu Sun Kim4ed7d3f2020-05-27 12:20:54 -0700412 &quot;oauthScopes&quot;: [ # The OAuth2 scopes to be requested by the taskrunner in order to
413 # access the Cloud Dataflow API.
414 &quot;A String&quot;,
415 ],
Bu Sun Kim4ed7d3f2020-05-27 12:20:54 -0700416 &quot;baseUrl&quot;: &quot;A String&quot;, # The base URL for the taskrunner to use when accessing Google Cloud APIs.
417 #
418 # When workers access Google Cloud APIs, they logically do so via
419 # relative URLs. If this field is specified, it supplies the base
420 # URL to use for resolving these relative URLs. The normative
421 # algorithm used is defined by RFC 1808, &quot;Relative Uniform Resource
422 # Locators&quot;.
423 #
424 # If not specified, the default value is &quot;http://www.googleapis.com/&quot;
Bu Sun Kimd059ad82020-07-22 17:02:09 -0700425 &quot;workflowFileName&quot;: &quot;A String&quot;, # The file to store the workflow in.
Bu Sun Kim4ed7d3f2020-05-27 12:20:54 -0700426 &quot;logToSerialconsole&quot;: True or False, # Whether to send taskrunner log info to Google Compute Engine VM serial
427 # console.
Bu Sun Kimd059ad82020-07-22 17:02:09 -0700428 &quot;baseTaskDir&quot;: &quot;A String&quot;, # The location on the worker for task-specific subdirectories.
429 &quot;taskUser&quot;: &quot;A String&quot;, # The UNIX user ID on the worker VM to use for tasks launched by
430 # taskrunner; e.g. &quot;root&quot;.
431 &quot;vmId&quot;: &quot;A String&quot;, # The ID string of the VM.
432 &quot;alsologtostderr&quot;: True or False, # Whether to also send taskrunner log info to stderr.
Bu Sun Kim4ed7d3f2020-05-27 12:20:54 -0700433 &quot;parallelWorkerSettings&quot;: { # Provides data to pass through to the worker harness. # The settings to pass to the parallel worker harness.
Bu Sun Kimd059ad82020-07-22 17:02:09 -0700434 &quot;shuffleServicePath&quot;: &quot;A String&quot;, # The Shuffle service path relative to the root URL, for example,
435 # &quot;shuffle/v1beta1&quot;.
Bu Sun Kim4ed7d3f2020-05-27 12:20:54 -0700436 &quot;tempStoragePrefix&quot;: &quot;A String&quot;, # The prefix of the resources the system should use for temporary
437 # storage.
438 #
439 # The supported resource type is:
440 #
441 # Google Cloud Storage:
442 #
443 # storage.googleapis.com/{bucket}/{object}
444 # bucket.storage.googleapis.com/{object}
445 &quot;reportingEnabled&quot;: True or False, # Whether to send work progress updates to the service.
Bu Sun Kimd059ad82020-07-22 17:02:09 -0700446 &quot;servicePath&quot;: &quot;A String&quot;, # The Cloud Dataflow service path relative to the root URL, for example,
447 # &quot;dataflow/v1b3/projects&quot;.
Bu Sun Kim4ed7d3f2020-05-27 12:20:54 -0700448 &quot;baseUrl&quot;: &quot;A String&quot;, # The base URL for accessing Google Cloud APIs.
449 #
450 # When workers access Google Cloud APIs, they logically do so via
451 # relative URLs. If this field is specified, it supplies the base
452 # URL to use for resolving these relative URLs. The normative
453 # algorithm used is defined by RFC 1808, &quot;Relative Uniform Resource
454 # Locators&quot;.
455 #
456 # If not specified, the default value is &quot;http://www.googleapis.com/&quot;
Bu Sun Kim4ed7d3f2020-05-27 12:20:54 -0700457 &quot;workerId&quot;: &quot;A String&quot;, # The ID of the worker running this pipeline.
458 },
Bu Sun Kimd059ad82020-07-22 17:02:09 -0700459 &quot;harnessCommand&quot;: &quot;A String&quot;, # The command to launch the worker harness.
460 &quot;logDir&quot;: &quot;A String&quot;, # The directory on the VM to store logs.
461 &quot;streamingWorkerMainClass&quot;: &quot;A String&quot;, # The streaming worker main class name.
462 &quot;languageHint&quot;: &quot;A String&quot;, # The suggested backend language.
463 &quot;taskGroup&quot;: &quot;A String&quot;, # The UNIX group ID on the worker VM to use for tasks launched by
464 # taskrunner; e.g. &quot;wheel&quot;.
465 &quot;logUploadLocation&quot;: &quot;A String&quot;, # Indicates where to put logs. If this is not specified, the logs
466 # will not be uploaded.
467 #
468 # The supported resource type is:
469 #
470 # Google Cloud Storage:
471 # storage.googleapis.com/{bucket}/{object}
472 # bucket.storage.googleapis.com/{object}
473 &quot;commandlinesFileName&quot;: &quot;A String&quot;, # The file to store preprocessing commands in.
474 &quot;continueOnException&quot;: True or False, # Whether to continue taskrunner if an exception is hit.
475 &quot;tempStoragePrefix&quot;: &quot;A String&quot;, # The prefix of the resources the taskrunner should use for
476 # temporary storage.
477 #
478 # The supported resource type is:
479 #
480 # Google Cloud Storage:
481 # storage.googleapis.com/{bucket}/{object}
482 # bucket.storage.googleapis.com/{object}
Bu Sun Kim4ed7d3f2020-05-27 12:20:54 -0700483 },
Bu Sun Kimd059ad82020-07-22 17:02:09 -0700484 &quot;diskType&quot;: &quot;A String&quot;, # Type of root disk for VMs. If empty or unspecified, the service will
485 # attempt to choose a reasonable default.
Bu Sun Kim4ed7d3f2020-05-27 12:20:54 -0700486 &quot;defaultPackageSet&quot;: &quot;A String&quot;, # The default package set to install. This allows the service to
487 # select a default set of packages which are useful to worker
488 # harnesses written in a particular language.
Bu Sun Kimd059ad82020-07-22 17:02:09 -0700489 &quot;machineType&quot;: &quot;A String&quot;, # Machine type (e.g. &quot;n1-standard-1&quot;). If empty or unspecified, the
490 # service will attempt to choose a reasonable default.
Bu Sun Kim4ed7d3f2020-05-27 12:20:54 -0700491 },
492 ],
Bu Sun Kimd059ad82020-07-22 17:02:09 -0700493 &quot;tempStoragePrefix&quot;: &quot;A String&quot;, # The prefix of the resources the system should use for temporary
        # storage. The system will append the suffix &quot;/temp-{JOBNAME}&quot; to
495 # this resource prefix, where {JOBNAME} is the value of the
496 # job_name field. The resulting bucket and object prefix is used
497 # as the prefix of the resources used to store temporary data
498 # needed during the job execution. NOTE: This will override the
499 # value in taskrunner_settings.
500 # The supported resource type is:
501 #
502 # Google Cloud Storage:
503 #
504 # storage.googleapis.com/{bucket}/{object}
505 # bucket.storage.googleapis.com/{object}
506 &quot;internalExperiments&quot;: { # Experimental settings.
507 &quot;a_key&quot;: &quot;&quot;, # Properties of the object. Contains field @type with type URL.
508 },
509 &quot;sdkPipelineOptions&quot;: { # The Cloud Dataflow SDK pipeline options specified by the user. These
510 # options are passed through the service and are used to recreate the
511 # SDK pipeline options on the worker in a language agnostic and platform
512 # independent way.
513 &quot;a_key&quot;: &quot;&quot;, # Properties of the object.
514 },
515 &quot;dataset&quot;: &quot;A String&quot;, # The dataset for the current project where various workflow
516 # related tables are stored.
517 #
518 # The supported resource type is:
519 #
520 # Google BigQuery:
521 # bigquery.googleapis.com/{dataset}
522 &quot;clusterManagerApiService&quot;: &quot;A String&quot;, # The type of cluster manager API to use. If unknown or
523 # unspecified, the service will attempt to choose a reasonable
524 # default. This should be in the form of the API service name,
525 # e.g. &quot;compute.googleapis.com&quot;.
526 },
527 &quot;stepsLocation&quot;: &quot;A String&quot;, # The GCS location where the steps are stored.
528 &quot;steps&quot;: [ # Exactly one of steps or steps_location should be specified.
529 #
530 # The top-level steps that constitute the entire job.
531 { # Defines a particular step within a Cloud Dataflow job.
532 #
533 # A job consists of multiple steps, each of which performs some
534 # specific operation as part of the overall job. Data is typically
535 # passed from one step to another as part of the job.
536 #
537 # Here&#x27;s an example of a sequence of steps which together implement a
538 # Map-Reduce job:
539 #
540 # * Read a collection of data from some source, parsing the
541 # collection&#x27;s elements.
542 #
543 # * Validate the elements.
544 #
545 # * Apply a user-defined function to map each element to some value
546 # and extract an element-specific key value.
547 #
548 # * Group elements with the same key into a single element with
549 # that key, transforming a multiply-keyed collection into a
550 # uniquely-keyed collection.
551 #
552 # * Write the elements out to some data sink.
553 #
554 # Note that the Cloud Dataflow service may be used to run many different
555 # types of jobs, not just Map-Reduce.
556 &quot;kind&quot;: &quot;A String&quot;, # The kind of step in the Cloud Dataflow job.
557 &quot;properties&quot;: { # Named properties associated with the step. Each kind of
558 # predefined step has its own required set of properties.
559 # Must be provided on Create. Only retrieved with JOB_VIEW_ALL.
560 &quot;a_key&quot;: &quot;&quot;, # Properties of the object.
561 },
562 &quot;name&quot;: &quot;A String&quot;, # The name that identifies the step. This must be unique for each
563 # step with respect to all other steps in the Cloud Dataflow job.
564 },
565 ],
566 &quot;stageStates&quot;: [ # This field may be mutated by the Cloud Dataflow service;
567 # callers cannot mutate it.
568 { # A message describing the state of a particular execution stage.
569 &quot;executionStageState&quot;: &quot;A String&quot;, # Execution stage states allow the same set of values as JobState.
570 &quot;executionStageName&quot;: &quot;A String&quot;, # The name of the execution stage.
571 &quot;currentStateTime&quot;: &quot;A String&quot;, # The time at which the stage transitioned to this state.
572 },
573 ],
574 &quot;replacedByJobId&quot;: &quot;A String&quot;, # If another job is an update of this job (and thus, this job is in
575 # `JOB_STATE_UPDATED`), this field contains the ID of that job.
576 &quot;jobMetadata&quot;: { # Metadata available primarily for filtering jobs. Will be included in the
577 # ListJob response and Job SUMMARY view. This field is populated by the
578 # Dataflow service to support filtering jobs by the metadata values provided
579 # here. Populated for ListJobs and all GetJob views SUMMARY and higher.
580 &quot;sdkVersion&quot;: { # The version of the SDK used to run the job.
581 &quot;sdkSupportStatus&quot;: &quot;A String&quot;, # The support status for this SDK version.
582 &quot;versionDisplayName&quot;: &quot;A String&quot;, # A readable string describing the version of the SDK.
583 &quot;version&quot;: &quot;A String&quot;, # The version of the SDK used to run the job.
584 },
585 &quot;bigTableDetails&quot;: [ # Identification of a BigTable source used in the Dataflow job.
586 { # Metadata for a BigTable connector used by the job.
587 &quot;instanceId&quot;: &quot;A String&quot;, # InstanceId accessed in the connection.
588 &quot;tableId&quot;: &quot;A String&quot;, # TableId accessed in the connection.
589 &quot;projectId&quot;: &quot;A String&quot;, # ProjectId accessed in the connection.
590 },
591 ],
592 &quot;pubsubDetails&quot;: [ # Identification of a PubSub source used in the Dataflow job.
593 { # Metadata for a PubSub connector used by the job.
594 &quot;subscription&quot;: &quot;A String&quot;, # Subscription used in the connection.
595 &quot;topic&quot;: &quot;A String&quot;, # Topic accessed in the connection.
596 },
597 ],
598 &quot;bigqueryDetails&quot;: [ # Identification of a BigQuery source used in the Dataflow job.
599 { # Metadata for a BigQuery connector used by the job.
600 &quot;dataset&quot;: &quot;A String&quot;, # Dataset accessed in the connection.
601 &quot;projectId&quot;: &quot;A String&quot;, # Project accessed in the connection.
602 &quot;query&quot;: &quot;A String&quot;, # Query used to access data in the connection.
603 &quot;table&quot;: &quot;A String&quot;, # Table accessed in the connection.
604 },
605 ],
606 &quot;fileDetails&quot;: [ # Identification of a File source used in the Dataflow job.
607 { # Metadata for a File connector used by the job.
608 &quot;filePattern&quot;: &quot;A String&quot;, # File Pattern used to access files by the connector.
609 },
610 ],
611 &quot;datastoreDetails&quot;: [ # Identification of a Datastore source used in the Dataflow job.
612 { # Metadata for a Datastore connector used by the job.
613 &quot;namespace&quot;: &quot;A String&quot;, # Namespace used in the connection.
614 &quot;projectId&quot;: &quot;A String&quot;, # ProjectId accessed in the connection.
615 },
616 ],
617 &quot;spannerDetails&quot;: [ # Identification of a Spanner source used in the Dataflow job.
618 { # Metadata for a Spanner connector used by the job.
619 &quot;instanceId&quot;: &quot;A String&quot;, # InstanceId accessed in the connection.
620 &quot;databaseId&quot;: &quot;A String&quot;, # DatabaseId accessed in the connection.
621 &quot;projectId&quot;: &quot;A String&quot;, # ProjectId accessed in the connection.
622 },
623 ],
624 },
625 &quot;location&quot;: &quot;A String&quot;, # The [regional endpoint]
626 # (https://cloud.google.com/dataflow/docs/concepts/regional-endpoints) that
627 # contains this job.
628 &quot;transformNameMapping&quot;: { # The map of transform name prefixes of the job to be replaced to the
629 # corresponding name prefixes of the new job.
630 &quot;a_key&quot;: &quot;A String&quot;,
631 },
632 &quot;startTime&quot;: &quot;A String&quot;, # The timestamp when the job was started (transitioned to JOB_STATE_PENDING).
633 # Flexible resource scheduling jobs are started with some delay after job
634 # creation, so start_time is unset before start and is updated when the
635 # job is started by the Cloud Dataflow service. For other jobs, start_time
636 # always equals create_time and is immutable and set by the Cloud Dataflow
637 # service.
638 &quot;clientRequestId&quot;: &quot;A String&quot;, # The client&#x27;s unique identifier of the job, re-used across retried attempts.
639 # If this field is set, the service will ensure its uniqueness.
640 # The request to create a job will fail if the service has knowledge of a
641 # previously submitted job with the same client&#x27;s ID and job name.
642 # The caller may use this field to ensure idempotence of job
643 # creation across retried attempts to create a job.
644 # By default, the field is empty and, in that case, the service ignores it.
645 &quot;executionInfo&quot;: { # Deprecated. Additional information about how a Cloud Dataflow job will be executed that
646 # isn&#x27;t contained in the submitted job.
647 &quot;stages&quot;: { # A mapping from each stage to the information about that stage.
648 &quot;a_key&quot;: { # Contains information about how a particular
649 # google.dataflow.v1beta3.Step will be executed.
650 &quot;stepName&quot;: [ # The steps associated with the execution stage.
651 # Note that stages may have several steps, and that a given step
652 # might be run by more than one stage.
653 &quot;A String&quot;,
654 ],
655 },
656 },
657 },
658 &quot;type&quot;: &quot;A String&quot;, # The type of Cloud Dataflow job.
659 &quot;createTime&quot;: &quot;A String&quot;, # The timestamp when the job was initially created. Immutable and set by the
660 # Cloud Dataflow service.
661 &quot;tempFiles&quot;: [ # A set of files the system should be aware of that are used
662 # for temporary storage. These temporary files will be
663 # removed on job completion.
664 # No duplicates are allowed.
665 # No file patterns are supported.
666 #
667 # The supported files are:
668 #
669 # Google Cloud Storage:
670 #
671 # storage.googleapis.com/{bucket}/{object}
672 # bucket.storage.googleapis.com/{object}
673 &quot;A String&quot;,
674 ],
675 &quot;id&quot;: &quot;A String&quot;, # The unique ID of this job.
676 #
677 # This field is set by the Cloud Dataflow service when the Job is
678 # created, and is immutable for the life of the job.
679 &quot;requestedState&quot;: &quot;A String&quot;, # The job&#x27;s requested state.
680 #
681 # `UpdateJob` may be used to switch between the `JOB_STATE_STOPPED` and
682 # `JOB_STATE_RUNNING` states, by setting requested_state. `UpdateJob` may
683 # also be used to directly set a job&#x27;s requested state to
684 # `JOB_STATE_CANCELLED` or `JOB_STATE_DONE`, irrevocably terminating the
685 # job if it has not already reached a terminal state.
686 &quot;replaceJobId&quot;: &quot;A String&quot;, # If this job is an update of an existing job, this field is the job ID
687 # of the job it replaced.
688 #
689 # When sending a `CreateJobRequest`, you can update a job by specifying it
690 # here. The job named here is stopped, and its intermediate state is
691 # transferred to this job.
692 &quot;createdFromSnapshotId&quot;: &quot;A String&quot;, # If this is specified, the job&#x27;s initial state is populated from the given
693 # snapshot.
694 &quot;currentState&quot;: &quot;A String&quot;, # The current state of the job.
695 #
696 # Jobs are created in the `JOB_STATE_STOPPED` state unless otherwise
697 # specified.
698 #
699 # A job in the `JOB_STATE_RUNNING` state may asynchronously enter a
700 # terminal state. After a job has reached a terminal state, no
701 # further state updates may be made.
702 #
703 # This field may be mutated by the Cloud Dataflow service;
704 # callers cannot mutate it.
705 &quot;name&quot;: &quot;A String&quot;, # The user-specified Cloud Dataflow job name.
706 #
707 # Only one Job with a given name may exist in a project at any
708 # given time. If a caller attempts to create a Job with the same
709 # name as an already-existing Job, the attempt returns the
710 # existing Job.
711 #
712 # The name must match the regular expression
713 # `[a-z]([-a-z0-9]{0,38}[a-z0-9])?`
714 &quot;currentStateTime&quot;: &quot;A String&quot;, # The timestamp associated with the current state.
715 }
716
717 view: string, The level of information requested in response.
718 replaceJobId: string, Deprecated. This field is now in the Job message.
719 x__xgafv: string, V1 error format.
720 Allowed values
721 1 - v1 error format
722 2 - v2 error format
723
724Returns:
725 An object of the form:
726
727 { # Defines a job to be run by the Cloud Dataflow service.
728 &quot;pipelineDescription&quot;: { # A descriptive representation of the submitted pipeline as well as the executed
729 # form. This data is provided by the Dataflow service for ease of visualizing
730 # the pipeline and interpreting Dataflow provided metrics.
731 # Preliminary field: The format of this data may change at any time.
732 # A description of the user pipeline and stages through which it is executed.
733 # Created by Cloud Dataflow service. Only retrieved with JOB_VIEW_DESCRIPTION or JOB_VIEW_ALL.
734 &quot;displayData&quot;: [ # Pipeline level display data.
735 { # Data provided with a pipeline or transform to provide descriptive info.
736 &quot;url&quot;: &quot;A String&quot;, # An optional full URL.
737 &quot;javaClassValue&quot;: &quot;A String&quot;, # Contains value if the data is of java class type.
738 &quot;timestampValue&quot;: &quot;A String&quot;, # Contains value if the data is of timestamp type.
739 &quot;durationValue&quot;: &quot;A String&quot;, # Contains value if the data is of duration type.
740 &quot;label&quot;: &quot;A String&quot;, # An optional label to display in a dax UI for the element.
741 &quot;key&quot;: &quot;A String&quot;, # The key identifying the display data.
742 # This is intended to be used as a label for the display data
743 # when viewed in a dax monitoring system.
744 &quot;namespace&quot;: &quot;A String&quot;, # The namespace for the key. This is usually a class name or programming
745 # language namespace (e.g. a Python module) which defines the display data.
746 # This allows a dax monitoring system to specially handle the data
747 # and perform custom rendering.
748 &quot;floatValue&quot;: 3.14, # Contains value if the data is of float type.
749 &quot;strValue&quot;: &quot;A String&quot;, # Contains value if the data is of string type.
750 &quot;int64Value&quot;: &quot;A String&quot;, # Contains value if the data is of int64 type.
751 &quot;boolValue&quot;: True or False, # Contains value if the data is of a boolean type.
752 &quot;shortStrValue&quot;: &quot;A String&quot;, # A possible additional shorter value to display.
753 # For example a java_class_name_value of com.mypackage.MyDoFn
754 # will be stored with MyDoFn as the short_str_value and
755 # com.mypackage.MyDoFn as the java_class_name_value.
756 # short_str_value can be displayed and java_class_name_value
757 # will be displayed as a tooltip.
758 },
759 ],
760 &quot;originalPipelineTransform&quot;: [ # Description of each transform in the pipeline and collections between them.
761 { # Description of the type, names/ids, and input/outputs for a transform.
762 &quot;outputCollectionName&quot;: [ # User names for all collection outputs to this transform.
763 &quot;A String&quot;,
764 ],
765 &quot;displayData&quot;: [ # Transform-specific display data.
766 { # Data provided with a pipeline or transform to provide descriptive info.
767 &quot;url&quot;: &quot;A String&quot;, # An optional full URL.
768 &quot;javaClassValue&quot;: &quot;A String&quot;, # Contains value if the data is of java class type.
769 &quot;timestampValue&quot;: &quot;A String&quot;, # Contains value if the data is of timestamp type.
770 &quot;durationValue&quot;: &quot;A String&quot;, # Contains value if the data is of duration type.
771 &quot;label&quot;: &quot;A String&quot;, # An optional label to display in a dax UI for the element.
772 &quot;key&quot;: &quot;A String&quot;, # The key identifying the display data.
773 # This is intended to be used as a label for the display data
774 # when viewed in a dax monitoring system.
775 &quot;namespace&quot;: &quot;A String&quot;, # The namespace for the key. This is usually a class name or programming
776 # language namespace (e.g. a Python module) which defines the display data.
777 # This allows a dax monitoring system to specially handle the data
778 # and perform custom rendering.
779 &quot;floatValue&quot;: 3.14, # Contains value if the data is of float type.
780 &quot;strValue&quot;: &quot;A String&quot;, # Contains value if the data is of string type.
781 &quot;int64Value&quot;: &quot;A String&quot;, # Contains value if the data is of int64 type.
782 &quot;boolValue&quot;: True or False, # Contains value if the data is of a boolean type.
783 &quot;shortStrValue&quot;: &quot;A String&quot;, # A possible additional shorter value to display.
784 # For example a java_class_name_value of com.mypackage.MyDoFn
785 # will be stored with MyDoFn as the short_str_value and
786 # com.mypackage.MyDoFn as the java_class_name_value.
787 # short_str_value can be displayed and java_class_name_value
788 # will be displayed as a tooltip.
789 },
790 ],
791 &quot;id&quot;: &quot;A String&quot;, # SDK generated id of this transform instance.
792 &quot;inputCollectionName&quot;: [ # User names for all collection inputs to this transform.
793 &quot;A String&quot;,
794 ],
795 &quot;name&quot;: &quot;A String&quot;, # User provided name for this transform instance.
796 &quot;kind&quot;: &quot;A String&quot;, # Type of transform.
797 },
798 ],
799 &quot;executionPipelineStage&quot;: [ # Description of each stage of execution of the pipeline.
800 { # Description of the composing transforms, names/ids, and input/outputs of a
801 # stage of execution. Some composing transforms and sources may have been
802 # generated by the Dataflow service during execution planning.
803 &quot;componentSource&quot;: [ # Collections produced and consumed by component transforms of this stage.
804 { # Description of an interstitial value between transforms in an execution
805 # stage.
806 &quot;userName&quot;: &quot;A String&quot;, # Human-readable name for this transform; may be user or system generated.
807 &quot;name&quot;: &quot;A String&quot;, # Dataflow service generated name for this source.
808 &quot;originalTransformOrCollection&quot;: &quot;A String&quot;, # User name for the original user transform or collection with which this
809 # source is most closely associated.
810 },
811 ],
812 &quot;inputSource&quot;: [ # Input sources for this stage.
813 { # Description of an input or output of an execution stage.
814 &quot;userName&quot;: &quot;A String&quot;, # Human-readable name for this source; may be user or system generated.
815 &quot;originalTransformOrCollection&quot;: &quot;A String&quot;, # User name for the original user transform or collection with which this
816 # source is most closely associated.
817 &quot;sizeBytes&quot;: &quot;A String&quot;, # Size of the source, if measurable.
818 &quot;name&quot;: &quot;A String&quot;, # Dataflow service generated name for this source.
819 },
820 ],
821 &quot;name&quot;: &quot;A String&quot;, # Dataflow service generated name for this stage.
822 &quot;componentTransform&quot;: [ # Transforms that comprise this execution stage.
823 { # Description of a transform executed as part of an execution stage.
824 &quot;name&quot;: &quot;A String&quot;, # Dataflow service generated name for this source.
825 &quot;userName&quot;: &quot;A String&quot;, # Human-readable name for this transform; may be user or system generated.
826 &quot;originalTransform&quot;: &quot;A String&quot;, # User name for the original user transform with which this transform is
827 # most closely associated.
828 },
829 ],
830 &quot;id&quot;: &quot;A String&quot;, # Dataflow service generated id for this stage.
831 &quot;outputSource&quot;: [ # Output sources for this stage.
832 { # Description of an input or output of an execution stage.
833 &quot;userName&quot;: &quot;A String&quot;, # Human-readable name for this source; may be user or system generated.
834 &quot;originalTransformOrCollection&quot;: &quot;A String&quot;, # User name for the original user transform or collection with which this
835 # source is most closely associated.
836 &quot;sizeBytes&quot;: &quot;A String&quot;, # Size of the source, if measurable.
837 &quot;name&quot;: &quot;A String&quot;, # Dataflow service generated name for this source.
838 },
839 ],
840 &quot;kind&quot;: &quot;A String&quot;, # Type of transform this stage is executing.
841 },
842 ],
843 },
844 &quot;labels&quot;: { # User-defined labels for this job.
845 #
846 # The labels map can contain no more than 64 entries. Entries of the labels
847 # map are UTF8 strings that comply with the following restrictions:
848 #
849 # * Keys must conform to regexp: \p{Ll}\p{Lo}{0,62}
850 # * Values must conform to regexp: [\p{Ll}\p{Lo}\p{N}_-]{0,63}
851 # * Both keys and values are additionally constrained to be &lt;= 128 bytes in
852 # size.
853 &quot;a_key&quot;: &quot;A String&quot;,
854 },
855 &quot;projectId&quot;: &quot;A String&quot;, # The ID of the Cloud Platform project that the job belongs to.
856 &quot;environment&quot;: { # Describes the environment in which a Dataflow Job runs. # The environment for the job.
857 &quot;flexResourceSchedulingGoal&quot;: &quot;A String&quot;, # Which Flexible Resource Scheduling mode to run in.
858 &quot;workerRegion&quot;: &quot;A String&quot;, # The Compute Engine region
859 # (https://cloud.google.com/compute/docs/regions-zones/regions-zones) in
860 # which worker processing should occur, e.g. &quot;us-west1&quot;. Mutually exclusive
861 # with worker_zone. If neither worker_region nor worker_zone is specified,
862 # default to the control plane&#x27;s region.
863 &quot;userAgent&quot;: { # A description of the process that generated the request.
864 &quot;a_key&quot;: &quot;&quot;, # Properties of the object.
865 },
866 &quot;serviceAccountEmail&quot;: &quot;A String&quot;, # Identity to run virtual machines as. Defaults to the default account.
867 &quot;version&quot;: { # A structure describing which components and their versions of the service
868 # are required in order to run the job.
869 &quot;a_key&quot;: &quot;&quot;, # Properties of the object.
870 },
871 &quot;serviceKmsKeyName&quot;: &quot;A String&quot;, # If set, contains the Cloud KMS key identifier used to encrypt data
872 # at rest, AKA a Customer Managed Encryption Key (CMEK).
873 #
874 # Format:
875 # projects/PROJECT_ID/locations/LOCATION/keyRings/KEY_RING/cryptoKeys/KEY
876 &quot;experiments&quot;: [ # The list of experiments to enable.
877 &quot;A String&quot;,
878 ],
879 &quot;workerZone&quot;: &quot;A String&quot;, # The Compute Engine zone
880 # (https://cloud.google.com/compute/docs/regions-zones/regions-zones) in
881 # which worker processing should occur, e.g. &quot;us-west1-a&quot;. Mutually exclusive
882 # with worker_region. If neither worker_region nor worker_zone is specified,
883 # a zone in the control plane&#x27;s region is chosen based on available capacity.
884 &quot;workerPools&quot;: [ # The worker pools. At least one &quot;harness&quot; worker pool must be
885 # specified in order for the job to have workers.
886 { # Describes one particular pool of Cloud Dataflow workers to be
887 # instantiated by the Cloud Dataflow service in order to perform the
888 # computations required by a job. Note that a workflow job may use
889 # multiple pools, in order to match the various computational
890 # requirements of the various stages of the job.
891 &quot;onHostMaintenance&quot;: &quot;A String&quot;, # The action to take on host maintenance, as defined by the Google
892 # Compute Engine API.
893 &quot;sdkHarnessContainerImages&quot;: [ # Set of SDK harness containers needed to execute this pipeline. This will
894 # only be set in the Fn API path. For non-cross-language pipelines this
895 # should have only one entry. Cross-language pipelines will have two or more
896 # entries.
897 { # Defines an SDK harness container for executing Dataflow pipelines.
898 &quot;containerImage&quot;: &quot;A String&quot;, # A docker container image that resides in Google Container Registry.
899 &quot;useSingleCorePerContainer&quot;: True or False, # If true, recommends that the Dataflow service use only one core per SDK
900 # container instance with this image. If false (or unset) recommends using
901 # more than one core per SDK container instance with this image for
902 # efficiency. Note that Dataflow service may choose to override this property
903 # if needed.
904 },
905 ],
906 &quot;zone&quot;: &quot;A String&quot;, # Zone to run the worker pools in. If empty or unspecified, the service
907 # will attempt to choose a reasonable default.
908 &quot;kind&quot;: &quot;A String&quot;, # The kind of the worker pool; currently only `harness` and `shuffle`
909 # are supported.
910 &quot;metadata&quot;: { # Metadata to set on the Google Compute Engine VMs.
911 &quot;a_key&quot;: &quot;A String&quot;,
912 },
913 &quot;diskSourceImage&quot;: &quot;A String&quot;, # Fully qualified source image for disks.
914 &quot;dataDisks&quot;: [ # Data disks that are used by a VM in this workflow.
915 { # Describes the data disk used by a workflow job.
916 &quot;sizeGb&quot;: 42, # Size of disk in GB. If zero or unspecified, the service will
917 # attempt to choose a reasonable default.
918 &quot;diskType&quot;: &quot;A String&quot;, # Disk storage type, as defined by Google Compute Engine. This
919 # must be a disk type appropriate to the project and zone in which
920 # the workers will run. If unknown or unspecified, the service
921 # will attempt to choose a reasonable default.
922 #
923 # For example, the standard persistent disk type is a resource name
924 # typically ending in &quot;pd-standard&quot;. If SSD persistent disks are
925 # available, the resource name typically ends with &quot;pd-ssd&quot;. The
926 # actual valid values are defined by the Google Compute Engine API,
927 # not by the Cloud Dataflow API; consult the Google Compute Engine
928 # documentation for more information about determining the set of
929 # available disk types for a particular project and zone.
930 #
931 # Google Compute Engine Disk types are local to a particular
932 # project in a particular zone, and so the resource name will
933 # typically look something like this:
934 #
935 # compute.googleapis.com/projects/project-id/zones/zone/diskTypes/pd-standard
936 &quot;mountPoint&quot;: &quot;A String&quot;, # Directory in a VM where disk is mounted.
937 },
938 ],
939 &quot;packages&quot;: [ # Packages to be installed on workers.
940 { # The packages that must be installed in order for a worker to run the
941 # steps of the Cloud Dataflow job that will be assigned to its worker
942 # pool.
943 #
944 # This is the mechanism by which the Cloud Dataflow SDK causes code to
945 # be loaded onto the workers. For example, the Cloud Dataflow Java SDK
946 # might use this to install jars containing the user&#x27;s code and all of the
947 # various dependencies (libraries, data files, etc.) required in order
948 # for that code to run.
949 &quot;name&quot;: &quot;A String&quot;, # The name of the package.
950 &quot;location&quot;: &quot;A String&quot;, # The resource to read the package from. The supported resource type is:
951 #
952 # Google Cloud Storage:
953 #
954 # storage.googleapis.com/{bucket}
955 # bucket.storage.googleapis.com/
956 },
957 ],
958 &quot;teardownPolicy&quot;: &quot;A String&quot;, # Sets the policy for determining when to turn down the worker pool.
959 # Allowed values are: `TEARDOWN_ALWAYS`, `TEARDOWN_ON_SUCCESS`, and
960 # `TEARDOWN_NEVER`.
961 # `TEARDOWN_ALWAYS` means workers are always torn down regardless of whether
962 # the job succeeds. `TEARDOWN_ON_SUCCESS` means workers are torn down
963 # if the job succeeds. `TEARDOWN_NEVER` means the workers are never torn
964 # down.
965 #
966 # If the workers are not torn down by the service, they will
967 # continue to run and use Google Compute Engine VM resources in the
968 # user&#x27;s project until they are explicitly terminated by the user.
969 # Because of this, Google recommends using the `TEARDOWN_ALWAYS`
970 # policy except for small, manually supervised test jobs.
971 #
972 # If unknown or unspecified, the service will attempt to choose a reasonable
973 # default.
974 &quot;network&quot;: &quot;A String&quot;, # Network to which VMs will be assigned. If empty or unspecified,
975 # the service will use the network &quot;default&quot;.
976 &quot;ipConfiguration&quot;: &quot;A String&quot;, # Configuration for VM IPs.
977 &quot;diskSizeGb&quot;: 42, # Size of root disk for VMs, in GB. If zero or unspecified, the service will
978 # attempt to choose a reasonable default.
979 &quot;autoscalingSettings&quot;: { # Settings for WorkerPool autoscaling. # Settings for autoscaling of this WorkerPool.
980 &quot;maxNumWorkers&quot;: 42, # The maximum number of workers to cap scaling at.
981 &quot;algorithm&quot;: &quot;A String&quot;, # The algorithm to use for autoscaling.
982 },
983 &quot;poolArgs&quot;: { # Extra arguments for this worker pool.
984 &quot;a_key&quot;: &quot;&quot;, # Properties of the object. Contains field @type with type URL.
985 },
986 &quot;subnetwork&quot;: &quot;A String&quot;, # Subnetwork to which VMs will be assigned, if desired. Expected to be of
987 # the form &quot;regions/REGION/subnetworks/SUBNETWORK&quot;.
988 &quot;numWorkers&quot;: 42, # Number of Google Compute Engine workers in this pool needed to
989 # execute the job. If zero or unspecified, the service will
990 # attempt to choose a reasonable default.
991 &quot;numThreadsPerWorker&quot;: 42, # The number of threads per worker harness. If empty or unspecified, the
992 # service will choose a number of threads (according to the number of cores
993 # on the selected machine type for batch, or 1 by convention for streaming).
994 &quot;workerHarnessContainerImage&quot;: &quot;A String&quot;, # Required. Docker container image that executes the Cloud Dataflow worker
995 # harness, residing in Google Container Registry.
996 #
997 # Deprecated for the Fn API path. Use sdk_harness_container_images instead.
998 &quot;taskrunnerSettings&quot;: { # Taskrunner configuration settings. # Settings passed through to Google Compute Engine workers when
999 # using the standard Dataflow task runner. Users should ignore
1000 # this field.
1001 &quot;dataflowApiVersion&quot;: &quot;A String&quot;, # The API version of the endpoint, e.g. &quot;v1b3&quot;
1002 &quot;oauthScopes&quot;: [ # The OAuth2 scopes to be requested by the taskrunner in order to
1003 # access the Cloud Dataflow API.
1004 &quot;A String&quot;,
1005 ],
1006 &quot;baseUrl&quot;: &quot;A String&quot;, # The base URL for the taskrunner to use when accessing Google Cloud APIs.
1007 #
1008 # When workers access Google Cloud APIs, they logically do so via
1009 # relative URLs. If this field is specified, it supplies the base
1010 # URL to use for resolving these relative URLs. The normative
1011 # algorithm used is defined by RFC 1808, &quot;Relative Uniform Resource
1012 # Locators&quot;.
1013 #
1014 # If not specified, the default value is &quot;http://www.googleapis.com/&quot;
1015 &quot;workflowFileName&quot;: &quot;A String&quot;, # The file to store the workflow in.
1016 &quot;logToSerialconsole&quot;: True or False, # Whether to send taskrunner log info to Google Compute Engine VM serial
1017 # console.
1018 &quot;baseTaskDir&quot;: &quot;A String&quot;, # The location on the worker for task-specific subdirectories.
1019 &quot;taskUser&quot;: &quot;A String&quot;, # The UNIX user ID on the worker VM to use for tasks launched by
1020 # taskrunner; e.g. &quot;root&quot;.
1021 &quot;vmId&quot;: &quot;A String&quot;, # The ID string of the VM.
1022 &quot;alsologtostderr&quot;: True or False, # Whether to also send taskrunner log info to stderr.
1023 &quot;parallelWorkerSettings&quot;: { # Provides data to pass through to the worker harness. # The settings to pass to the parallel worker harness.
1024 &quot;shuffleServicePath&quot;: &quot;A String&quot;, # The Shuffle service path relative to the root URL, for example,
1025 # &quot;shuffle/v1beta1&quot;.
1026 &quot;tempStoragePrefix&quot;: &quot;A String&quot;, # The prefix of the resources the system should use for temporary
1027 # storage.
1028 #
1029 # The supported resource type is:
1030 #
1031 # Google Cloud Storage:
1032 #
1033 # storage.googleapis.com/{bucket}/{object}
1034 # bucket.storage.googleapis.com/{object}
1035 &quot;reportingEnabled&quot;: True or False, # Whether to send work progress updates to the service.
1036 &quot;servicePath&quot;: &quot;A String&quot;, # The Cloud Dataflow service path relative to the root URL, for example,
1037 # &quot;dataflow/v1b3/projects&quot;.
1038 &quot;baseUrl&quot;: &quot;A String&quot;, # The base URL for accessing Google Cloud APIs.
1039 #
1040 # When workers access Google Cloud APIs, they logically do so via
1041 # relative URLs. If this field is specified, it supplies the base
1042 # URL to use for resolving these relative URLs. The normative
1043 # algorithm used is defined by RFC 1808, &quot;Relative Uniform Resource
1044 # Locators&quot;.
1045 #
1046 # If not specified, the default value is &quot;http://www.googleapis.com/&quot;
1047 &quot;workerId&quot;: &quot;A String&quot;, # The ID of the worker running this pipeline.
1048 },
1049 &quot;harnessCommand&quot;: &quot;A String&quot;, # The command to launch the worker harness.
1050 &quot;logDir&quot;: &quot;A String&quot;, # The directory on the VM to store logs.
1051 &quot;streamingWorkerMainClass&quot;: &quot;A String&quot;, # The streaming worker main class name.
1052 &quot;languageHint&quot;: &quot;A String&quot;, # The suggested backend language.
1053 &quot;taskGroup&quot;: &quot;A String&quot;, # The UNIX group ID on the worker VM to use for tasks launched by
1054 # taskrunner; e.g. &quot;wheel&quot;.
1055 &quot;logUploadLocation&quot;: &quot;A String&quot;, # Indicates where to put logs. If this is not specified, the logs
1056 # will not be uploaded.
1057 #
1058 # The supported resource type is:
1059 #
1060 # Google Cloud Storage:
1061 # storage.googleapis.com/{bucket}/{object}
1062 # bucket.storage.googleapis.com/{object}
1063 &quot;commandlinesFileName&quot;: &quot;A String&quot;, # The file to store preprocessing commands in.
1064 &quot;continueOnException&quot;: True or False, # Whether to continue taskrunner if an exception is hit.
1065 &quot;tempStoragePrefix&quot;: &quot;A String&quot;, # The prefix of the resources the taskrunner should use for
1066 # temporary storage.
1067 #
1068 # The supported resource type is:
1069 #
1070 # Google Cloud Storage:
1071 # storage.googleapis.com/{bucket}/{object}
1072 # bucket.storage.googleapis.com/{object}
1073 },
1074 &quot;diskType&quot;: &quot;A String&quot;, # Type of root disk for VMs. If empty or unspecified, the service will
1075 # attempt to choose a reasonable default.
1076 &quot;defaultPackageSet&quot;: &quot;A String&quot;, # The default package set to install. This allows the service to
1077 # select a default set of packages which are useful to worker
1078 # harnesses written in a particular language.
1079 &quot;machineType&quot;: &quot;A String&quot;, # Machine type (e.g. &quot;n1-standard-1&quot;). If empty or unspecified, the
1080 # service will attempt to choose a reasonable default.
1081 },
1082 ],
1083 &quot;tempStoragePrefix&quot;: &quot;A String&quot;, # The prefix of the resources the system should use for temporary
1084 # storage. The system will append the suffix &quot;/temp-{JOBNAME}&quot; to
1085 # this resource prefix, where {JOBNAME} is the value of the
1086 # job_name field. The resulting bucket and object prefix is used
1087 # as the prefix of the resources used to store temporary data
1088 # needed during the job execution. NOTE: This will override the
1089 # value in taskrunner_settings.
1090 # The supported resource type is:
1091 #
1092 # Google Cloud Storage:
1093 #
1094 # storage.googleapis.com/{bucket}/{object}
1095 # bucket.storage.googleapis.com/{object}
1096 &quot;internalExperiments&quot;: { # Experimental settings.
1097 &quot;a_key&quot;: &quot;&quot;, # Properties of the object. Contains field @type with type URL.
1098 },
1099 &quot;sdkPipelineOptions&quot;: { # The Cloud Dataflow SDK pipeline options specified by the user. These
1100 # options are passed through the service and are used to recreate the
1101 # SDK pipeline options on the worker in a language agnostic and platform
1102 # independent way.
1103 &quot;a_key&quot;: &quot;&quot;, # Properties of the object.
1104 },
1105 &quot;dataset&quot;: &quot;A String&quot;, # The dataset for the current project where various workflow
1106 # related tables are stored.
1107 #
1108 # The supported resource type is:
1109 #
1110 # Google BigQuery:
1111 # bigquery.googleapis.com/{dataset}
1112 &quot;clusterManagerApiService&quot;: &quot;A String&quot;, # The type of cluster manager API to use. If unknown or
1113 # unspecified, the service will attempt to choose a reasonable
1114 # default. This should be in the form of the API service name,
1115 # e.g. &quot;compute.googleapis.com&quot;.
1116 },
1117 &quot;stepsLocation&quot;: &quot;A String&quot;, # The GCS location where the steps are stored.
1118 &quot;steps&quot;: [ # Exactly one of steps or steps_location should be specified.
1119 #
1120 # The top-level steps that constitute the entire job.
1121 { # Defines a particular step within a Cloud Dataflow job.
1122 #
1123 # A job consists of multiple steps, each of which performs some
1124 # specific operation as part of the overall job. Data is typically
1125 # passed from one step to another as part of the job.
1126 #
1127 # Here&#x27;s an example of a sequence of steps which together implement a
1128 # Map-Reduce job:
1129 #
1130 # * Read a collection of data from some source, parsing the
1131 # collection&#x27;s elements.
1132 #
1133 # * Validate the elements.
1134 #
1135 # * Apply a user-defined function to map each element to some value
1136 # and extract an element-specific key value.
1137 #
1138 # * Group elements with the same key into a single element with
1139 # that key, transforming a multiply-keyed collection into a
1140 # uniquely-keyed collection.
1141 #
1142 # * Write the elements out to some data sink.
1143 #
1144 # Note that the Cloud Dataflow service may be used to run many different
1145 # types of jobs, not just Map-Reduce.
1146 &quot;kind&quot;: &quot;A String&quot;, # The kind of step in the Cloud Dataflow job.
1147 &quot;properties&quot;: { # Named properties associated with the step. Each kind of
1148 # predefined step has its own required set of properties.
1149 # Must be provided on Create. Only retrieved with JOB_VIEW_ALL.
1150 &quot;a_key&quot;: &quot;&quot;, # Properties of the object.
1151 },
1152 &quot;name&quot;: &quot;A String&quot;, # The name that identifies the step. This must be unique for each
1153 # step with respect to all other steps in the Cloud Dataflow job.
1154 },
1155 ],
1156 &quot;stageStates&quot;: [ # This field may be mutated by the Cloud Dataflow service;
1157 # callers cannot mutate it.
1158 { # A message describing the state of a particular execution stage.
1159 &quot;executionStageState&quot;: &quot;A String&quot;, # Execution stage states allow the same set of values as JobState.
1160 &quot;executionStageName&quot;: &quot;A String&quot;, # The name of the execution stage.
1161 &quot;currentStateTime&quot;: &quot;A String&quot;, # The time at which the stage transitioned to this state.
1162 },
1163 ],
1164 &quot;replacedByJobId&quot;: &quot;A String&quot;, # If another job is an update of this job (and thus, this job is in
1165 # `JOB_STATE_UPDATED`), this field contains the ID of that job.
1166 &quot;jobMetadata&quot;: { # Metadata available primarily for filtering jobs. Will be included in the ListJob response and Job SUMMARY view.
1167 # This field is populated by the Dataflow service to support filtering jobs
1168 # by the metadata values provided here. Populated for ListJobs and all GetJob
1169 # views SUMMARY and higher.
1170 &quot;sdkVersion&quot;: { # The version of the SDK used to run the job. # The SDK version used to run the job.
1171 &quot;sdkSupportStatus&quot;: &quot;A String&quot;, # The support status for this SDK version.
1172 &quot;versionDisplayName&quot;: &quot;A String&quot;, # A readable string describing the version of the SDK.
1173 &quot;version&quot;: &quot;A String&quot;, # The version of the SDK used to run the job.
1174 },
1175 &quot;bigTableDetails&quot;: [ # Identification of a BigTable source used in the Dataflow job.
1176 { # Metadata for a BigTable connector used by the job.
1177 &quot;instanceId&quot;: &quot;A String&quot;, # InstanceId accessed in the connection.
1178 &quot;tableId&quot;: &quot;A String&quot;, # TableId accessed in the connection.
1179 &quot;projectId&quot;: &quot;A String&quot;, # ProjectId accessed in the connection.
1180 },
1181 ],
1182 &quot;pubsubDetails&quot;: [ # Identification of a PubSub source used in the Dataflow job.
1183 { # Metadata for a PubSub connector used by the job.
1184 &quot;subscription&quot;: &quot;A String&quot;, # Subscription used in the connection.
1185 &quot;topic&quot;: &quot;A String&quot;, # Topic accessed in the connection.
1186 },
1187 ],
1188 &quot;bigqueryDetails&quot;: [ # Identification of a BigQuery source used in the Dataflow job.
1189 { # Metadata for a BigQuery connector used by the job.
1190 &quot;dataset&quot;: &quot;A String&quot;, # Dataset accessed in the connection.
1191 &quot;projectId&quot;: &quot;A String&quot;, # Project accessed in the connection.
1192 &quot;query&quot;: &quot;A String&quot;, # Query used to access data in the connection.
1193 &quot;table&quot;: &quot;A String&quot;, # Table accessed in the connection.
1194 },
1195 ],
1196 &quot;fileDetails&quot;: [ # Identification of a File source used in the Dataflow job.
1197 { # Metadata for a File connector used by the job.
1198 &quot;filePattern&quot;: &quot;A String&quot;, # File Pattern used to access files by the connector.
1199 },
1200 ],
1201 &quot;datastoreDetails&quot;: [ # Identification of a Datastore source used in the Dataflow job.
1202 { # Metadata for a Datastore connector used by the job.
1203 &quot;namespace&quot;: &quot;A String&quot;, # Namespace used in the connection.
1204 &quot;projectId&quot;: &quot;A String&quot;, # ProjectId accessed in the connection.
1205 },
1206 ],
1207 &quot;spannerDetails&quot;: [ # Identification of a Spanner source used in the Dataflow job.
1208 { # Metadata for a Spanner connector used by the job.
1209 &quot;instanceId&quot;: &quot;A String&quot;, # InstanceId accessed in the connection.
1210 &quot;databaseId&quot;: &quot;A String&quot;, # DatabaseId accessed in the connection.
1211 &quot;projectId&quot;: &quot;A String&quot;, # ProjectId accessed in the connection.
1212 },
1213 ],
1214 },
1215 &quot;location&quot;: &quot;A String&quot;, # The [regional endpoint]
1216 # (https://cloud.google.com/dataflow/docs/concepts/regional-endpoints) that
1217 # contains this job.
1218 &quot;transformNameMapping&quot;: { # The map of transform name prefixes of the job to be replaced to the
1219 # corresponding name prefixes of the new job.
1220 &quot;a_key&quot;: &quot;A String&quot;,
1221 },
1222 &quot;startTime&quot;: &quot;A String&quot;, # The timestamp when the job was started (transitioned to JOB_STATE_PENDING).
1223 # Flexible resource scheduling jobs are started with some delay after job
1224 # creation, so start_time is unset before start and is updated when the
1225 # job is started by the Cloud Dataflow service. For other jobs, start_time
1226 # always equals create_time and is immutable and set by the Cloud Dataflow
1227 # service.
1228 &quot;clientRequestId&quot;: &quot;A String&quot;, # The client&#x27;s unique identifier of the job, re-used across retried attempts.
1229 # If this field is set, the service will ensure its uniqueness.
1230 # The request to create a job will fail if the service has knowledge of a
1231 # previously submitted job with the same client&#x27;s ID and job name.
1232 # The caller may use this field to ensure idempotence of job
1233 # creation across retried attempts to create a job.
1234 # By default, the field is empty and, in that case, the service ignores it.
1235 &quot;executionInfo&quot;: { # Deprecated. Additional information about how a Cloud Dataflow job will be executed
1236 # that isn&#x27;t contained in the submitted job.
1237 &quot;stages&quot;: { # A mapping from each stage to the information about that stage.
1238 &quot;a_key&quot;: { # Contains information about how a particular
1239 # google.dataflow.v1beta3.Step will be executed.
1240 &quot;stepName&quot;: [ # The steps associated with the execution stage.
1241 # Note that stages may have several steps, and that a given step
1242 # might be run by more than one stage.
1243 &quot;A String&quot;,
1244 ],
1245 },
1246 },
1247 },
1248 &quot;type&quot;: &quot;A String&quot;, # The type of Cloud Dataflow job.
1249 &quot;createTime&quot;: &quot;A String&quot;, # The timestamp when the job was initially created. Immutable and set by the
1250 # Cloud Dataflow service.
1251 &quot;tempFiles&quot;: [ # A set of files the system should be aware of that are used
1252 # for temporary storage. These temporary files will be
1253 # removed on job completion.
1254 # No duplicates are allowed.
1255 # No file patterns are supported.
1256 #
1257 # The supported files are:
1258 #
1259 # Google Cloud Storage:
1260 #
1261 # storage.googleapis.com/{bucket}/{object}
1262 # bucket.storage.googleapis.com/{object}
1263 &quot;A String&quot;,
1264 ],
1265 &quot;id&quot;: &quot;A String&quot;, # The unique ID of this job.
1266 #
1267 # This field is set by the Cloud Dataflow service when the Job is
1268 # created, and is immutable for the life of the job.
1269 &quot;requestedState&quot;: &quot;A String&quot;, # The job&#x27;s requested state.
1270 #
1271 # `UpdateJob` may be used to switch between the `JOB_STATE_STOPPED` and
1272 # `JOB_STATE_RUNNING` states, by setting requested_state. `UpdateJob` may
1273 # also be used to directly set a job&#x27;s requested state to
1274 # `JOB_STATE_CANCELLED` or `JOB_STATE_DONE`, irrevocably terminating the
1275 # job if it has not already reached a terminal state.
1276 &quot;replaceJobId&quot;: &quot;A String&quot;, # If this job is an update of an existing job, this field is the job ID
1277 # of the job it replaced.
1278 #
1279 # When sending a `CreateJobRequest`, you can update a job by specifying it
1280 # here. The job named here is stopped, and its intermediate state is
1281 # transferred to this job.
1282 &quot;createdFromSnapshotId&quot;: &quot;A String&quot;, # If this is specified, the job&#x27;s initial state is populated from the given
1283 # snapshot.
1284 &quot;currentState&quot;: &quot;A String&quot;, # The current state of the job.
1285 #
1286 # Jobs are created in the `JOB_STATE_STOPPED` state unless otherwise
1287 # specified.
1288 #
1289 # A job in the `JOB_STATE_RUNNING` state may asynchronously enter a
1290 # terminal state. After a job has reached a terminal state, no
1291 # further state updates may be made.
1292 #
1293 # This field may be mutated by the Cloud Dataflow service;
1294 # callers cannot mutate it.
1295 &quot;name&quot;: &quot;A String&quot;, # The user-specified Cloud Dataflow job name.
1296 #
1297 # Only one Job with a given name may exist in a project at any
1298 # given time. If a caller attempts to create a Job with the same
1299 # name as an already-existing Job, the attempt returns the
1300 # existing Job.
1301 #
1302 # The name must match the regular expression
1303 # `[a-z]([-a-z0-9]{0,38}[a-z0-9])?`
1304 &quot;currentStateTime&quot;: &quot;A String&quot;, # The timestamp associated with the current state.
1305 }</pre>
1306</div>
1307
1308<div class="method">
1309 <code class="details" id="get">get(projectId, location, jobId, view=None, x__xgafv=None)</code>
1310 <pre>Gets the state of the specified Cloud Dataflow job.
1311
1312To get the state of a job, we recommend using `projects.locations.jobs.get`
1313with a [regional endpoint]
1314(https://cloud.google.com/dataflow/docs/concepts/regional-endpoints). Using
1315`projects.jobs.get` is not recommended, as you can only get the state of
1316jobs that are running in `us-central1`.
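
A minimal sketch of the recommended regional call from the Python client
follows. The helper name `get_job_state` is hypothetical, and `service` is
assumed to be a Dataflow v1b3 client object (for instance one built with
`googleapiclient.discovery.build`). The project, region, and job ID values
shown in the comments are placeholders.

```python
def get_job_state(service, project_id, location, job_id, view=None):
    """Return the currentState string of a Dataflow job, or None if absent.

    `service` is assumed to be a Dataflow v1b3 client, for example:
        from googleapiclient.discovery import build
        service = build('dataflow', 'v1b3')
    """
    kwargs = {'projectId': project_id, 'location': location, 'jobId': job_id}
    if view is not None:
        # Optional level of detail, for example 'JOB_VIEW_ALL'.
        kwargs['view'] = view
    # projects.locations.jobs.get targets the regional endpoint named by
    # `location`, which is the variant recommended above.
    request = service.projects().locations().jobs().get(**kwargs)
    job = request.execute()
    return job.get('currentState')


# Example call (placeholder values, requires credentials and network access):
#   get_job_state(service, 'my-project', 'us-central1', 'my-job-id')
```

Passing the client in as an argument keeps the helper easy to exercise with a
stub object in tests.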
1317
1318Args:
1319 projectId: string, The ID of the Cloud Platform project that the job belongs to. (required)
1320 location: string, The [regional endpoint]
1321(https://cloud.google.com/dataflow/docs/concepts/regional-endpoints) that
1322contains this job. (required)
1323 jobId: string, The job ID. (required)
1324 view: string, The level of information requested in response.
1325 x__xgafv: string, V1 error format.
1326 Allowed values
1327 1 - v1 error format
1328 2 - v2 error format
1329
1330Returns:
1331 An object of the form:
1332
1333 { # Defines a job to be run by the Cloud Dataflow service.
1334 &quot;pipelineDescription&quot;: { # Preliminary field: The format of this data may change at any time.
1335 # A description of the user pipeline and stages through which it is executed.
1336 # Created by Cloud Dataflow service. Only retrieved with
1337 # JOB_VIEW_DESCRIPTION or JOB_VIEW_ALL.
1338 # A descriptive representation of the submitted pipeline as well as its executed form. This data is provided
1339 # by the Dataflow service for ease of visualizing the pipeline and interpreting Dataflow-provided metrics.
1340 &quot;displayData&quot;: [ # Pipeline level display data.
1341 { # Data provided with a pipeline or transform to provide descriptive info.
1342 &quot;url&quot;: &quot;A String&quot;, # An optional full URL.
1343 &quot;javaClassValue&quot;: &quot;A String&quot;, # Contains value if the data is of java class type.
1344 &quot;timestampValue&quot;: &quot;A String&quot;, # Contains value if the data is of timestamp type.
1345 &quot;durationValue&quot;: &quot;A String&quot;, # Contains value if the data is of duration type.
1346 &quot;label&quot;: &quot;A String&quot;, # An optional label to display in a dax UI for the element.
1347 &quot;key&quot;: &quot;A String&quot;, # The key identifying the display data.
1348 # This is intended to be used as a label for the display data
1349 # when viewed in a dax monitoring system.
1350 &quot;namespace&quot;: &quot;A String&quot;, # The namespace for the key. This is usually a class name or programming
1351 # language namespace (e.g. a Python module) which defines the display data.
1352 # This allows a dax monitoring system to specially handle the data
1353 # and perform custom rendering.
1354 &quot;floatValue&quot;: 3.14, # Contains value if the data is of float type.
1355 &quot;strValue&quot;: &quot;A String&quot;, # Contains value if the data is of string type.
1356 &quot;int64Value&quot;: &quot;A String&quot;, # Contains value if the data is of int64 type.
1357 &quot;boolValue&quot;: True or False, # Contains value if the data is of a boolean type.
1358 &quot;shortStrValue&quot;: &quot;A String&quot;, # A possible additional shorter value to display.
1359 # For example a java_class_name_value of com.mypackage.MyDoFn
1360 # will be stored with MyDoFn as the short_str_value and
1361 # com.mypackage.MyDoFn as the java_class_name_value.
1362 # short_str_value can be displayed and java_class_name_value
1363 # will be displayed as a tooltip.
1364 },
1365 ],
1366 &quot;originalPipelineTransform&quot;: [ # Description of each transform in the pipeline and collections between them.
1367 { # Description of the type, names/ids, and input/outputs for a transform.
1368 &quot;outputCollectionName&quot;: [ # User names for all collection outputs to this transform.
1369 &quot;A String&quot;,
1370 ],
1371 &quot;displayData&quot;: [ # Transform-specific display data.
1372 { # Data provided with a pipeline or transform to provide descriptive info.
1373 &quot;url&quot;: &quot;A String&quot;, # An optional full URL.
1374 &quot;javaClassValue&quot;: &quot;A String&quot;, # Contains value if the data is of java class type.
1375 &quot;timestampValue&quot;: &quot;A String&quot;, # Contains value if the data is of timestamp type.
1376 &quot;durationValue&quot;: &quot;A String&quot;, # Contains value if the data is of duration type.
1377 &quot;label&quot;: &quot;A String&quot;, # An optional label to display in a dax UI for the element.
1378 &quot;key&quot;: &quot;A String&quot;, # The key identifying the display data.
1379 # This is intended to be used as a label for the display data
1380 # when viewed in a dax monitoring system.
1381 &quot;namespace&quot;: &quot;A String&quot;, # The namespace for the key. This is usually a class name or programming
1382 # language namespace (e.g. a Python module) which defines the display data.
1383 # This allows a dax monitoring system to specially handle the data
1384 # and perform custom rendering.
1385 &quot;floatValue&quot;: 3.14, # Contains value if the data is of float type.
1386 &quot;strValue&quot;: &quot;A String&quot;, # Contains value if the data is of string type.
1387 &quot;int64Value&quot;: &quot;A String&quot;, # Contains value if the data is of int64 type.
1388 &quot;boolValue&quot;: True or False, # Contains value if the data is of a boolean type.
1389 &quot;shortStrValue&quot;: &quot;A String&quot;, # A possible additional shorter value to display.
1390 # For example a java_class_name_value of com.mypackage.MyDoFn
1391 # will be stored with MyDoFn as the short_str_value and
1392 # com.mypackage.MyDoFn as the java_class_name_value.
1393 # short_str_value can be displayed and java_class_name_value
1394 # will be displayed as a tooltip.
1395 },
1396 ],
1397 &quot;id&quot;: &quot;A String&quot;, # SDK generated id of this transform instance.
1398 &quot;inputCollectionName&quot;: [ # User names for all collection inputs to this transform.
1399 &quot;A String&quot;,
1400 ],
1401 &quot;name&quot;: &quot;A String&quot;, # User provided name for this transform instance.
1402 &quot;kind&quot;: &quot;A String&quot;, # Type of transform.
1403 },
1404 ],
1405 &quot;executionPipelineStage&quot;: [ # Description of each stage of execution of the pipeline.
1406 { # Description of the composing transforms, names/ids, and input/outputs of a
1407 # stage of execution. Some composing transforms and sources may have been
1408 # generated by the Dataflow service during execution planning.
1409 &quot;componentSource&quot;: [ # Collections produced and consumed by component transforms of this stage.
1410 { # Description of an interstitial value between transforms in an execution
1411 # stage.
1412 &quot;userName&quot;: &quot;A String&quot;, # Human-readable name for this transform; may be user or system generated.
1413 &quot;name&quot;: &quot;A String&quot;, # Dataflow service generated name for this source.
1414 &quot;originalTransformOrCollection&quot;: &quot;A String&quot;, # User name for the original user transform or collection with which this
1415 # source is most closely associated.
1416 },
1417 ],
1418 &quot;inputSource&quot;: [ # Input sources for this stage.
1419 { # Description of an input or output of an execution stage.
1420 &quot;userName&quot;: &quot;A String&quot;, # Human-readable name for this source; may be user or system generated.
1421 &quot;originalTransformOrCollection&quot;: &quot;A String&quot;, # User name for the original user transform or collection with which this
1422 # source is most closely associated.
1423 &quot;sizeBytes&quot;: &quot;A String&quot;, # Size of the source, if measurable.
1424 &quot;name&quot;: &quot;A String&quot;, # Dataflow service generated name for this source.
1425 },
1426 ],
1427 &quot;name&quot;: &quot;A String&quot;, # Dataflow service generated name for this stage.
1428 &quot;componentTransform&quot;: [ # Transforms that comprise this execution stage.
1429 { # Description of a transform executed as part of an execution stage.
1430 &quot;name&quot;: &quot;A String&quot;, # Dataflow service generated name for this source.
1431 &quot;userName&quot;: &quot;A String&quot;, # Human-readable name for this transform; may be user or system generated.
1432 &quot;originalTransform&quot;: &quot;A String&quot;, # User name for the original user transform with which this transform is
1433 # most closely associated.
1434 },
1435 ],
1436 &quot;id&quot;: &quot;A String&quot;, # Dataflow service generated id for this stage.
1437 &quot;outputSource&quot;: [ # Output sources for this stage.
1438 { # Description of an input or output of an execution stage.
1439 &quot;userName&quot;: &quot;A String&quot;, # Human-readable name for this source; may be user or system generated.
1440 &quot;originalTransformOrCollection&quot;: &quot;A String&quot;, # User name for the original user transform or collection with which this
1441 # source is most closely associated.
1442 &quot;sizeBytes&quot;: &quot;A String&quot;, # Size of the source, if measurable.
1443 &quot;name&quot;: &quot;A String&quot;, # Dataflow service generated name for this source.
1444 },
1445 ],
1446 &quot;kind&quot;: &quot;A String&quot;, # Type of transform this stage is executing.
1447 },
1448 ],
1449 },
1450 &quot;labels&quot;: { # User-defined labels for this job.
1451 #
1452 # The labels map can contain no more than 64 entries. Entries of the labels
1453 # map are UTF8 strings that comply with the following restrictions:
1454 #
1455 # * Keys must conform to regexp: \p{Ll}\p{Lo}{0,62}
1456 # * Values must conform to regexp: [\p{Ll}\p{Lo}\p{N}_-]{0,63}
1457 # * Both keys and values are additionally constrained to be &lt;= 128 bytes in
1458 # size.
1459 &quot;a_key&quot;: &quot;A String&quot;,
1460 },
1461 &quot;projectId&quot;: &quot;A String&quot;, # The ID of the Cloud Platform project that the job belongs to.
1462 &quot;environment&quot;: { # Describes the environment in which a Dataflow Job runs. # The environment for the job.
1463 &quot;flexResourceSchedulingGoal&quot;: &quot;A String&quot;, # Which Flexible Resource Scheduling mode to run in.
1464 &quot;workerRegion&quot;: &quot;A String&quot;, # The Compute Engine region
1465 # (https://cloud.google.com/compute/docs/regions-zones/regions-zones) in
1466 # which worker processing should occur, e.g. &quot;us-west1&quot;. Mutually exclusive
1467 # with worker_zone. If neither worker_region nor worker_zone is specified,
1468 # default to the control plane&#x27;s region.
1469 &quot;userAgent&quot;: { # A description of the process that generated the request.
1470 &quot;a_key&quot;: &quot;&quot;, # Properties of the object.
1471 },
1472 &quot;serviceAccountEmail&quot;: &quot;A String&quot;, # Identity to run virtual machines as. Defaults to the default account.
1473 &quot;version&quot;: { # A structure describing which components and their versions of the service
1474 # are required in order to run the job.
1475 &quot;a_key&quot;: &quot;&quot;, # Properties of the object.
1476 },
1477 &quot;serviceKmsKeyName&quot;: &quot;A String&quot;, # If set, contains the Cloud KMS key identifier used to encrypt data
1478 # at rest, AKA a Customer Managed Encryption Key (CMEK).
1479 #
1480 # Format:
1481 # projects/PROJECT_ID/locations/LOCATION/keyRings/KEY_RING/cryptoKeys/KEY
1482 &quot;experiments&quot;: [ # The list of experiments to enable.
1483 &quot;A String&quot;,
1484 ],
1485 &quot;workerZone&quot;: &quot;A String&quot;, # The Compute Engine zone
1486 # (https://cloud.google.com/compute/docs/regions-zones/regions-zones) in
1487 # which worker processing should occur, e.g. &quot;us-west1-a&quot;. Mutually exclusive
1488 # with worker_region. If neither worker_region nor worker_zone is specified,
1489 # a zone in the control plane&#x27;s region is chosen based on available capacity.
1490 &quot;workerPools&quot;: [ # The worker pools. At least one &quot;harness&quot; worker pool must be
1491 # specified in order for the job to have workers.
1492 { # Describes one particular pool of Cloud Dataflow workers to be
1493 # instantiated by the Cloud Dataflow service in order to perform the
1494 # computations required by a job. Note that a workflow job may use
1495 # multiple pools, in order to match the various computational
1496 # requirements of the various stages of the job.
1497 &quot;onHostMaintenance&quot;: &quot;A String&quot;, # The action to take on host maintenance, as defined by the Google
1498 # Compute Engine API.
1499 &quot;sdkHarnessContainerImages&quot;: [ # Set of SDK harness containers needed to execute this pipeline. This will
1500 # only be set in the Fn API path. For non-cross-language pipelines this
1501 # should have only one entry. Cross-language pipelines will have two or more
1502 # entries.
1503 { # Defines an SDK harness container for executing Dataflow pipelines.
1504 &quot;containerImage&quot;: &quot;A String&quot;, # A Docker container image that resides in Google Container Registry.
1505 &quot;useSingleCorePerContainer&quot;: True or False, # If true, recommends that the Dataflow service use only one core per SDK
1506 # container instance with this image. If false (or unset), recommends using
1507 # more than one core per SDK container instance with this image for
1508 # efficiency. Note that the Dataflow service may choose to override this property
1509 # if needed.
1510 },
1511 ],
1512 &quot;zone&quot;: &quot;A String&quot;, # Zone to run the worker pools in. If empty or unspecified, the service
1513 # will attempt to choose a reasonable default.
1514 &quot;kind&quot;: &quot;A String&quot;, # The kind of the worker pool; currently only `harness` and `shuffle`
1515 # are supported.
1516 &quot;metadata&quot;: { # Metadata to set on the Google Compute Engine VMs.
1517 &quot;a_key&quot;: &quot;A String&quot;,
1518 },
1519 &quot;diskSourceImage&quot;: &quot;A String&quot;, # Fully qualified source image for disks.
1520 &quot;dataDisks&quot;: [ # Data disks that are used by a VM in this workflow.
1521 { # Describes the data disk used by a workflow job.
1522 &quot;sizeGb&quot;: 42, # Size of disk in GB. If zero or unspecified, the service will
1523 # attempt to choose a reasonable default.
1524 &quot;diskType&quot;: &quot;A String&quot;, # Disk storage type, as defined by Google Compute Engine. This
1525 # must be a disk type appropriate to the project and zone in which
1526 # the workers will run. If unknown or unspecified, the service
1527 # will attempt to choose a reasonable default.
1528 #
1529 # For example, the standard persistent disk type is a resource name
1530 # typically ending in &quot;pd-standard&quot;. If SSD persistent disks are
1531 # available, the resource name typically ends with &quot;pd-ssd&quot;. The
1532 # actual valid values are defined by the Google Compute Engine API,
1533 # not by the Cloud Dataflow API; consult the Google Compute Engine
1534 # documentation for more information about determining the set of
1535 # available disk types for a particular project and zone.
1536 #
1537 # Google Compute Engine Disk types are local to a particular
1538 # project in a particular zone, and so the resource name will
1539 # typically look something like this:
1540 #
1541 # compute.googleapis.com/projects/project-id/zones/zone/diskTypes/pd-standard
1542 &quot;mountPoint&quot;: &quot;A String&quot;, # Directory in a VM where disk is mounted.
1543 },
1544 ],
1545 &quot;packages&quot;: [ # Packages to be installed on workers.
1546 { # The packages that must be installed in order for a worker to run the
1547 # steps of the Cloud Dataflow job that will be assigned to its worker
1548 # pool.
1549 #
1550 # This is the mechanism by which the Cloud Dataflow SDK causes code to
1551 # be loaded onto the workers. For example, the Cloud Dataflow Java SDK
1552 # might use this to install jars containing the user&#x27;s code and all of the
1553 # various dependencies (libraries, data files, etc.) required in order
1554 # for that code to run.
1555 &quot;name&quot;: &quot;A String&quot;, # The name of the package.
1556 &quot;location&quot;: &quot;A String&quot;, # The resource to read the package from. The supported resource type is:
1557 #
1558 # Google Cloud Storage:
1559 #
1560 # storage.googleapis.com/{bucket}
1561 # bucket.storage.googleapis.com/
1562 },
1563 ],
1564 &quot;teardownPolicy&quot;: &quot;A String&quot;, # Sets the policy for determining when to turn down the worker pool.
1565 # Allowed values are: `TEARDOWN_ALWAYS`, `TEARDOWN_ON_SUCCESS`, and
1566 # `TEARDOWN_NEVER`.
1567 # `TEARDOWN_ALWAYS` means workers are always torn down regardless of whether
1568 # the job succeeds. `TEARDOWN_ON_SUCCESS` means workers are torn down
1569 # if the job succeeds. `TEARDOWN_NEVER` means the workers are never torn
1570 # down.
1571 #
1572 # If the workers are not torn down by the service, they will
1573 # continue to run and use Google Compute Engine VM resources in the
1574 # user&#x27;s project until they are explicitly terminated by the user.
1575 # Because of this, Google recommends using the `TEARDOWN_ALWAYS`
1576 # policy except for small, manually supervised test jobs.
1577 #
1578 # If unknown or unspecified, the service will attempt to choose a reasonable
1579 # default.
1580 &quot;network&quot;: &quot;A String&quot;, # Network to which VMs will be assigned. If empty or unspecified,
1581 # the service will use the network &quot;default&quot;.
1582 &quot;ipConfiguration&quot;: &quot;A String&quot;, # Configuration for VM IPs.
1583 &quot;diskSizeGb&quot;: 42, # Size of root disk for VMs, in GB. If zero or unspecified, the service will
1584 # attempt to choose a reasonable default.
1585 &quot;autoscalingSettings&quot;: { # Settings for WorkerPool autoscaling. # Settings for autoscaling of this WorkerPool.
1586 &quot;maxNumWorkers&quot;: 42, # The maximum number of workers to cap scaling at.
1587 &quot;algorithm&quot;: &quot;A String&quot;, # The algorithm to use for autoscaling.
1588 },
1589 &quot;poolArgs&quot;: { # Extra arguments for this worker pool.
1590 &quot;a_key&quot;: &quot;&quot;, # Properties of the object. Contains field @type with type URL.
1591 },
1592 &quot;subnetwork&quot;: &quot;A String&quot;, # Subnetwork to which VMs will be assigned, if desired. Expected to be of
1593 # the form &quot;regions/REGION/subnetworks/SUBNETWORK&quot;.
1594 &quot;numWorkers&quot;: 42, # Number of Google Compute Engine workers in this pool needed to
1595 # execute the job. If zero or unspecified, the service will
1596 # attempt to choose a reasonable default.
1597 &quot;numThreadsPerWorker&quot;: 42, # The number of threads per worker harness. If empty or unspecified, the
1598 # service will choose a number of threads (according to the number of cores
1599 # on the selected machine type for batch, or 1 by convention for streaming).
1600 &quot;workerHarnessContainerImage&quot;: &quot;A String&quot;, # Required. Docker container image that executes the Cloud Dataflow worker
1601 # harness, residing in Google Container Registry.
1602 #
1603 # Deprecated for the Fn API path. Use sdk_harness_container_images instead.
1604 &quot;taskrunnerSettings&quot;: { # Taskrunner configuration settings. # Settings passed through to Google Compute Engine workers when
1605 # using the standard Dataflow task runner. Users should ignore
1606 # this field.
1607 &quot;dataflowApiVersion&quot;: &quot;A String&quot;, # The API version of endpoint, e.g. &quot;v1b3&quot;
1608 &quot;oauthScopes&quot;: [ # The OAuth2 scopes to be requested by the taskrunner in order to
1609 # access the Cloud Dataflow API.
1610 &quot;A String&quot;,
1611 ],
1612 &quot;baseUrl&quot;: &quot;A String&quot;, # The base URL for the taskrunner to use when accessing Google Cloud APIs.
1613 #
1614 # When workers access Google Cloud APIs, they logically do so via
1615 # relative URLs. If this field is specified, it supplies the base
1616 # URL to use for resolving these relative URLs. The normative
1617 # algorithm used is defined by RFC 1808, &quot;Relative Uniform Resource
1618 # Locators&quot;.
1619 #
1620 # If not specified, the default value is &quot;http://www.googleapis.com/&quot;
1621 &quot;workflowFileName&quot;: &quot;A String&quot;, # The file to store the workflow in.
1622 &quot;logToSerialconsole&quot;: True or False, # Whether to send taskrunner log info to Google Compute Engine VM serial
1623 # console.
1624 &quot;baseTaskDir&quot;: &quot;A String&quot;, # The location on the worker for task-specific subdirectories.
1625 &quot;taskUser&quot;: &quot;A String&quot;, # The UNIX user ID on the worker VM to use for tasks launched by
1626 # taskrunner; e.g. &quot;root&quot;.
1627 &quot;vmId&quot;: &quot;A String&quot;, # The ID string of the VM.
1628 &quot;alsologtostderr&quot;: True or False, # Whether to also send taskrunner log info to stderr.
1629 &quot;parallelWorkerSettings&quot;: { # Provides data to pass through to the worker harness. # The settings to pass to the parallel worker harness.
1630 &quot;shuffleServicePath&quot;: &quot;A String&quot;, # The Shuffle service path relative to the root URL, for example,
1631 # &quot;shuffle/v1beta1&quot;.
1632 &quot;tempStoragePrefix&quot;: &quot;A String&quot;, # The prefix of the resources the system should use for temporary
1633 # storage.
1634 #
1635 # The supported resource type is:
1636 #
1637 # Google Cloud Storage:
1638 #
1639 # storage.googleapis.com/{bucket}/{object}
1640 # bucket.storage.googleapis.com/{object}
1641 &quot;reportingEnabled&quot;: True or False, # Whether to send work progress updates to the service.
1642 &quot;servicePath&quot;: &quot;A String&quot;, # The Cloud Dataflow service path relative to the root URL, for example,
1643 # &quot;dataflow/v1b3/projects&quot;.
1644 &quot;baseUrl&quot;: &quot;A String&quot;, # The base URL for accessing Google Cloud APIs.
1645 #
1646 # When workers access Google Cloud APIs, they logically do so via
1647 # relative URLs. If this field is specified, it supplies the base
1648 # URL to use for resolving these relative URLs. The normative
1649 # algorithm used is defined by RFC 1808, &quot;Relative Uniform Resource
1650 # Locators&quot;.
1651 #
1652 # If not specified, the default value is &quot;http://www.googleapis.com/&quot;
1653 &quot;workerId&quot;: &quot;A String&quot;, # The ID of the worker running this pipeline.
1654 },
1655 &quot;harnessCommand&quot;: &quot;A String&quot;, # The command to launch the worker harness.
1656 &quot;logDir&quot;: &quot;A String&quot;, # The directory on the VM to store logs.
1657 &quot;streamingWorkerMainClass&quot;: &quot;A String&quot;, # The streaming worker main class name.
1658 &quot;languageHint&quot;: &quot;A String&quot;, # The suggested backend language.
1659 &quot;taskGroup&quot;: &quot;A String&quot;, # The UNIX group ID on the worker VM to use for tasks launched by
1660 # taskrunner; e.g. &quot;wheel&quot;.
1661 &quot;logUploadLocation&quot;: &quot;A String&quot;, # Indicates where to put logs. If this is not specified, the logs
1662 # will not be uploaded.
1663 #
1664 # The supported resource type is:
1665 #
1666 # Google Cloud Storage:
1667 # storage.googleapis.com/{bucket}/{object}
1668 # bucket.storage.googleapis.com/{object}
1669 &quot;commandlinesFileName&quot;: &quot;A String&quot;, # The file to store preprocessing commands in.
1670 &quot;continueOnException&quot;: True or False, # Whether to continue taskrunner if an exception is hit.
1671 &quot;tempStoragePrefix&quot;: &quot;A String&quot;, # The prefix of the resources the taskrunner should use for
1672 # temporary storage.
1673 #
1674 # The supported resource type is:
1675 #
1676 # Google Cloud Storage:
1677 # storage.googleapis.com/{bucket}/{object}
1678 # bucket.storage.googleapis.com/{object}
1679 },
1680 &quot;diskType&quot;: &quot;A String&quot;, # Type of root disk for VMs. If empty or unspecified, the service will
1681 # attempt to choose a reasonable default.
1682 &quot;defaultPackageSet&quot;: &quot;A String&quot;, # The default package set to install. This allows the service to
1683 # select a default set of packages which are useful to worker
1684 # harnesses written in a particular language.
1685 &quot;machineType&quot;: &quot;A String&quot;, # Machine type (e.g. &quot;n1-standard-1&quot;). If empty or unspecified, the
1686 # service will attempt to choose a reasonable default.
1687 },
1688 ],
1689 &quot;tempStoragePrefix&quot;: &quot;A String&quot;, # The prefix of the resources the system should use for temporary
1690 # storage. The system will append the suffix &quot;/temp-{JOBNAME}&quot; to
1691 # this resource prefix, where {JOBNAME} is the value of the
1692 # job_name field. The resulting bucket and object prefix is used
1693 # as the prefix of the resources used to store temporary data
1694 # needed during the job execution. NOTE: This will override the
1695 # value in taskrunner_settings.
1696 # The supported resource type is:
1697 #
1698 # Google Cloud Storage:
1699 #
1700 # storage.googleapis.com/{bucket}/{object}
1701 # bucket.storage.googleapis.com/{object}
1702 &quot;internalExperiments&quot;: { # Experimental settings.
1703 &quot;a_key&quot;: &quot;&quot;, # Properties of the object. Contains field @type with type URL.
1704 },
1705 &quot;sdkPipelineOptions&quot;: { # The Cloud Dataflow SDK pipeline options specified by the user. These
1706 # options are passed through the service and are used to recreate the
1707 # SDK pipeline options on the worker in a language agnostic and platform
1708 # independent way.
1709 &quot;a_key&quot;: &quot;&quot;, # Properties of the object.
1710 },
1711 &quot;dataset&quot;: &quot;A String&quot;, # The dataset for the current project where various workflow
1712 # related tables are stored.
1713 #
1714 # The supported resource type is:
1715 #
1716 # Google BigQuery:
1717 # bigquery.googleapis.com/{dataset}
1718 &quot;clusterManagerApiService&quot;: &quot;A String&quot;, # The type of cluster manager API to use. If unknown or
1719 # unspecified, the service will attempt to choose a reasonable
1720 # default. This should be in the form of the API service name,
1721 # e.g. &quot;compute.googleapis.com&quot;.
1722 },
1723 &quot;stepsLocation&quot;: &quot;A String&quot;, # The GCS location where the steps are stored.
1724 &quot;steps&quot;: [ # Exactly one of steps or steps_location should be specified.
1725 #
1726 # The top-level steps that constitute the entire job.
1727 { # Defines a particular step within a Cloud Dataflow job.
1728 #
1729 # A job consists of multiple steps, each of which performs some
1730 # specific operation as part of the overall job. Data is typically
1731 # passed from one step to another as part of the job.
1732 #
1733 # Here&#x27;s an example of a sequence of steps which together implement a
1734 # Map-Reduce job:
1735 #
1736 # * Read a collection of data from some source, parsing the
1737 # collection&#x27;s elements.
1738 #
1739 # * Validate the elements.
1740 #
1741 # * Apply a user-defined function to map each element to some value
1742 # and extract an element-specific key value.
1743 #
1744 # * Group elements with the same key into a single element with
1745 # that key, transforming a multiply-keyed collection into a
1746 # uniquely-keyed collection.
1747 #
1748 # * Write the elements out to some data sink.
1749 #
1750 # Note that the Cloud Dataflow service may be used to run many different
1751 # types of jobs, not just Map-Reduce.
1752 &quot;kind&quot;: &quot;A String&quot;, # The kind of step in the Cloud Dataflow job.
1753 &quot;properties&quot;: { # Named properties associated with the step. Each kind of
1754 # predefined step has its own required set of properties.
1755 # Must be provided on Create. Only retrieved with JOB_VIEW_ALL.
1756 &quot;a_key&quot;: &quot;&quot;, # Properties of the object.
1757 },
1758 &quot;name&quot;: &quot;A String&quot;, # The name that identifies the step. This must be unique for each
1759 # step with respect to all other steps in the Cloud Dataflow job.
1760 },
1761 ],
1762 &quot;stageStates&quot;: [ # This field may be mutated by the Cloud Dataflow service;
1763 # callers cannot mutate it.
1764 { # A message describing the state of a particular execution stage.
1765 &quot;executionStageState&quot;: &quot;A String&quot;, # Execution stage states allow the same set of values as JobState.
1766 &quot;executionStageName&quot;: &quot;A String&quot;, # The name of the execution stage.
1767 &quot;currentStateTime&quot;: &quot;A String&quot;, # The time at which the stage transitioned to this state.
1768 },
1769 ],
1770 &quot;replacedByJobId&quot;: &quot;A String&quot;, # If another job is an update of this job (and thus, this job is in
1771 # `JOB_STATE_UPDATED`), this field contains the ID of that job.
1772 &quot;jobMetadata&quot;: { # Metadata available primarily for filtering jobs. Will be included in the
1773 # ListJob response and Job SUMMARY view. This field is populated by the
1774 # Dataflow service to support filtering jobs by the metadata values provided
1775 # here. Populated for ListJobs and all GetJob views SUMMARY and higher.
1776 &quot;sdkVersion&quot;: { # The version of the SDK used to run the job. # The SDK version used to run the job.
1777 &quot;sdkSupportStatus&quot;: &quot;A String&quot;, # The support status for this SDK version.
1778 &quot;versionDisplayName&quot;: &quot;A String&quot;, # A readable string describing the version of the SDK.
1779 &quot;version&quot;: &quot;A String&quot;, # The version of the SDK used to run the job.
1780 },
1781 &quot;bigTableDetails&quot;: [ # Identification of a BigTable source used in the Dataflow job.
1782 { # Metadata for a BigTable connector used by the job.
1783 &quot;instanceId&quot;: &quot;A String&quot;, # InstanceId accessed in the connection.
1784 &quot;tableId&quot;: &quot;A String&quot;, # TableId accessed in the connection.
1785 &quot;projectId&quot;: &quot;A String&quot;, # ProjectId accessed in the connection.
1786 },
1787 ],
1788 &quot;pubsubDetails&quot;: [ # Identification of a PubSub source used in the Dataflow job.
1789 { # Metadata for a PubSub connector used by the job.
1790 &quot;subscription&quot;: &quot;A String&quot;, # Subscription used in the connection.
1791 &quot;topic&quot;: &quot;A String&quot;, # Topic accessed in the connection.
1792 },
1793 ],
1794 &quot;bigqueryDetails&quot;: [ # Identification of a BigQuery source used in the Dataflow job.
1795 { # Metadata for a BigQuery connector used by the job.
1796 &quot;dataset&quot;: &quot;A String&quot;, # Dataset accessed in the connection.
1797 &quot;projectId&quot;: &quot;A String&quot;, # Project accessed in the connection.
1798 &quot;query&quot;: &quot;A String&quot;, # Query used to access data in the connection.
1799 &quot;table&quot;: &quot;A String&quot;, # Table accessed in the connection.
1800 },
1801 ],
1802 &quot;fileDetails&quot;: [ # Identification of a File source used in the Dataflow job.
1803 { # Metadata for a File connector used by the job.
1804 &quot;filePattern&quot;: &quot;A String&quot;, # File Pattern used to access files by the connector.
1805 },
1806 ],
1807 &quot;datastoreDetails&quot;: [ # Identification of a Datastore source used in the Dataflow job.
1808 { # Metadata for a Datastore connector used by the job.
1809 &quot;namespace&quot;: &quot;A String&quot;, # Namespace used in the connection.
1810 &quot;projectId&quot;: &quot;A String&quot;, # ProjectId accessed in the connection.
1811 },
1812 ],
1813 &quot;spannerDetails&quot;: [ # Identification of a Spanner source used in the Dataflow job.
1814 { # Metadata for a Spanner connector used by the job.
1815 &quot;instanceId&quot;: &quot;A String&quot;, # InstanceId accessed in the connection.
1816 &quot;databaseId&quot;: &quot;A String&quot;, # DatabaseId accessed in the connection.
1817 &quot;projectId&quot;: &quot;A String&quot;, # ProjectId accessed in the connection.
1818 },
1819 ],
1820 },
1821 &quot;location&quot;: &quot;A String&quot;, # The [regional endpoint]
1822 # (https://cloud.google.com/dataflow/docs/concepts/regional-endpoints) that
1823 # contains this job.
1824 &quot;transformNameMapping&quot;: { # The map of transform name prefixes of the job to be replaced to the
1825 # corresponding name prefixes of the new job.
1826 &quot;a_key&quot;: &quot;A String&quot;,
1827 },
1828 &quot;startTime&quot;: &quot;A String&quot;, # The timestamp when the job was started (transitioned to JOB_STATE_PENDING).
1829 # Flexible resource scheduling jobs are started with some delay after job
1830 # creation, so start_time is unset before start and is updated when the
1831 # job is started by the Cloud Dataflow service. For other jobs, start_time
1832 # always equals create_time and is immutable and set by the Cloud Dataflow
1833 # service.
1834 &quot;clientRequestId&quot;: &quot;A String&quot;, # The client&#x27;s unique identifier of the job, re-used across retried attempts.
1835 # If this field is set, the service will ensure its uniqueness.
1836 # The request to create a job will fail if the service has knowledge of a
1837 # previously submitted job with the same client&#x27;s ID and job name.
1838 # The caller may use this field to ensure idempotence of job
1839 # creation across retried attempts to create a job.
1840 # By default, the field is empty and, in that case, the service ignores it.
1841 &quot;executionInfo&quot;: { # Deprecated. Additional information about how a Cloud Dataflow job will be executed that
1842 # isn&#x27;t contained in the submitted job.
1843 &quot;stages&quot;: { # A mapping from each stage to the information about that stage.
1844 &quot;a_key&quot;: { # Contains information about how a particular
1845 # google.dataflow.v1beta3.Step will be executed.
1846 &quot;stepName&quot;: [ # The steps associated with the execution stage.
1847 # Note that stages may have several steps, and that a given step
1848 # might be run by more than one stage.
1849 &quot;A String&quot;,
1850 ],
1851 },
1852 },
1853 },
1854 &quot;type&quot;: &quot;A String&quot;, # The type of Cloud Dataflow job.
1855 &quot;createTime&quot;: &quot;A String&quot;, # The timestamp when the job was initially created. Immutable and set by the
1856 # Cloud Dataflow service.
1857 &quot;tempFiles&quot;: [ # A set of files the system should be aware of that are used
1858 # for temporary storage. These temporary files will be
1859 # removed on job completion.
1860 # No duplicates are allowed.
1861 # No file patterns are supported.
1862 #
1863 # The supported files are:
1864 #
1865 # Google Cloud Storage:
1866 #
1867 # storage.googleapis.com/{bucket}/{object}
1868 # bucket.storage.googleapis.com/{object}
1869 &quot;A String&quot;,
1870 ],
1871 &quot;id&quot;: &quot;A String&quot;, # The unique ID of this job.
1872 #
1873 # This field is set by the Cloud Dataflow service when the Job is
1874 # created, and is immutable for the life of the job.
1875 &quot;requestedState&quot;: &quot;A String&quot;, # The job&#x27;s requested state.
1876 #
1877 # `UpdateJob` may be used to switch between the `JOB_STATE_STOPPED` and
1878 # `JOB_STATE_RUNNING` states, by setting requested_state. `UpdateJob` may
1879 # also be used to directly set a job&#x27;s requested state to
1880 # `JOB_STATE_CANCELLED` or `JOB_STATE_DONE`, irrevocably terminating the
1881 # job if it has not already reached a terminal state.
1882 &quot;replaceJobId&quot;: &quot;A String&quot;, # If this job is an update of an existing job, this field is the job ID
1883 # of the job it replaced.
1884 #
1885 # When sending a `CreateJobRequest`, you can update a job by specifying it
1886 # here. The job named here is stopped, and its intermediate state is
1887 # transferred to this job.
1888 &quot;createdFromSnapshotId&quot;: &quot;A String&quot;, # If this is specified, the job&#x27;s initial state is populated from the given
1889 # snapshot.
1890 &quot;currentState&quot;: &quot;A String&quot;, # The current state of the job.
1891 #
1892 # Jobs are created in the `JOB_STATE_STOPPED` state unless otherwise
1893 # specified.
1894 #
1895 # A job in the `JOB_STATE_RUNNING` state may asynchronously enter a
1896 # terminal state. After a job has reached a terminal state, no
1897 # further state updates may be made.
1898 #
1899 # This field may be mutated by the Cloud Dataflow service;
1900 # callers cannot mutate it.
1901 &quot;name&quot;: &quot;A String&quot;, # The user-specified Cloud Dataflow job name.
1902 #
1903 # Only one Job with a given name may exist in a project at any
1904 # given time. If a caller attempts to create a Job with the same
1905 # name as an already-existing Job, the attempt returns the
1906 # existing Job.
1907 #
1908 # The name must match the regular expression
1909 # `[a-z]([-a-z0-9]{0,38}[a-z0-9])?`
1910 &quot;currentStateTime&quot;: &quot;A String&quot;, # The timestamp associated with the current state.
1911 }</pre>
1912</div>
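<p>The job <code>name</code> field above must match the regular expression <code>[a-z]([-a-z0-9]{0,38}[a-z0-9])?</code>. The following is a minimal sketch of a client-side check, assuming the pattern is meant to match the whole name. The helper <code>is_valid_job_name</code> is hypothetical and is not part of the client library; the service remains the authority on which names it accepts.</p>

```python
import re

# Pattern quoted from the "name" field documentation above.
JOB_NAME_PATTERN = re.compile(r"[a-z]([-a-z0-9]{0,38}[a-z0-9])?")


def is_valid_job_name(name):
    """Return True if `name` matches the documented pattern in full.

    Hypothetical helper for illustration; it assumes a full-string match.
    """
    return JOB_NAME_PATTERN.fullmatch(name) is not None


# Illustrative candidates only; none of these are real job names.
for candidate in ["wordcount-2020", "WordCount", "job-", "a" * 41]:
    print(candidate[:20], is_valid_job_name(candidate))
```

<p>Read this way, the pattern allows names of at most 40 characters that start with a lowercase letter and do not end with a hyphen.</p>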
1913
1914<div class="method">
1915 <code class="details" id="getMetrics">getMetrics(projectId, location, jobId, startTime=None, x__xgafv=None)</code>
1916 <pre>Request the job status.
1917
1918To request the status of a job, we recommend using
1919`projects.locations.jobs.getMetrics` with a [regional endpoint]
1920(https://cloud.google.com/dataflow/docs/concepts/regional-endpoints). Using
1921`projects.jobs.getMetrics` is not recommended, as you can only request the
1922status of jobs that are running in `us-central1`.
1923
1924Args:
1925 projectId: string, A project id. (required)
1926 location: string, The [regional endpoint]
1927(https://cloud.google.com/dataflow/docs/concepts/regional-endpoints) that
1928contains the job specified by job_id. (required)
1929 jobId: string, The job to get metrics for. (required)
1930 startTime: string, Return only metric data that has changed since this time.
1931Default is to return all information about all metrics for the job.
1932 x__xgafv: string, V1 error format.
1933 Allowed values
1934 1 - v1 error format
1935 2 - v2 error format
1936
1937Returns:
1938 An object of the form:
1939
1940 { # JobMetrics contains a collection of metrics describing the detailed progress
1941 # of a Dataflow job. Metrics correspond to user-defined and system-defined
1942 # metrics in the job.
1943 #
1944 # This resource captures only the most recent values of each metric;
1945 # time-series data can be queried for them (under the same metric names)
1946 # from Cloud Monitoring.
1947 &quot;metricTime&quot;: &quot;A String&quot;, # Timestamp as of which metric values are current.
1948 &quot;metrics&quot;: [ # All metrics for this job.
1949 { # Describes the state of a metric.
1950 &quot;distribution&quot;: &quot;&quot;, # A struct value describing properties of a distribution of numeric values.
1951 &quot;kind&quot;: &quot;A String&quot;, # Metric aggregation kind. The possible metric aggregation kinds are
1952 # &quot;Sum&quot;, &quot;Max&quot;, &quot;Min&quot;, &quot;Mean&quot;, &quot;Set&quot;, &quot;And&quot;, &quot;Or&quot;, and &quot;Distribution&quot;.
1953 # The specified aggregation kind is case-insensitive.
1954 #
1955 # If omitted, this is not an aggregated value but instead
1956 # a single metric sample value.
1957 &quot;gauge&quot;: &quot;&quot;, # A struct value describing properties of a Gauge.
1958 # Metrics of gauge type show the value of a metric across time, and are
1959 # aggregated based on the newest value.
1960 &quot;updateTime&quot;: &quot;A String&quot;, # Timestamp associated with the metric value. Optional when workers are
1961 # reporting work progress; it will be filled in responses from the
1962 # metrics API.
1963 &quot;scalar&quot;: &quot;&quot;, # Worker-computed aggregate value for aggregation kinds &quot;Sum&quot;, &quot;Max&quot;, &quot;Min&quot;,
1964 # &quot;And&quot;, and &quot;Or&quot;. The possible value types are Long, Double, and Boolean.
1965 &quot;cumulative&quot;: True or False, # True if this metric is reported as the total cumulative aggregate
1966 # value accumulated since the worker started working on this WorkItem.
1967 # By default this is false, indicating that this metric is reported
1968 # as a delta that is not associated with any WorkItem.
1969 &quot;name&quot;: { # Name of the metric. Identifies a metric by describing the source which generated the
1970 # metric.
1971 &quot;context&quot;: { # Zero or more labeled fields which identify the part of the job this
1972 # metric is associated with, such as the name of a step or collection.
1973 #
1974 # For example, built-in counters associated with steps will have
1975 # context[&#x27;step&#x27;] = &lt;step-name&gt;. Counters associated with PCollections
1976 # in the SDK will have context[&#x27;pcollection&#x27;] = &lt;pcollection-name&gt;.
1977 &quot;a_key&quot;: &quot;A String&quot;,
1978 },
1979 &quot;name&quot;: &quot;A String&quot;, # Worker-defined metric name.
1980 &quot;origin&quot;: &quot;A String&quot;, # Origin (namespace) of metric name. May be blank for user-defined metrics;
1981 # will be &quot;dataflow&quot; for metrics defined by the Dataflow service or SDK.
1982 },
1983 &quot;meanCount&quot;: &quot;&quot;, # Worker-computed aggregate value for the &quot;Mean&quot; aggregation kind.
1984 # This holds the count of the aggregated values and is used in combination
1985 # with mean_sum to obtain the actual mean aggregate value.
1986 # The only possible value type is Long.
1987 &quot;meanSum&quot;: &quot;&quot;, # Worker-computed aggregate value for the &quot;Mean&quot; aggregation kind.
1988 # This holds the sum of the aggregated values and is used in combination
1989 # with mean_count to obtain the actual mean aggregate value.
1990 # The only possible value types are Long and Double.
1991 &quot;set&quot;: &quot;&quot;, # Worker-computed aggregate value for the &quot;Set&quot; aggregation kind. The only
1992 # possible value type is a list of Values whose type can be Long, Double,
1993 # or String, according to the metric&#x27;s type. All Values in the list must
1994 # be of the same type.
1995 &quot;internal&quot;: &quot;&quot;, # Worker-computed aggregate value for internal use by the Dataflow
1996 # service.
1997 },
1998 ],
1999 }</pre>
2000</div>
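<p>The JobMetrics object described above stores a "Mean" metric as a separate sum and count, and identifies each metric by its origin, context, and name. The following is a minimal sketch of flattening such a response into plain Python values. The helpers <code>metric_value</code> and <code>summarize</code> and the <code>sample</code> dictionary are hypothetical and for illustration only. The sketch assumes the response has already been decoded into a dictionary, that <code>meanSum</code> and <code>meanCount</code> can be converted with <code>float()</code> and <code>int()</code>, and that a metric without a <code>kind</code> carries its sample value in <code>scalar</code>.</p>

```python
def metric_value(metric):
    """Return a plain Python value for one entry of JobMetrics["metrics"]."""
    # The aggregation kind is documented as case-insensitive.
    kind = (metric.get("kind") or "").lower()
    if kind == "mean":
        # A mean is reported as two parts: the sum and the count of the values.
        count = int(metric["meanCount"])
        total = float(metric["meanSum"])
        return total / count if count else None
    if kind == "set":
        return metric.get("set")
    if kind == "distribution":
        return metric.get("distribution")
    # Sum, Max, Min, And, Or; also used here for metrics with no kind (assumption).
    return metric.get("scalar")


def summarize(job_metrics):
    """Map (origin, step, metric name) to a value for every metric."""
    summary = {}
    for metric in job_metrics.get("metrics", []):
        name = metric["name"]
        step = name.get("context", {}).get("step", "")
        summary[(name.get("origin", ""), step, name["name"])] = metric_value(metric)
    return summary


# Made-up sample data shaped like the object documented above.
sample = {
    "metricTime": "2020-07-22T00:00:00Z",
    "metrics": [
        {
            "name": {"origin": "dataflow", "name": "ElementCount",
                     "context": {"step": "s1"}},
            "kind": "Sum",
            "scalar": 42,
        },
        {
            "name": {"origin": "user", "name": "latency_ms",
                     "context": {"step": "s2"}},
            "kind": "Mean",
            "meanSum": 10.0,
            "meanCount": "4",
        },
    ],
}

for key, value in summarize(sample).items():
    print(key, value)
```

<p>Keying on the step taken from <code>context</code> keeps counters from different steps apart. Metrics whose context uses other labels, such as <code>pcollection</code>, would need a different key.</p>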
2001
2002<div class="method">
2003 <code class="details" id="list">list(projectId, location, pageToken=None, view=None, pageSize=None, filter=None, x__xgafv=None)</code>
2004 <pre>List the jobs of a project.
2005
2006To list the jobs of a project in a region, we recommend using
2007`projects.locations.jobs.list` with a [regional endpoint]
2008(https://cloud.google.com/dataflow/docs/concepts/regional-endpoints). To
2009list all jobs across all regions, use `projects.jobs.aggregated`. Using
2010`projects.jobs.list` is not recommended, as you can only get the list of
2011jobs that are running in `us-central1`.
2012
2013Args:
2014 projectId: string, The project which owns the jobs. (required)
2015 location: string, The [regional endpoint]
2016(https://cloud.google.com/dataflow/docs/concepts/regional-endpoints) that
2017contains this job. (required)
2018 pageToken: string, Set this to the &#x27;next_page_token&#x27; field of a previous response
2019to request additional results in a long list.
2020 view: string, Level of information requested in response. Default is `JOB_VIEW_SUMMARY`.
2021 pageSize: integer, If there are many jobs, limit response to at most this many.
2022The actual number of jobs returned will be the lesser of pageSize
2023and an unspecified server-defined limit.
2024 filter: string, The kind of filter to use.
2025 x__xgafv: string, V1 error format.
2026 Allowed values
2027 1 - v1 error format
2028 2 - v2 error format
2029
2030Returns:
2031 An object of the form:
2032
2033 { # Response to a request to list Cloud Dataflow jobs in a project. This might
2034 # be a partial response, depending on the page size in the ListJobsRequest.
2035 # However, if the project does not have any jobs, an instance of
2036 # ListJobsResponse is not returned and the request&#x27;s response
2037 # body is empty {}.
2038 &quot;jobs&quot;: [ # A subset of the requested job information.
2039 { # Defines a job to be run by the Cloud Dataflow service.
2040 &quot;pipelineDescription&quot;: { # A descriptive representation of submitted pipeline as well as the executed
2041 # form. This data is provided by the Dataflow service for ease of visualizing
2042 # the pipeline and interpreting Dataflow provided metrics.
2043 # Preliminary field: The format of this data may change at any time.
2044 # A description of the user pipeline and stages through which it is executed.
2045 # Created by Cloud Dataflow service. Only retrieved with JOB_VIEW_DESCRIPTION or JOB_VIEW_ALL.
2046 &quot;displayData&quot;: [ # Pipeline level display data.
2047 { # Data provided with a pipeline or transform to provide descriptive info.
2048 &quot;url&quot;: &quot;A String&quot;, # An optional full URL.
2049 &quot;javaClassValue&quot;: &quot;A String&quot;, # Contains value if the data is of java class type.
2050 &quot;timestampValue&quot;: &quot;A String&quot;, # Contains value if the data is of timestamp type.
2051 &quot;durationValue&quot;: &quot;A String&quot;, # Contains value if the data is of duration type.
2052 &quot;label&quot;: &quot;A String&quot;, # An optional label to display in a dax UI for the element.
2053 &quot;key&quot;: &quot;A String&quot;, # The key identifying the display data.
2054 # This is intended to be used as a label for the display data
2055 # when viewed in a dax monitoring system.
2056 &quot;namespace&quot;: &quot;A String&quot;, # The namespace for the key. This is usually a class name or programming
                    # language namespace (e.g. Python module) which defines the display data.
2058 # This allows a dax monitoring system to specially handle the data
2059 # and perform custom rendering.
2060 &quot;floatValue&quot;: 3.14, # Contains value if the data is of float type.
2061 &quot;strValue&quot;: &quot;A String&quot;, # Contains value if the data is of string type.
2062 &quot;int64Value&quot;: &quot;A String&quot;, # Contains value if the data is of int64 type.
2063 &quot;boolValue&quot;: True or False, # Contains value if the data is of a boolean type.
2064 &quot;shortStrValue&quot;: &quot;A String&quot;, # A possible additional shorter value to display.
2065 # For example a java_class_name_value of com.mypackage.MyDoFn
2066 # will be stored with MyDoFn as the short_str_value and
2067 # com.mypackage.MyDoFn as the java_class_name value.
2068 # short_str_value can be displayed and java_class_name_value
2069 # will be displayed as a tooltip.
              },
            ],
2072 &quot;originalPipelineTransform&quot;: [ # Description of each transform in the pipeline and collections between them.
2073 { # Description of the type, names/ids, and input/outputs for a transform.
2074 &quot;outputCollectionName&quot;: [ # User names for all collection outputs to this transform.
                  &quot;A String&quot;,
                ],
                &quot;displayData&quot;: [ # Transform-specific display data.
2078 { # Data provided with a pipeline or transform to provide descriptive info.
2079 &quot;url&quot;: &quot;A String&quot;, # An optional full URL.
2080 &quot;javaClassValue&quot;: &quot;A String&quot;, # Contains value if the data is of java class type.
2081 &quot;timestampValue&quot;: &quot;A String&quot;, # Contains value if the data is of timestamp type.
2082 &quot;durationValue&quot;: &quot;A String&quot;, # Contains value if the data is of duration type.
2083 &quot;label&quot;: &quot;A String&quot;, # An optional label to display in a dax UI for the element.
2084 &quot;key&quot;: &quot;A String&quot;, # The key identifying the display data.
2085 # This is intended to be used as a label for the display data
2086 # when viewed in a dax monitoring system.
2087 &quot;namespace&quot;: &quot;A String&quot;, # The namespace for the key. This is usually a class name or programming
                        # language namespace (e.g. Python module) which defines the display data.
2089 # This allows a dax monitoring system to specially handle the data
2090 # and perform custom rendering.
2091 &quot;floatValue&quot;: 3.14, # Contains value if the data is of float type.
2092 &quot;strValue&quot;: &quot;A String&quot;, # Contains value if the data is of string type.
2093 &quot;int64Value&quot;: &quot;A String&quot;, # Contains value if the data is of int64 type.
2094 &quot;boolValue&quot;: True or False, # Contains value if the data is of a boolean type.
2095 &quot;shortStrValue&quot;: &quot;A String&quot;, # A possible additional shorter value to display.
2096 # For example a java_class_name_value of com.mypackage.MyDoFn
2097 # will be stored with MyDoFn as the short_str_value and
2098 # com.mypackage.MyDoFn as the java_class_name value.
2099 # short_str_value can be displayed and java_class_name_value
2100 # will be displayed as a tooltip.
2101 },
2102 ],
2103 &quot;id&quot;: &quot;A String&quot;, # SDK generated id of this transform instance.
2104 &quot;inputCollectionName&quot;: [ # User names for all collection inputs to this transform.
2105 &quot;A String&quot;,
2106 ],
2107 &quot;name&quot;: &quot;A String&quot;, # User provided name for this transform instance.
2108 &quot;kind&quot;: &quot;A String&quot;, # Type of transform.
2109 },
2110 ],
2111 &quot;executionPipelineStage&quot;: [ # Description of each stage of execution of the pipeline.
2112 { # Description of the composing transforms, names/ids, and input/outputs of a
2113 # stage of execution. Some composing transforms and sources may have been
2114 # generated by the Dataflow service during execution planning.
2115 &quot;componentSource&quot;: [ # Collections produced and consumed by component transforms of this stage.
2116 { # Description of an interstitial value between transforms in an execution
2117 # stage.
2118 &quot;userName&quot;: &quot;A String&quot;, # Human-readable name for this transform; may be user or system generated.
2119 &quot;name&quot;: &quot;A String&quot;, # Dataflow service generated name for this source.
2120 &quot;originalTransformOrCollection&quot;: &quot;A String&quot;, # User name for the original user transform or collection with which this
2121 # source is most closely associated.
2122 },
2123 ],
2124 &quot;inputSource&quot;: [ # Input sources for this stage.
2125 { # Description of an input or output of an execution stage.
2126 &quot;userName&quot;: &quot;A String&quot;, # Human-readable name for this source; may be user or system generated.
2127 &quot;originalTransformOrCollection&quot;: &quot;A String&quot;, # User name for the original user transform or collection with which this
2128 # source is most closely associated.
2129 &quot;sizeBytes&quot;: &quot;A String&quot;, # Size of the source, if measurable.
2130 &quot;name&quot;: &quot;A String&quot;, # Dataflow service generated name for this source.
2131 },
2132 ],
2133 &quot;name&quot;: &quot;A String&quot;, # Dataflow service generated name for this stage.
2134 &quot;componentTransform&quot;: [ # Transforms that comprise this execution stage.
2135 { # Description of a transform executed as part of an execution stage.
2136 &quot;name&quot;: &quot;A String&quot;, # Dataflow service generated name for this source.
2137 &quot;userName&quot;: &quot;A String&quot;, # Human-readable name for this transform; may be user or system generated.
2138 &quot;originalTransform&quot;: &quot;A String&quot;, # User name for the original user transform with which this transform is
2139 # most closely associated.
2140 },
2141 ],
2142 &quot;id&quot;: &quot;A String&quot;, # Dataflow service generated id for this stage.
2143 &quot;outputSource&quot;: [ # Output sources for this stage.
2144 { # Description of an input or output of an execution stage.
2145 &quot;userName&quot;: &quot;A String&quot;, # Human-readable name for this source; may be user or system generated.
2146 &quot;originalTransformOrCollection&quot;: &quot;A String&quot;, # User name for the original user transform or collection with which this
2147 # source is most closely associated.
2148 &quot;sizeBytes&quot;: &quot;A String&quot;, # Size of the source, if measurable.
2149 &quot;name&quot;: &quot;A String&quot;, # Dataflow service generated name for this source.
2150 },
2151 ],
                &quot;kind&quot;: &quot;A String&quot;, # Type of transform this stage is executing.
2153 },
2154 ],
2155 },
2156 &quot;labels&quot;: { # User-defined labels for this job.
2157 #
2158 # The labels map can contain no more than 64 entries. Entries of the labels
2159 # map are UTF8 strings that comply with the following restrictions:
2160 #
2161 # * Keys must conform to regexp: \p{Ll}\p{Lo}{0,62}
2162 # * Values must conform to regexp: [\p{Ll}\p{Lo}\p{N}_-]{0,63}
2163 # * Both keys and values are additionally constrained to be &lt;= 128 bytes in
2164 # size.
2165 &quot;a_key&quot;: &quot;A String&quot;,
2166 },
2167 &quot;projectId&quot;: &quot;A String&quot;, # The ID of the Cloud Platform project that the job belongs to.
2168 &quot;environment&quot;: { # Describes the environment in which a Dataflow Job runs. # The environment for the job.
2169 &quot;flexResourceSchedulingGoal&quot;: &quot;A String&quot;, # Which Flexible Resource Scheduling mode to run in.
2170 &quot;workerRegion&quot;: &quot;A String&quot;, # The Compute Engine region
2171 # (https://cloud.google.com/compute/docs/regions-zones/regions-zones) in
2172 # which worker processing should occur, e.g. &quot;us-west1&quot;. Mutually exclusive
2173 # with worker_zone. If neither worker_region nor worker_zone is specified,
2174 # default to the control plane&#x27;s region.
2175 &quot;userAgent&quot;: { # A description of the process that generated the request.
2176 &quot;a_key&quot;: &quot;&quot;, # Properties of the object.
2177 },
2178 &quot;serviceAccountEmail&quot;: &quot;A String&quot;, # Identity to run virtual machines as. Defaults to the default account.
2179 &quot;version&quot;: { # A structure describing which components and their versions of the service
2180 # are required in order to run the job.
2181 &quot;a_key&quot;: &quot;&quot;, # Properties of the object.
2182 },
2183 &quot;serviceKmsKeyName&quot;: &quot;A String&quot;, # If set, contains the Cloud KMS key identifier used to encrypt data
2184 # at rest, AKA a Customer Managed Encryption Key (CMEK).
2185 #
2186 # Format:
2187 # projects/PROJECT_ID/locations/LOCATION/keyRings/KEY_RING/cryptoKeys/KEY
2188 &quot;experiments&quot;: [ # The list of experiments to enable.
2189 &quot;A String&quot;,
2190 ],
2191 &quot;workerZone&quot;: &quot;A String&quot;, # The Compute Engine zone
2192 # (https://cloud.google.com/compute/docs/regions-zones/regions-zones) in
2193 # which worker processing should occur, e.g. &quot;us-west1-a&quot;. Mutually exclusive
2194 # with worker_region. If neither worker_region nor worker_zone is specified,
2195 # a zone in the control plane&#x27;s region is chosen based on available capacity.
2196 &quot;workerPools&quot;: [ # The worker pools. At least one &quot;harness&quot; worker pool must be
2197 # specified in order for the job to have workers.
2198 { # Describes one particular pool of Cloud Dataflow workers to be
2199 # instantiated by the Cloud Dataflow service in order to perform the
2200 # computations required by a job. Note that a workflow job may use
2201 # multiple pools, in order to match the various computational
2202 # requirements of the various stages of the job.
2203 &quot;onHostMaintenance&quot;: &quot;A String&quot;, # The action to take on host maintenance, as defined by the Google
2204 # Compute Engine API.
2205 &quot;sdkHarnessContainerImages&quot;: [ # Set of SDK harness containers needed to execute this pipeline. This will
2206 # only be set in the Fn API path. For non-cross-language pipelines this
2207 # should have only one entry. Cross-language pipelines will have two or more
2208 # entries.
2209 { # Defines a SDK harness container for executing Dataflow pipelines.
2210 &quot;containerImage&quot;: &quot;A String&quot;, # A docker container image that resides in Google Container Registry.
2211 &quot;useSingleCorePerContainer&quot;: True or False, # If true, recommends the Dataflow service to use only one core per SDK
2212 # container instance with this image. If false (or unset) recommends using
2213 # more than one core per SDK container instance with this image for
2214 # efficiency. Note that Dataflow service may choose to override this property
2215 # if needed.
2216 },
2217 ],
2218 &quot;zone&quot;: &quot;A String&quot;, # Zone to run the worker pools in. If empty or unspecified, the service
2219 # will attempt to choose a reasonable default.
2220 &quot;kind&quot;: &quot;A String&quot;, # The kind of the worker pool; currently only `harness` and `shuffle`
2221 # are supported.
2222 &quot;metadata&quot;: { # Metadata to set on the Google Compute Engine VMs.
2223 &quot;a_key&quot;: &quot;A String&quot;,
2224 },
2225 &quot;diskSourceImage&quot;: &quot;A String&quot;, # Fully qualified source image for disks.
2226 &quot;dataDisks&quot;: [ # Data disks that are used by a VM in this workflow.
2227 { # Describes the data disk used by a workflow job.
2228 &quot;sizeGb&quot;: 42, # Size of disk in GB. If zero or unspecified, the service will
2229 # attempt to choose a reasonable default.
2230 &quot;diskType&quot;: &quot;A String&quot;, # Disk storage type, as defined by Google Compute Engine. This
2231 # must be a disk type appropriate to the project and zone in which
2232 # the workers will run. If unknown or unspecified, the service
2233 # will attempt to choose a reasonable default.
2234 #
2235 # For example, the standard persistent disk type is a resource name
2236 # typically ending in &quot;pd-standard&quot;. If SSD persistent disks are
2237 # available, the resource name typically ends with &quot;pd-ssd&quot;. The
                        # actual valid values are defined by the Google Compute Engine API,
2239 # not by the Cloud Dataflow API; consult the Google Compute Engine
2240 # documentation for more information about determining the set of
2241 # available disk types for a particular project and zone.
2242 #
2243 # Google Compute Engine Disk types are local to a particular
2244 # project in a particular zone, and so the resource name will
2245 # typically look something like this:
2246 #
2247 # compute.googleapis.com/projects/project-id/zones/zone/diskTypes/pd-standard
2248 &quot;mountPoint&quot;: &quot;A String&quot;, # Directory in a VM where disk is mounted.
2249 },
2250 ],
2251 &quot;packages&quot;: [ # Packages to be installed on workers.
2252 { # The packages that must be installed in order for a worker to run the
2253 # steps of the Cloud Dataflow job that will be assigned to its worker
2254 # pool.
                      #
                      # This is the mechanism by which the Cloud Dataflow SDK causes code to
2257 # be loaded onto the workers. For example, the Cloud Dataflow Java SDK
2258 # might use this to install jars containing the user&#x27;s code and all of the
2259 # various dependencies (libraries, data files, etc.) required in order
2260 # for that code to run.
2261 &quot;name&quot;: &quot;A String&quot;, # The name of the package.
2262 &quot;location&quot;: &quot;A String&quot;, # The resource to read the package from. The supported resource type is:
2263 #
2264 # Google Cloud Storage:
2265 #
2266 # storage.googleapis.com/{bucket}
2267 # bucket.storage.googleapis.com/
2268 },
2269 ],
                &quot;teardownPolicy&quot;: &quot;A String&quot;, # Sets the policy for determining when to tear down the worker pool.
2271 # Allowed values are: `TEARDOWN_ALWAYS`, `TEARDOWN_ON_SUCCESS`, and
2272 # `TEARDOWN_NEVER`.
2273 # `TEARDOWN_ALWAYS` means workers are always torn down regardless of whether
2274 # the job succeeds. `TEARDOWN_ON_SUCCESS` means workers are torn down
2275 # if the job succeeds. `TEARDOWN_NEVER` means the workers are never torn
2276 # down.
2277 #
2278 # If the workers are not torn down by the service, they will
2279 # continue to run and use Google Compute Engine VM resources in the
2280 # user&#x27;s project until they are explicitly terminated by the user.
2281 # Because of this, Google recommends using the `TEARDOWN_ALWAYS`
2282 # policy except for small, manually supervised test jobs.
2283 #
2284 # If unknown or unspecified, the service will attempt to choose a reasonable
2285 # default.
2286 &quot;network&quot;: &quot;A String&quot;, # Network to which VMs will be assigned. If empty or unspecified,
2287 # the service will use the network &quot;default&quot;.
2288 &quot;ipConfiguration&quot;: &quot;A String&quot;, # Configuration for VM IPs.
2289 &quot;diskSizeGb&quot;: 42, # Size of root disk for VMs, in GB. If zero or unspecified, the service will
2290 # attempt to choose a reasonable default.
2291 &quot;autoscalingSettings&quot;: { # Settings for WorkerPool autoscaling. # Settings for autoscaling of this WorkerPool.
2292 &quot;maxNumWorkers&quot;: 42, # The maximum number of workers to cap scaling at.
2293 &quot;algorithm&quot;: &quot;A String&quot;, # The algorithm to use for autoscaling.
2294 },
2295 &quot;poolArgs&quot;: { # Extra arguments for this worker pool.
2296 &quot;a_key&quot;: &quot;&quot;, # Properties of the object. Contains field @type with type URL.
2297 },
2298 &quot;subnetwork&quot;: &quot;A String&quot;, # Subnetwork to which VMs will be assigned, if desired. Expected to be of
2299 # the form &quot;regions/REGION/subnetworks/SUBNETWORK&quot;.
2300 &quot;numWorkers&quot;: 42, # Number of Google Compute Engine workers in this pool needed to
2301 # execute the job. If zero or unspecified, the service will
2302 # attempt to choose a reasonable default.
2303 &quot;numThreadsPerWorker&quot;: 42, # The number of threads per worker harness. If empty or unspecified, the
2304 # service will choose a number of threads (according to the number of cores
2305 # on the selected machine type for batch, or 1 by convention for streaming).
2306 &quot;workerHarnessContainerImage&quot;: &quot;A String&quot;, # Required. Docker container image that executes the Cloud Dataflow worker
2307 # harness, residing in Google Container Registry.
2308 #
2309 # Deprecated for the Fn API path. Use sdk_harness_container_images instead.
2310 &quot;taskrunnerSettings&quot;: { # Taskrunner configuration settings. # Settings passed through to Google Compute Engine workers when
2311 # using the standard Dataflow task runner. Users should ignore
2312 # this field.
2313 &quot;dataflowApiVersion&quot;: &quot;A String&quot;, # The API version of endpoint, e.g. &quot;v1b3&quot;
2314 &quot;oauthScopes&quot;: [ # The OAuth2 scopes to be requested by the taskrunner in order to
2315 # access the Cloud Dataflow API.
2316 &quot;A String&quot;,
2317 ],
2318 &quot;baseUrl&quot;: &quot;A String&quot;, # The base URL for the taskrunner to use when accessing Google Cloud APIs.
                      #
2320 # When workers access Google Cloud APIs, they logically do so via
2321 # relative URLs. If this field is specified, it supplies the base
2322 # URL to use for resolving these relative URLs. The normative
2323 # algorithm used is defined by RFC 1808, &quot;Relative Uniform Resource
2324 # Locators&quot;.
2325 #
2326 # If not specified, the default value is &quot;http://www.googleapis.com/&quot;
                  &quot;workflowFileName&quot;: &quot;A String&quot;, # The file to store the workflow in.
2328 &quot;logToSerialconsole&quot;: True or False, # Whether to send taskrunner log info to Google Compute Engine VM serial
2329 # console.
2330 &quot;baseTaskDir&quot;: &quot;A String&quot;, # The location on the worker for task-specific subdirectories.
2331 &quot;taskUser&quot;: &quot;A String&quot;, # The UNIX user ID on the worker VM to use for tasks launched by
2332 # taskrunner; e.g. &quot;root&quot;.
2333 &quot;vmId&quot;: &quot;A String&quot;, # The ID string of the VM.
2334 &quot;alsologtostderr&quot;: True or False, # Whether to also send taskrunner log info to stderr.
2335 &quot;parallelWorkerSettings&quot;: { # Provides data to pass through to the worker harness. # The settings to pass to the parallel worker harness.
2336 &quot;shuffleServicePath&quot;: &quot;A String&quot;, # The Shuffle service path relative to the root URL, for example,
2337 # &quot;shuffle/v1beta1&quot;.
2338 &quot;tempStoragePrefix&quot;: &quot;A String&quot;, # The prefix of the resources the system should use for temporary
2339 # storage.
2340 #
2341 # The supported resource type is:
2342 #
2343 # Google Cloud Storage:
2344 #
2345 # storage.googleapis.com/{bucket}/{object}
2346 # bucket.storage.googleapis.com/{object}
2347 &quot;reportingEnabled&quot;: True or False, # Whether to send work progress updates to the service.
2348 &quot;servicePath&quot;: &quot;A String&quot;, # The Cloud Dataflow service path relative to the root URL, for example,
2349 # &quot;dataflow/v1b3/projects&quot;.
2350 &quot;baseUrl&quot;: &quot;A String&quot;, # The base URL for accessing Google Cloud APIs.
2351 #
2352 # When workers access Google Cloud APIs, they logically do so via
2353 # relative URLs. If this field is specified, it supplies the base
2354 # URL to use for resolving these relative URLs. The normative
2355 # algorithm used is defined by RFC 1808, &quot;Relative Uniform Resource
2356 # Locators&quot;.
2357 #
2358 # If not specified, the default value is &quot;http://www.googleapis.com/&quot;
2359 &quot;workerId&quot;: &quot;A String&quot;, # The ID of the worker running this pipeline.
2360 },
2361 &quot;harnessCommand&quot;: &quot;A String&quot;, # The command to launch the worker harness.
2362 &quot;logDir&quot;: &quot;A String&quot;, # The directory on the VM to store logs.
2363 &quot;streamingWorkerMainClass&quot;: &quot;A String&quot;, # The streaming worker main class name.
2364 &quot;languageHint&quot;: &quot;A String&quot;, # The suggested backend language.
2365 &quot;taskGroup&quot;: &quot;A String&quot;, # The UNIX group ID on the worker VM to use for tasks launched by
2366 # taskrunner; e.g. &quot;wheel&quot;.
2367 &quot;logUploadLocation&quot;: &quot;A String&quot;, # Indicates where to put logs. If this is not specified, the logs
2368 # will not be uploaded.
2369 #
2370 # The supported resource type is:
2371 #
2372 # Google Cloud Storage:
2373 # storage.googleapis.com/{bucket}/{object}
2374 # bucket.storage.googleapis.com/{object}
2375 &quot;commandlinesFileName&quot;: &quot;A String&quot;, # The file to store preprocessing commands in.
2376 &quot;continueOnException&quot;: True or False, # Whether to continue taskrunner if an exception is hit.
2377 &quot;tempStoragePrefix&quot;: &quot;A String&quot;, # The prefix of the resources the taskrunner should use for
2378 # temporary storage.
2379 #
2380 # The supported resource type is:
2381 #
2382 # Google Cloud Storage:
2383 # storage.googleapis.com/{bucket}/{object}
2384 # bucket.storage.googleapis.com/{object}
                },
                &quot;diskType&quot;: &quot;A String&quot;, # Type of root disk for VMs. If empty or unspecified, the service will
2387 # attempt to choose a reasonable default.
2388 &quot;defaultPackageSet&quot;: &quot;A String&quot;, # The default package set to install. This allows the service to
2389 # select a default set of packages which are useful to worker
2390 # harnesses written in a particular language.
2391 &quot;machineType&quot;: &quot;A String&quot;, # Machine type (e.g. &quot;n1-standard-1&quot;). If empty or unspecified, the
2392 # service will attempt to choose a reasonable default.
              },
            ],
2395 &quot;tempStoragePrefix&quot;: &quot;A String&quot;, # The prefix of the resources the system should use for temporary
                # storage. The system will append the suffix &quot;/temp-{JOBNAME}&quot; to
2397 # this resource prefix, where {JOBNAME} is the value of the
2398 # job_name field. The resulting bucket and object prefix is used
2399 # as the prefix of the resources used to store temporary data
2400 # needed during the job execution. NOTE: This will override the
2401 # value in taskrunner_settings.
2402 # The supported resource type is:
2403 #
2404 # Google Cloud Storage:
2405 #
2406 # storage.googleapis.com/{bucket}/{object}
2407 # bucket.storage.googleapis.com/{object}
2408 &quot;internalExperiments&quot;: { # Experimental settings.
2409 &quot;a_key&quot;: &quot;&quot;, # Properties of the object. Contains field @type with type URL.
            },
            &quot;sdkPipelineOptions&quot;: { # The Cloud Dataflow SDK pipeline options specified by the user. These
2412 # options are passed through the service and are used to recreate the
2413 # SDK pipeline options on the worker in a language agnostic and platform
2414 # independent way.
              &quot;a_key&quot;: &quot;&quot;, # Properties of the object.
            },
            &quot;dataset&quot;: &quot;A String&quot;, # The dataset for the current project where various workflow
2418 # related tables are stored.
2419 #
2420 # The supported resource type is:
2421 #
2422 # Google BigQuery:
2423 # bigquery.googleapis.com/{dataset}
2424 &quot;clusterManagerApiService&quot;: &quot;A String&quot;, # The type of cluster manager API to use. If unknown or
2425 # unspecified, the service will attempt to choose a reasonable
2426 # default. This should be in the form of the API service name,
2427 # e.g. &quot;compute.googleapis.com&quot;.
          },
          &quot;stepsLocation&quot;: &quot;A String&quot;, # The GCS location where the steps are stored.
          &quot;steps&quot;: [ # Exactly one of steps or steps_location should be specified.
2431 #
2432 # The top-level steps that constitute the entire job.
2433 { # Defines a particular step within a Cloud Dataflow job.
2434 #
2435 # A job consists of multiple steps, each of which performs some
2436 # specific operation as part of the overall job. Data is typically
2437 # passed from one step to another as part of the job.
2438 #
2439 # Here&#x27;s an example of a sequence of steps which together implement a
2440 # Map-Reduce job:
2441 #
2442 # * Read a collection of data from some source, parsing the
2443 # collection&#x27;s elements.
2444 #
2445 # * Validate the elements.
2446 #
2447 # * Apply a user-defined function to map each element to some value
2448 # and extract an element-specific key value.
2449 #
2450 # * Group elements with the same key into a single element with
2451 # that key, transforming a multiply-keyed collection into a
2452 # uniquely-keyed collection.
2453 #
2454 # * Write the elements out to some data sink.
2455 #
2456 # Note that the Cloud Dataflow service may be used to run many different
2457 # types of jobs, not just Map-Reduce.
2458 &quot;kind&quot;: &quot;A String&quot;, # The kind of step in the Cloud Dataflow job.
2459 &quot;properties&quot;: { # Named properties associated with the step. Each kind of
2460 # predefined step has its own required set of properties.
2461 # Must be provided on Create. Only retrieved with JOB_VIEW_ALL.
2462 &quot;a_key&quot;: &quot;&quot;, # Properties of the object.
2463 },
2464 &quot;name&quot;: &quot;A String&quot;, # The name that identifies the step. This must be unique for each
2465 # step with respect to all other steps in the Cloud Dataflow job.
2466 },
2467 ],
2468 &quot;stageStates&quot;: [ # This field may be mutated by the Cloud Dataflow service;
2469 # callers cannot mutate it.
2470 { # A message describing the state of a particular execution stage.
              &quot;executionStageState&quot;: &quot;A String&quot;, # Execution stage states allow the same set of values as JobState.
2472 &quot;executionStageName&quot;: &quot;A String&quot;, # The name of the execution stage.
2473 &quot;currentStateTime&quot;: &quot;A String&quot;, # The time at which the stage transitioned to this state.
2474 },
2475 ],
2476 &quot;replacedByJobId&quot;: &quot;A String&quot;, # If another job is an update of this job (and thus, this job is in
2477 # `JOB_STATE_UPDATED`), this field contains the ID of that job.
          &quot;jobMetadata&quot;: { # Metadata available primarily for filtering jobs. Will be included in the
              # ListJob response and Job SUMMARY view.
              #
              # This field is populated by the Dataflow service to support filtering jobs
              # by the metadata values provided here. Populated for ListJobs and all GetJob
              # views SUMMARY and higher.
2482 &quot;sdkVersion&quot;: { # The version of the SDK used to run the job. # The SDK version used to run the job.
2483 &quot;sdkSupportStatus&quot;: &quot;A String&quot;, # The support status for this SDK version.
2484 &quot;versionDisplayName&quot;: &quot;A String&quot;, # A readable string describing the version of the SDK.
2485 &quot;version&quot;: &quot;A String&quot;, # The version of the SDK used to run the job.
2486 },
2487 &quot;bigTableDetails&quot;: [ # Identification of a BigTable source used in the Dataflow job.
2488 { # Metadata for a BigTable connector used by the job.
2489 &quot;instanceId&quot;: &quot;A String&quot;, # InstanceId accessed in the connection.
2490 &quot;tableId&quot;: &quot;A String&quot;, # TableId accessed in the connection.
2491 &quot;projectId&quot;: &quot;A String&quot;, # ProjectId accessed in the connection.
2492 },
2493 ],
2494 &quot;pubsubDetails&quot;: [ # Identification of a PubSub source used in the Dataflow job.
2495 { # Metadata for a PubSub connector used by the job.
2496 &quot;subscription&quot;: &quot;A String&quot;, # Subscription used in the connection.
2497 &quot;topic&quot;: &quot;A String&quot;, # Topic accessed in the connection.
2498 },
2499 ],
2500 &quot;bigqueryDetails&quot;: [ # Identification of a BigQuery source used in the Dataflow job.
2501 { # Metadata for a BigQuery connector used by the job.
2502 &quot;dataset&quot;: &quot;A String&quot;, # Dataset accessed in the connection.
2503 &quot;projectId&quot;: &quot;A String&quot;, # Project accessed in the connection.
2504 &quot;query&quot;: &quot;A String&quot;, # Query used to access data in the connection.
2505 &quot;table&quot;: &quot;A String&quot;, # Table accessed in the connection.
2506 },
2507 ],
2508 &quot;fileDetails&quot;: [ # Identification of a File source used in the Dataflow job.
2509 { # Metadata for a File connector used by the job.
2510 &quot;filePattern&quot;: &quot;A String&quot;, # File Pattern used to access files by the connector.
2511 },
2512 ],
2513 &quot;datastoreDetails&quot;: [ # Identification of a Datastore source used in the Dataflow job.
2514 { # Metadata for a Datastore connector used by the job.
2515 &quot;namespace&quot;: &quot;A String&quot;, # Namespace used in the connection.
2516 &quot;projectId&quot;: &quot;A String&quot;, # ProjectId accessed in the connection.
2517 },
2518 ],
2519 &quot;spannerDetails&quot;: [ # Identification of a Spanner source used in the Dataflow job.
2520 { # Metadata for a Spanner connector used by the job.
2521 &quot;instanceId&quot;: &quot;A String&quot;, # InstanceId accessed in the connection.
2522 &quot;databaseId&quot;: &quot;A String&quot;, # DatabaseId accessed in the connection.
2523 &quot;projectId&quot;: &quot;A String&quot;, # ProjectId accessed in the connection.
2524 },
2525 ],
2526 },
2527 &quot;location&quot;: &quot;A String&quot;, # The [regional endpoint]
2528 # (https://cloud.google.com/dataflow/docs/concepts/regional-endpoints) that
2529 # contains this job.
2530 &quot;transformNameMapping&quot;: { # The map of transform name prefixes of the job to be replaced to the
2531 # corresponding name prefixes of the new job.
2532 &quot;a_key&quot;: &quot;A String&quot;,
2533 },
2534 &quot;startTime&quot;: &quot;A String&quot;, # The timestamp when the job was started (transitioned to JOB_STATE_PENDING).
2535 # Flexible resource scheduling jobs are started with some delay after job
2536 # creation, so start_time is unset before start and is updated when the
2537 # job is started by the Cloud Dataflow service. For other jobs, start_time
2538 # always equals create_time and is immutable and set by the Cloud Dataflow
2539 # service.
2540 &quot;clientRequestId&quot;: &quot;A String&quot;, # The client&#x27;s unique identifier of the job, re-used across retried attempts.
2541 # If this field is set, the service will ensure its uniqueness.
2542 # The request to create a job will fail if the service has knowledge of a
2543 # previously submitted job with the same client&#x27;s ID and job name.
2544 # The caller may use this field to ensure idempotence of job
2545 # creation across retried attempts to create a job.
2546 # By default, the field is empty and, in that case, the service ignores it.
2547 &quot;executionInfo&quot;: { # Deprecated. Additional information about how a Cloud Dataflow job will be executed that
2548 # isn&#x27;t contained in the submitted job.
2549 &quot;stages&quot;: { # A mapping from each stage to the information about that stage.
2550 &quot;a_key&quot;: { # Contains information about how a particular
2551 # google.dataflow.v1beta3.Step will be executed.
2552 &quot;stepName&quot;: [ # The steps associated with the execution stage.
2553 # Note that stages may have several steps, and that a given step
2554 # might be run by more than one stage.
2555 &quot;A String&quot;,
2556 ],
2557 },
2558 },
2559 },
2560 &quot;type&quot;: &quot;A String&quot;, # The type of Cloud Dataflow job.
2561 &quot;createTime&quot;: &quot;A String&quot;, # The timestamp when the job was initially created. Immutable and set by the
2562 # Cloud Dataflow service.
2563 &quot;tempFiles&quot;: [ # A set of files the system should be aware of that are used
2564 # for temporary storage. These temporary files will be
2565 # removed on job completion.
2566 # No duplicates are allowed.
2567 # No file patterns are supported.
2568 #
2569 # The supported files are:
2570 #
2571 # Google Cloud Storage:
2572 #
2573 # storage.googleapis.com/{bucket}/{object}
2574 # bucket.storage.googleapis.com/{object}
2575 &quot;A String&quot;,
2576 ],
2577 &quot;id&quot;: &quot;A String&quot;, # The unique ID of this job.
2578 #
2579 # This field is set by the Cloud Dataflow service when the Job is
2580 # created, and is immutable for the life of the job.
2581 &quot;requestedState&quot;: &quot;A String&quot;, # The job&#x27;s requested state.
2582 #
2583 # `UpdateJob` may be used to switch between the `JOB_STATE_STOPPED` and
2584 # `JOB_STATE_RUNNING` states, by setting requested_state. `UpdateJob` may
2585 # also be used to directly set a job&#x27;s requested state to
2586 # `JOB_STATE_CANCELLED` or `JOB_STATE_DONE`, irrevocably terminating the
2587 # job if it has not already reached a terminal state.
2588 &quot;replaceJobId&quot;: &quot;A String&quot;, # If this job is an update of an existing job, this field is the job ID
2589 # of the job it replaced.
2590 #
2591 # When sending a `CreateJobRequest`, you can update a job by specifying it
2592 # here. The job named here is stopped, and its intermediate state is
2593 # transferred to this job.
2594 &quot;createdFromSnapshotId&quot;: &quot;A String&quot;, # If this is specified, the job&#x27;s initial state is populated from the given
2595 # snapshot.
2596 &quot;currentState&quot;: &quot;A String&quot;, # The current state of the job.
2597 #
2598 # Jobs are created in the `JOB_STATE_STOPPED` state unless otherwise
2599 # specified.
2600 #
2601 # A job in the `JOB_STATE_RUNNING` state may asynchronously enter a
2602 # terminal state. After a job has reached a terminal state, no
2603 # further state updates may be made.
2604 #
2605 # This field may be mutated by the Cloud Dataflow service;
2606 # callers cannot mutate it.
2607 &quot;name&quot;: &quot;A String&quot;, # The user-specified Cloud Dataflow job name.
2608 #
2609 # Only one Job with a given name may exist in a project at any
2610 # given time. If a caller attempts to create a Job with the same
2611 # name as an already-existing Job, the attempt returns the
2612 # existing Job.
2613 #
2614 # The name must match the regular expression
2615 # `[a-z]([-a-z0-9]{0,38}[a-z0-9])?`
2616 &quot;currentStateTime&quot;: &quot;A String&quot;, # The timestamp associated with the current state.
2617 },
2618 ],
2619 &quot;nextPageToken&quot;: &quot;A String&quot;, # Set if there may be more results than fit in this response.
2620 &quot;failedLocation&quot;: [ # Zero or more messages describing the [regional endpoints]
2621 # (https://cloud.google.com/dataflow/docs/concepts/regional-endpoints) that
2622 # failed to respond.
2623 { # Indicates which [regional endpoint]
2624 # (https://cloud.google.com/dataflow/docs/concepts/regional-endpoints) failed
2625 # to respond to a request for data.
2626 &quot;name&quot;: &quot;A String&quot;, # The name of the [regional endpoint]
2627 # (https://cloud.google.com/dataflow/docs/concepts/regional-endpoints) that
2628 # failed to respond.
2629 },
2630 ],
2631 }</pre>
2632</div>
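<p>The job name pattern quoted in the <code>name</code> field description above can be checked client-side before a job is submitted. The following is a minimal sketch, not part of the generated client: the helper name <code>is_valid_job_name</code> is hypothetical, and the service remains the authority on which names it accepts.</p>

```python
import re

# Pattern quoted from the "name" field description in the response above.
JOB_NAME_PATTERN = re.compile(r"[a-z]([-a-z0-9]{0,38}[a-z0-9])?")


def is_valid_job_name(name):
    """Return True if `name` fully matches the documented job name pattern.

    Hypothetical helper for illustration; the Dataflow service performs
    its own validation and may apply rules not captured here.
    """
    return JOB_NAME_PATTERN.fullmatch(name) is not None
```

<p>The pattern allows at most 40 characters, starting with a lowercase letter and ending with a lowercase letter or digit, so a name such as <code>wordcount-2020</code> passes while <code>WordCount</code> and <code>job-</code> do not.</p>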
2633
2634<div class="method">
2635 <code class="details" id="list_next">list_next(previous_request, previous_response)</code>
2636 <pre>Retrieves the next page of results.
2637
2638Args:
2639 previous_request: The request for the previous page. (required)
2640 previous_response: The response from the request for the previous page. (required)
2641
2642Returns:
2643 A request object that you can call &#x27;execute()&#x27; on to request the next
2644 page. Returns None if there are no more items in the collection.
2645 </pre>
2646</div>
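<p>The <code>list()</code> and <code>list_next()</code> methods are typically combined in a loop that runs until <code>list_next()</code> returns None. The sketch below assumes a <code>jobs_resource</code> object exposing those two methods as documented here; the helper name <code>list_all_jobs</code> is hypothetical. It also gathers the names reported under <code>failedLocation</code>, so a caller can tell when the result may be incomplete.</p>

```python
def list_all_jobs(jobs_resource, **list_kwargs):
    """Collect jobs from every page of a list() call.

    `jobs_resource` is assumed to expose list() and list_next() as
    documented above. Returns a tuple (jobs, failed_locations).
    """
    jobs = []
    failed_locations = []
    request = jobs_resource.list(**list_kwargs)
    while request is not None:
        response = request.execute()
        jobs.extend(response.get("jobs", []))
        # Regional endpoints that did not respond are reported per page.
        for failed in response.get("failedLocation", []):
            if "name" in failed:
                failed_locations.append(failed["name"])
        # list_next() returns None once there are no more pages.
        request = jobs_resource.list_next(
            previous_request=request, previous_response=response
        )
    return jobs, failed_locations
```

<p>With a real client, <code>jobs_resource</code> would usually be obtained from <code>googleapiclient.discovery.build("dataflow", "v1b3").projects().locations().jobs()</code>, and the keyword arguments would be those accepted by <code>list()</code>, such as <code>projectId</code> and <code>location</code>.</p>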
2647
2648<div class="method">
2649 <code class="details" id="snapshot">snapshot(projectId, location, jobId, body=None, x__xgafv=None)</code>
2650 <pre>Snapshot the state of a streaming job.
2651
2652Args:
2653 projectId: string, The project which owns the job to be snapshotted. (required)
2654 location: string, The location that contains this job. (required)
2655 jobId: string, The job to be snapshotted. (required)
2656 body: object, The request body.
2657 The object takes the form of:
2658
2659{ # Request to create a snapshot of a job.
2660 &quot;snapshotSources&quot;: True or False, # If true, perform snapshots for sources which support this.
2661 &quot;location&quot;: &quot;A String&quot;, # The location that contains this job.
2662 &quot;description&quot;: &quot;A String&quot;, # User-specified description of the snapshot. May be empty.
2663 &quot;ttl&quot;: &quot;A String&quot;, # TTL for the snapshot.
2664 }
2665
2666 x__xgafv: string, V1 error format.
2667 Allowed values
2668 1 - v1 error format
2669 2 - v2 error format
2670
2671Returns:
2672 An object of the form:
2673
2674 { # Represents a snapshot of a job.
2675 &quot;ttl&quot;: &quot;A String&quot;, # The time after which this snapshot will be automatically deleted.
2676 &quot;state&quot;: &quot;A String&quot;, # State of the snapshot.
2677 &quot;id&quot;: &quot;A String&quot;, # The unique ID of this snapshot.
2678 &quot;sourceJobId&quot;: &quot;A String&quot;, # The job this snapshot was created from.
2679 &quot;creationTime&quot;: &quot;A String&quot;, # The time this snapshot was created.
2680 &quot;description&quot;: &quot;A String&quot;, # User-specified description of the snapshot. May be empty.
2681 &quot;pubsubMetadata&quot;: [ # Pub/Sub snapshot metadata.
2682 { # Represents a Pub/Sub snapshot.
2683 &quot;snapshotName&quot;: &quot;A String&quot;, # The name of the Pub/Sub snapshot.
2684 &quot;expireTime&quot;: &quot;A String&quot;, # The expire time of the Pub/Sub snapshot.
2685 &quot;topicName&quot;: &quot;A String&quot;, # The name of the Pub/Sub topic.
2686 },
2687 ],
2688 &quot;projectId&quot;: &quot;A String&quot;, # The project this snapshot belongs to.
2689 &quot;diskSizeBytes&quot;: &quot;A String&quot;, # The disk byte size of the snapshot. Only available for snapshots in READY
2690 # state.
2691 }</pre>
2692</div>
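<p>A call to <code>snapshot()</code> can be wrapped so that optional body fields are sent only when they are set. This is a minimal sketch under stated assumptions: the helper name <code>snapshot_job</code> is hypothetical, <code>jobs_resource</code> is assumed to expose <code>snapshot()</code> with the signature documented above, and the <code>"604800s"</code> duration string in the usage note is an assumed format for <code>ttl</code>, which this page describes only as a string.</p>

```python
def snapshot_job(jobs_resource, project_id, location, job_id,
                 ttl=None, description=None, snapshot_sources=False):
    """Request a snapshot of a streaming job and return the response dict.

    Hypothetical helper; the body keys mirror the request body
    documented for snapshot() above.
    """
    body = {
        "location": location,
        "snapshotSources": snapshot_sources,
    }
    # Optional fields are omitted rather than sent as empty values.
    if ttl is not None:
        body["ttl"] = ttl
    if description is not None:
        body["description"] = description
    request = jobs_resource.snapshot(
        projectId=project_id, location=location, jobId=job_id, body=body
    )
    return request.execute()
```

<p>For example, <code>snapshot_job(jobs, "my-project", "us-central1", job_id, ttl="604800s")</code> would ask for a snapshot kept for about seven days, assuming the service accepts that duration format. The returned object carries the <code>id</code> and <code>state</code> fields described above.</p>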
2693
2694<div class="method">
2695 <code class="details" id="update">update(projectId, location, jobId, body=None, x__xgafv=None)</code>
2696 <pre>Updates the state of an existing Cloud Dataflow job.
2697
2698To update the state of an existing job, we recommend using
2699`projects.locations.jobs.update` with a [regional endpoint]
2700(https://cloud.google.com/dataflow/docs/concepts/regional-endpoints). Using
2701`projects.jobs.update` is not recommended, as you can only update the state
2702of jobs that are running in `us-central1`.
2703
2704Args:
2705 projectId: string, The ID of the Cloud Platform project that the job belongs to. (required)
2706 location: string, The [regional endpoint]
2707(https://cloud.google.com/dataflow/docs/concepts/regional-endpoints) that
2708contains this job. (required)
2709 jobId: string, The job ID. (required)
2710 body: object, The request body.
2711 The object takes the form of:
2712
2713{ # Defines a job to be run by the Cloud Dataflow service.
2714 &quot;pipelineDescription&quot;: { # Preliminary field: The format of this data may change at any time.
2715 # A description of the user pipeline and the stages through which it is executed.
2716 # Created by the Cloud Dataflow service. Only retrieved with
2717 # JOB_VIEW_DESCRIPTION or JOB_VIEW_ALL.
2718 # This is a descriptive representation of the submitted pipeline and its executed form,
2719 # provided by the Dataflow service to ease visualizing the pipeline and interpreting its metrics.
2720 &quot;displayData&quot;: [ # Pipeline level display data.
2721 { # Data provided with a pipeline or transform to provide descriptive info.
2722 &quot;url&quot;: &quot;A String&quot;, # An optional full URL.
2723 &quot;javaClassValue&quot;: &quot;A String&quot;, # Contains value if the data is of java class type.
2724 &quot;timestampValue&quot;: &quot;A String&quot;, # Contains value if the data is of timestamp type.
2725 &quot;durationValue&quot;: &quot;A String&quot;, # Contains value if the data is of duration type.
2726 &quot;label&quot;: &quot;A String&quot;, # An optional label to display in a dax UI for the element.
2727 &quot;key&quot;: &quot;A String&quot;, # The key identifying the display data.
2728 # This is intended to be used as a label for the display data
2729 # when viewed in a dax monitoring system.
2730 &quot;namespace&quot;: &quot;A String&quot;, # The namespace for the key. This is usually a class name or programming
2731 # language namespace (e.g. a Python module) which defines the display data.
2732 # This allows a dax monitoring system to specially handle the data
2733 # and perform custom rendering.
2734 &quot;floatValue&quot;: 3.14, # Contains value if the data is of float type.
2735 &quot;strValue&quot;: &quot;A String&quot;, # Contains value if the data is of string type.
2736 &quot;int64Value&quot;: &quot;A String&quot;, # Contains value if the data is of int64 type.
2737 &quot;boolValue&quot;: True or False, # Contains value if the data is of a boolean type.
2738 &quot;shortStrValue&quot;: &quot;A String&quot;, # A possible additional shorter value to display.
2739 # For example a java_class_name_value of com.mypackage.MyDoFn
2740 # will be stored with MyDoFn as the short_str_value and
2741 # com.mypackage.MyDoFn as the java_class_name value.
2742 # short_str_value can be displayed and java_class_name_value
2743 # will be displayed as a tooltip.
2744 },
2745 ],
2746 &quot;originalPipelineTransform&quot;: [ # Description of each transform in the pipeline and collections between them.
2747 { # Description of the type, names/ids, and input/outputs for a transform.
2748 &quot;outputCollectionName&quot;: [ # User names for all collection outputs to this transform.
2749 &quot;A String&quot;,
2750 ],
2751 &quot;displayData&quot;: [ # Transform-specific display data.
2752 { # Data provided with a pipeline or transform to provide descriptive info.
2753 &quot;url&quot;: &quot;A String&quot;, # An optional full URL.
2754 &quot;javaClassValue&quot;: &quot;A String&quot;, # Contains value if the data is of java class type.
2755 &quot;timestampValue&quot;: &quot;A String&quot;, # Contains value if the data is of timestamp type.
2756 &quot;durationValue&quot;: &quot;A String&quot;, # Contains value if the data is of duration type.
2757 &quot;label&quot;: &quot;A String&quot;, # An optional label to display in a dax UI for the element.
2758 &quot;key&quot;: &quot;A String&quot;, # The key identifying the display data.
2759 # This is intended to be used as a label for the display data
2760 # when viewed in a dax monitoring system.
2761 &quot;namespace&quot;: &quot;A String&quot;, # The namespace for the key. This is usually a class name or programming
2762 # language namespace (e.g. a Python module) which defines the display data.
2763 # This allows a dax monitoring system to specially handle the data
2764 # and perform custom rendering.
2765 &quot;floatValue&quot;: 3.14, # Contains value if the data is of float type.
2766 &quot;strValue&quot;: &quot;A String&quot;, # Contains value if the data is of string type.
2767 &quot;int64Value&quot;: &quot;A String&quot;, # Contains value if the data is of int64 type.
2768 &quot;boolValue&quot;: True or False, # Contains value if the data is of a boolean type.
2769 &quot;shortStrValue&quot;: &quot;A String&quot;, # A possible additional shorter value to display.
2770 # For example a java_class_name_value of com.mypackage.MyDoFn
2771 # will be stored with MyDoFn as the short_str_value and
2772 # com.mypackage.MyDoFn as the java_class_name value.
2773 # short_str_value can be displayed and java_class_name_value
2774 # will be displayed as a tooltip.
2775 },
2776 ],
2777 &quot;id&quot;: &quot;A String&quot;, # SDK generated id of this transform instance.
2778 &quot;inputCollectionName&quot;: [ # User names for all collection inputs to this transform.
2779 &quot;A String&quot;,
2780 ],
2781 &quot;name&quot;: &quot;A String&quot;, # User provided name for this transform instance.
2782 &quot;kind&quot;: &quot;A String&quot;, # Type of transform.
2783 },
2784 ],
2785 &quot;executionPipelineStage&quot;: [ # Description of each stage of execution of the pipeline.
2786 { # Description of the composing transforms, names/ids, and input/outputs of a
2787 # stage of execution. Some composing transforms and sources may have been
2788 # generated by the Dataflow service during execution planning.
2789 &quot;componentSource&quot;: [ # Collections produced and consumed by component transforms of this stage.
2790 { # Description of an interstitial value between transforms in an execution
2791 # stage.
2792 &quot;userName&quot;: &quot;A String&quot;, # Human-readable name for this transform; may be user or system generated.
2793 &quot;name&quot;: &quot;A String&quot;, # Dataflow service generated name for this source.
2794 &quot;originalTransformOrCollection&quot;: &quot;A String&quot;, # User name for the original user transform or collection with which this
2795 # source is most closely associated.
2796 },
2797 ],
2798 &quot;inputSource&quot;: [ # Input sources for this stage.
2799 { # Description of an input or output of an execution stage.
2800 &quot;userName&quot;: &quot;A String&quot;, # Human-readable name for this source; may be user or system generated.
2801 &quot;originalTransformOrCollection&quot;: &quot;A String&quot;, # User name for the original user transform or collection with which this
2802 # source is most closely associated.
2803 &quot;sizeBytes&quot;: &quot;A String&quot;, # Size of the source, if measurable.
2804 &quot;name&quot;: &quot;A String&quot;, # Dataflow service generated name for this source.
2805 },
2806 ],
2807 &quot;name&quot;: &quot;A String&quot;, # Dataflow service generated name for this stage.
2808 &quot;componentTransform&quot;: [ # Transforms that comprise this execution stage.
2809 { # Description of a transform executed as part of an execution stage.
2810 &quot;name&quot;: &quot;A String&quot;, # Dataflow service generated name for this source.
2811 &quot;userName&quot;: &quot;A String&quot;, # Human-readable name for this transform; may be user or system generated.
2812 &quot;originalTransform&quot;: &quot;A String&quot;, # User name for the original user transform with which this transform is
2813 # most closely associated.
2814 },
2815 ],
2816 &quot;id&quot;: &quot;A String&quot;, # Dataflow service generated id for this stage.
2817 &quot;outputSource&quot;: [ # Output sources for this stage.
2818 { # Description of an input or output of an execution stage.
2819 &quot;userName&quot;: &quot;A String&quot;, # Human-readable name for this source; may be user or system generated.
2820 &quot;originalTransformOrCollection&quot;: &quot;A String&quot;, # User name for the original user transform or collection with which this
2821 # source is most closely associated.
2822 &quot;sizeBytes&quot;: &quot;A String&quot;, # Size of the source, if measurable.
2823 &quot;name&quot;: &quot;A String&quot;, # Dataflow service generated name for this source.
2824 },
2825 ],
2826 &quot;kind&quot;: &quot;A String&quot;, # Type of transform this stage is executing.
2827 },
2828 ],
2829 },
2830 &quot;labels&quot;: { # User-defined labels for this job.
2831 #
2832 # The labels map can contain no more than 64 entries. Entries of the labels
2833 # map are UTF8 strings that comply with the following restrictions:
2834 #
2835 # * Keys must conform to regexp: \p{Ll}\p{Lo}{0,62}
2836 # * Values must conform to regexp: [\p{Ll}\p{Lo}\p{N}_-]{0,63}
2837 # * Both keys and values are additionally constrained to be &lt;= 128 bytes in
2838 # size.
2839 &quot;a_key&quot;: &quot;A String&quot;,
2840 },
2841 &quot;projectId&quot;: &quot;A String&quot;, # The ID of the Cloud Platform project that the job belongs to.
2842 &quot;environment&quot;: { # Describes the environment in which a Dataflow Job runs. # The environment for the job.
2843 &quot;flexResourceSchedulingGoal&quot;: &quot;A String&quot;, # Which Flexible Resource Scheduling mode to run in.
2844 &quot;workerRegion&quot;: &quot;A String&quot;, # The Compute Engine region
2845 # (https://cloud.google.com/compute/docs/regions-zones/regions-zones) in
2846 # which worker processing should occur, e.g. &quot;us-west1&quot;. Mutually exclusive
2847 # with worker_zone. If neither worker_region nor worker_zone is specified,
2848 # default to the control plane&#x27;s region.
2849 &quot;userAgent&quot;: { # A description of the process that generated the request.
2850 &quot;a_key&quot;: &quot;&quot;, # Properties of the object.
2851 },
2852 &quot;serviceAccountEmail&quot;: &quot;A String&quot;, # Identity to run virtual machines as. Defaults to the default account.
2853 &quot;version&quot;: { # A structure describing which components and their versions of the service
2854 # are required in order to run the job.
2855 &quot;a_key&quot;: &quot;&quot;, # Properties of the object.
2856 },
2857 &quot;serviceKmsKeyName&quot;: &quot;A String&quot;, # If set, contains the Cloud KMS key identifier used to encrypt data
2858 # at rest, AKA a Customer Managed Encryption Key (CMEK).
2859 #
2860 # Format:
2861 # projects/PROJECT_ID/locations/LOCATION/keyRings/KEY_RING/cryptoKeys/KEY
2862 &quot;experiments&quot;: [ # The list of experiments to enable.
2863 &quot;A String&quot;,
2864 ],
2865 &quot;workerZone&quot;: &quot;A String&quot;, # The Compute Engine zone
2866 # (https://cloud.google.com/compute/docs/regions-zones/regions-zones) in
2867 # which worker processing should occur, e.g. &quot;us-west1-a&quot;. Mutually exclusive
2868 # with worker_region. If neither worker_region nor worker_zone is specified,
2869 # a zone in the control plane&#x27;s region is chosen based on available capacity.
2870 &quot;workerPools&quot;: [ # The worker pools. At least one &quot;harness&quot; worker pool must be
2871 # specified in order for the job to have workers.
2872 { # Describes one particular pool of Cloud Dataflow workers to be
2873 # instantiated by the Cloud Dataflow service in order to perform the
2874 # computations required by a job. Note that a workflow job may use
2875 # multiple pools, in order to match the various computational
2876 # requirements of the various stages of the job.
2877 &quot;onHostMaintenance&quot;: &quot;A String&quot;, # The action to take on host maintenance, as defined by the Google
2878 # Compute Engine API.
2879 &quot;sdkHarnessContainerImages&quot;: [ # Set of SDK harness containers needed to execute this pipeline. This will
2880 # only be set in the Fn API path. For non-cross-language pipelines this
2881 # should have only one entry. Cross-language pipelines will have two or more
2882 # entries.
2883 { # Defines a SDK harness container for executing Dataflow pipelines.
2884 &quot;containerImage&quot;: &quot;A String&quot;, # A docker container image that resides in Google Container Registry.
2885 &quot;useSingleCorePerContainer&quot;: True or False, # If true, recommends the Dataflow service to use only one core per SDK
2886 # container instance with this image. If false (or unset) recommends using
2887 # more than one core per SDK container instance with this image for
2888 # efficiency. Note that Dataflow service may choose to override this property
2889 # if needed.
2890 },
2891 ],
2892 &quot;zone&quot;: &quot;A String&quot;, # Zone to run the worker pools in. If empty or unspecified, the service
2893 # will attempt to choose a reasonable default.
2894 &quot;kind&quot;: &quot;A String&quot;, # The kind of the worker pool; currently only `harness` and `shuffle`
2895 # are supported.
2896 &quot;metadata&quot;: { # Metadata to set on the Google Compute Engine VMs.
2897 &quot;a_key&quot;: &quot;A String&quot;,
2898 },
2899 &quot;diskSourceImage&quot;: &quot;A String&quot;, # Fully qualified source image for disks.
2900 &quot;dataDisks&quot;: [ # Data disks that are used by a VM in this workflow.
2901 { # Describes the data disk used by a workflow job.
2902 &quot;sizeGb&quot;: 42, # Size of disk in GB. If zero or unspecified, the service will
2903 # attempt to choose a reasonable default.
2904 &quot;diskType&quot;: &quot;A String&quot;, # Disk storage type, as defined by Google Compute Engine. This
2905 # must be a disk type appropriate to the project and zone in which
2906 # the workers will run. If unknown or unspecified, the service
2907 # will attempt to choose a reasonable default.
2908 #
2909 # For example, the standard persistent disk type is a resource name
2910 # typically ending in &quot;pd-standard&quot;. If SSD persistent disks are
2911 # available, the resource name typically ends with &quot;pd-ssd&quot;. The
2912 # actual valid values are defined by the Google Compute Engine API,
2913 # not by the Cloud Dataflow API; consult the Google Compute Engine
2914 # documentation for more information about determining the set of
2915 # available disk types for a particular project and zone.
2916 #
2917 # Google Compute Engine Disk types are local to a particular
2918 # project in a particular zone, and so the resource name will
2919 # typically look something like this:
2920 #
2921 # compute.googleapis.com/projects/project-id/zones/zone/diskTypes/pd-standard
2922 &quot;mountPoint&quot;: &quot;A String&quot;, # Directory in a VM where disk is mounted.
2923 },
2924 ],
2925 &quot;packages&quot;: [ # Packages to be installed on workers.
2926 { # The packages that must be installed in order for a worker to run the
2927 # steps of the Cloud Dataflow job that will be assigned to its worker
2928 # pool.
2929 #
2930 # This is the mechanism by which the Cloud Dataflow SDK causes code to
2931 # be loaded onto the workers. For example, the Cloud Dataflow Java SDK
2932 # might use this to install jars containing the user&#x27;s code and all of the
2933 # various dependencies (libraries, data files, etc.) required in order
2934 # for that code to run.
2935 &quot;name&quot;: &quot;A String&quot;, # The name of the package.
2936 &quot;location&quot;: &quot;A String&quot;, # The resource to read the package from. The supported resource type is:
2937 #
2938 # Google Cloud Storage:
2939 #
2940 # storage.googleapis.com/{bucket}
2941 # bucket.storage.googleapis.com/
2942 },
2943 ],
2944 &quot;teardownPolicy&quot;: &quot;A String&quot;, # Sets the policy for determining when to tear down the worker pool.
2945 # Allowed values are: `TEARDOWN_ALWAYS`, `TEARDOWN_ON_SUCCESS`, and
2946 # `TEARDOWN_NEVER`.
2947 # `TEARDOWN_ALWAYS` means workers are always torn down regardless of whether
2948 # the job succeeds. `TEARDOWN_ON_SUCCESS` means workers are torn down
2949 # if the job succeeds. `TEARDOWN_NEVER` means the workers are never torn
2950 # down.
2951 #
2952 # If the workers are not torn down by the service, they will
2953 # continue to run and use Google Compute Engine VM resources in the
2954 # user&#x27;s project until they are explicitly terminated by the user.
2955 # Because of this, Google recommends using the `TEARDOWN_ALWAYS`
2956 # policy except for small, manually supervised test jobs.
2957 #
2958 # If unknown or unspecified, the service will attempt to choose a reasonable
2959 # default.
2960 &quot;network&quot;: &quot;A String&quot;, # Network to which VMs will be assigned. If empty or unspecified,
2961 # the service will use the network &quot;default&quot;.
2962 &quot;ipConfiguration&quot;: &quot;A String&quot;, # Configuration for VM IPs.
2963 &quot;diskSizeGb&quot;: 42, # Size of root disk for VMs, in GB. If zero or unspecified, the service will
2964 # attempt to choose a reasonable default.
2965 &quot;autoscalingSettings&quot;: { # Settings for WorkerPool autoscaling. # Settings for autoscaling of this WorkerPool.
2966 &quot;maxNumWorkers&quot;: 42, # The maximum number of workers to cap scaling at.
2967 &quot;algorithm&quot;: &quot;A String&quot;, # The algorithm to use for autoscaling.
2968 },
2969 &quot;poolArgs&quot;: { # Extra arguments for this worker pool.
2970 &quot;a_key&quot;: &quot;&quot;, # Properties of the object. Contains field @type with type URL.
2971 },
2972 &quot;subnetwork&quot;: &quot;A String&quot;, # Subnetwork to which VMs will be assigned, if desired. Expected to be of
2973 # the form &quot;regions/REGION/subnetworks/SUBNETWORK&quot;.
2974 &quot;numWorkers&quot;: 42, # Number of Google Compute Engine workers in this pool needed to
2975 # execute the job. If zero or unspecified, the service will
2976 # attempt to choose a reasonable default.
2977 &quot;numThreadsPerWorker&quot;: 42, # The number of threads per worker harness. If empty or unspecified, the
2978 # service will choose a number of threads (according to the number of cores
2979 # on the selected machine type for batch, or 1 by convention for streaming).
2980 &quot;workerHarnessContainerImage&quot;: &quot;A String&quot;, # Required. Docker container image that executes the Cloud Dataflow worker
2981 # harness, residing in Google Container Registry.
2982 #
2983 # Deprecated for the Fn API path. Use sdk_harness_container_images instead.
2984 &quot;taskrunnerSettings&quot;: { # Taskrunner configuration settings. # Settings passed through to Google Compute Engine workers when
2985 # using the standard Dataflow task runner. Users should ignore
2986 # this field.
2987 &quot;dataflowApiVersion&quot;: &quot;A String&quot;, # The API version of the endpoint, e.g. &quot;v1b3&quot;
2988 &quot;oauthScopes&quot;: [ # The OAuth2 scopes to be requested by the taskrunner in order to
2989 # access the Cloud Dataflow API.
2990 &quot;A String&quot;,
2991 ],
2992 &quot;baseUrl&quot;: &quot;A String&quot;, # The base URL for the taskrunner to use when accessing Google Cloud APIs.
2993 #
2994 # When workers access Google Cloud APIs, they logically do so via
2995 # relative URLs. If this field is specified, it supplies the base
2996 # URL to use for resolving these relative URLs. The normative
2997 # algorithm used is defined by RFC 1808, &quot;Relative Uniform Resource
2998 # Locators&quot;.
2999 #
3000 # If not specified, the default value is &quot;http://www.googleapis.com/&quot;
3001 &quot;workflowFileName&quot;: &quot;A String&quot;, # The file to store the workflow in.
3002 &quot;logToSerialconsole&quot;: True or False, # Whether to send taskrunner log info to Google Compute Engine VM serial
3003 # console.
3004 &quot;baseTaskDir&quot;: &quot;A String&quot;, # The location on the worker for task-specific subdirectories.
3005 &quot;taskUser&quot;: &quot;A String&quot;, # The UNIX user ID on the worker VM to use for tasks launched by
3006 # taskrunner; e.g. &quot;root&quot;.
3007 &quot;vmId&quot;: &quot;A String&quot;, # The ID string of the VM.
3008 &quot;alsologtostderr&quot;: True or False, # Whether to also send taskrunner log info to stderr.
Bu Sun Kim4ed7d3f2020-05-27 12:20:54 -07003009 &quot;parallelWorkerSettings&quot;: { # Provides data to pass through to the worker harness. # The settings to pass to the parallel worker harness.
Bu Sun Kimd059ad82020-07-22 17:02:09 -07003010 &quot;shuffleServicePath&quot;: &quot;A String&quot;, # The Shuffle service path relative to the root URL, for example,
3011 # &quot;shuffle/v1beta1&quot;.
Bu Sun Kim4ed7d3f2020-05-27 12:20:54 -07003012 &quot;tempStoragePrefix&quot;: &quot;A String&quot;, # The prefix of the resources the system should use for temporary
3013 # storage.
3014 #
3015 # The supported resource type is:
3016 #
3017 # Google Cloud Storage:
3018 #
3019 # storage.googleapis.com/{bucket}/{object}
3020 # bucket.storage.googleapis.com/{object}
3021 &quot;reportingEnabled&quot;: True or False, # Whether to send work progress updates to the service.
Bu Sun Kimd059ad82020-07-22 17:02:09 -07003022 &quot;servicePath&quot;: &quot;A String&quot;, # The Cloud Dataflow service path relative to the root URL, for example,
3023 # &quot;dataflow/v1b3/projects&quot;.
Bu Sun Kim4ed7d3f2020-05-27 12:20:54 -07003024 &quot;baseUrl&quot;: &quot;A String&quot;, # The base URL for accessing Google Cloud APIs.
3025 #
3026 # When workers access Google Cloud APIs, they logically do so via
3027 # relative URLs. If this field is specified, it supplies the base
3028 # URL to use for resolving these relative URLs. The normative
3029 # algorithm used is defined by RFC 1808, &quot;Relative Uniform Resource
3030 # Locators&quot;.
3031 #
3032 # If not specified, the default value is &quot;http://www.googleapis.com/&quot;
Bu Sun Kim4ed7d3f2020-05-27 12:20:54 -07003033 &quot;workerId&quot;: &quot;A String&quot;, # The ID of the worker running this pipeline.
3034 },
Bu Sun Kimd059ad82020-07-22 17:02:09 -07003035 &quot;harnessCommand&quot;: &quot;A String&quot;, # The command to launch the worker harness.
3036 &quot;logDir&quot;: &quot;A String&quot;, # The directory on the VM to store logs.
3037 &quot;streamingWorkerMainClass&quot;: &quot;A String&quot;, # The streaming worker main class name.
3038 &quot;languageHint&quot;: &quot;A String&quot;, # The suggested backend language.
3039 &quot;taskGroup&quot;: &quot;A String&quot;, # The UNIX group ID on the worker VM to use for tasks launched by
3040 # taskrunner; e.g. &quot;wheel&quot;.
3041 &quot;logUploadLocation&quot;: &quot;A String&quot;, # Indicates where to put logs. If this is not specified, the logs
3042 # will not be uploaded.
3043 #
3044 # The supported resource type is:
3045 #
3046 # Google Cloud Storage:
3047 # storage.googleapis.com/{bucket}/{object}
3048 # bucket.storage.googleapis.com/{object}
3049 &quot;commandlinesFileName&quot;: &quot;A String&quot;, # The file to store preprocessing commands in.
3050 &quot;continueOnException&quot;: True or False, # Whether to continue taskrunner if an exception is hit.
3051 &quot;tempStoragePrefix&quot;: &quot;A String&quot;, # The prefix of the resources the taskrunner should use for
3052 # temporary storage.
3053 #
3054 # The supported resource type is:
3055 #
3056 # Google Cloud Storage:
3057 # storage.googleapis.com/{bucket}/{object}
3058 # bucket.storage.googleapis.com/{object}
Bu Sun Kim4ed7d3f2020-05-27 12:20:54 -07003059 },
Bu Sun Kimd059ad82020-07-22 17:02:09 -07003060 &quot;diskType&quot;: &quot;A String&quot;, # Type of root disk for VMs. If empty or unspecified, the service will
3061 # attempt to choose a reasonable default.
Bu Sun Kim4ed7d3f2020-05-27 12:20:54 -07003062 &quot;defaultPackageSet&quot;: &quot;A String&quot;, # The default package set to install. This allows the service to
3063 # select a default set of packages which are useful to worker
3064 # harnesses written in a particular language.
Bu Sun Kimd059ad82020-07-22 17:02:09 -07003065 &quot;machineType&quot;: &quot;A String&quot;, # Machine type (e.g. &quot;n1-standard-1&quot;). If empty or unspecified, the
3066 # service will attempt to choose a reasonable default.
Bu Sun Kim4ed7d3f2020-05-27 12:20:54 -07003067 },
3068 ],
Bu Sun Kimd059ad82020-07-22 17:02:09 -07003069 &quot;tempStoragePrefix&quot;: &quot;A String&quot;, # The prefix of the resources the system should use for temporary
3070 # storage. The system will append the suffix &quot;/temp-{JOBNAME}&quot; to
3071 # this resource prefix, where {JOBNAME} is the value of the
3072 # job_name field. The resulting bucket and object prefix is used
3073 # as the prefix of the resources used to store temporary data
3074 # needed during the job execution. NOTE: This will override the
3075 # value in taskrunner_settings.
3076 # The supported resource type is:
3077 #
3078 # Google Cloud Storage:
3079 #
3080 # storage.googleapis.com/{bucket}/{object}
3081 # bucket.storage.googleapis.com/{object}
3082 &quot;internalExperiments&quot;: { # Experimental settings.
3083 &quot;a_key&quot;: &quot;&quot;, # Properties of the object. Contains field @type with type URL.
3084 },
3085 &quot;sdkPipelineOptions&quot;: { # The Cloud Dataflow SDK pipeline options specified by the user. These
3086 # options are passed through the service and are used to recreate the
3087 # SDK pipeline options on the worker in a language agnostic and platform
3088 # independent way.
3089 &quot;a_key&quot;: &quot;&quot;, # Properties of the object.
3090 },
Bu Sun Kim4ed7d3f2020-05-27 12:20:54 -07003091 &quot;dataset&quot;: &quot;A String&quot;, # The dataset for the current project where various workflow
3092 # related tables are stored.
3093 #
3094 # The supported resource type is:
3095 #
3096 # Google BigQuery:
3097 # bigquery.googleapis.com/{dataset}
Bu Sun Kimd059ad82020-07-22 17:02:09 -07003098 &quot;clusterManagerApiService&quot;: &quot;A String&quot;, # The type of cluster manager API to use. If unknown or
3099 # unspecified, the service will attempt to choose a reasonable
3100 # default. This should be in the form of the API service name,
3101 # e.g. &quot;compute.googleapis.com&quot;.
Jon Wayne Parrott692617a2017-01-06 09:58:29 -08003102 },
Bu Sun Kimd059ad82020-07-22 17:02:09 -07003103 &quot;stepsLocation&quot;: &quot;A String&quot;, # The GCS location where the steps are stored.
Bu Sun Kim65020912020-05-20 12:08:20 -07003104 &quot;steps&quot;: [ # Exactly one of step or steps_location should be specified.
Bu Sun Kimd059ad82020-07-22 17:02:09 -07003105 #
Bu Sun Kim715bd7f2019-06-14 16:50:42 -07003106 # The top-level steps that constitute the entire job.
Sai Cheemalapatic30d2b52017-03-13 12:12:03 -04003107 { # Defines a particular step within a Cloud Dataflow job.
3108 #
3109 # A job consists of multiple steps, each of which performs some
3110 # specific operation as part of the overall job. Data is typically
3111 # passed from one step to another as part of the job.
3112 #
Bu Sun Kim65020912020-05-20 12:08:20 -07003113 # Here&#x27;s an example of a sequence of steps which together implement a
Sai Cheemalapatic30d2b52017-03-13 12:12:03 -04003114 # Map-Reduce job:
3115 #
3116 # * Read a collection of data from some source, parsing the
Bu Sun Kim65020912020-05-20 12:08:20 -07003117 # collection&#x27;s elements.
Sai Cheemalapatic30d2b52017-03-13 12:12:03 -04003118 #
3119 # * Validate the elements.
3120 #
3121 # * Apply a user-defined function to map each element to some value
3122 # and extract an element-specific key value.
3123 #
3124 # * Group elements with the same key into a single element with
3125 # that key, transforming a multiply-keyed collection into a
3126 # uniquely-keyed collection.
3127 #
3128 # * Write the elements out to some data sink.
3129 #
3130 # Note that the Cloud Dataflow service may be used to run many different
3131 # types of jobs, not just Map-Reduce.
Bu Sun Kim65020912020-05-20 12:08:20 -07003132 &quot;kind&quot;: &quot;A String&quot;, # The kind of step in the Cloud Dataflow job.
3133 &quot;properties&quot;: { # Named properties associated with the step. Each kind of
Sai Cheemalapatic30d2b52017-03-13 12:12:03 -04003134 # predefined step has its own required set of properties.
3135 # Must be provided on Create. Only retrieved with JOB_VIEW_ALL.
Bu Sun Kim65020912020-05-20 12:08:20 -07003136 &quot;a_key&quot;: &quot;&quot;, # Properties of the object.
Jon Wayne Parrott692617a2017-01-06 09:58:29 -08003137 },
Bu Sun Kimd059ad82020-07-22 17:02:09 -07003138 &quot;name&quot;: &quot;A String&quot;, # The name that identifies the step. This must be unique for each
3139 # step with respect to all other steps in the Cloud Dataflow job.
3140 },
3141 ],
3142 &quot;stageStates&quot;: [ # This field may be mutated by the Cloud Dataflow service;
3143 # callers cannot mutate it.
3144 { # A message describing the state of a particular execution stage.
3145 &quot;executionStageState&quot;: &quot;A String&quot;, # Execution stage states allow the same set of values as JobState.
3146 &quot;executionStageName&quot;: &quot;A String&quot;, # The name of the execution stage.
3147 &quot;currentStateTime&quot;: &quot;A String&quot;, # The time at which the stage transitioned to this state.
Jon Wayne Parrott692617a2017-01-06 09:58:29 -08003148 },
3149 ],
Bu Sun Kim65020912020-05-20 12:08:20 -07003150 &quot;replacedByJobId&quot;: &quot;A String&quot;, # If another job is an update of this job (and thus, this job is in
3151 # `JOB_STATE_UPDATED`), this field contains the ID of that job.
3152 &quot;jobMetadata&quot;: { # Metadata available primarily for filtering jobs. Will be included in the
3153 # ListJob response and Job SUMMARY view. # This field is populated by the Dataflow service to support filtering jobs
3154 # by the metadata values provided here. Populated for ListJobs and all GetJob
3155 # views SUMMARY and higher.
3156 &quot;sdkVersion&quot;: { # The version of the SDK used to run the job. # The SDK version used to run the job.
3157 &quot;sdkSupportStatus&quot;: &quot;A String&quot;, # The support status for this SDK version.
3158 &quot;versionDisplayName&quot;: &quot;A String&quot;, # A readable string describing the version of the SDK.
3159 &quot;version&quot;: &quot;A String&quot;, # The version of the SDK used to run the job.
3160 },
3161 &quot;bigTableDetails&quot;: [ # Identification of a BigTable source used in the Dataflow job.
3162 { # Metadata for a BigTable connector used by the job.
3163 &quot;instanceId&quot;: &quot;A String&quot;, # InstanceId accessed in the connection.
3164 &quot;tableId&quot;: &quot;A String&quot;, # TableId accessed in the connection.
3165 &quot;projectId&quot;: &quot;A String&quot;, # ProjectId accessed in the connection.
3166 },
3167 ],
3168 &quot;pubsubDetails&quot;: [ # Identification of a PubSub source used in the Dataflow job.
3169 { # Metadata for a PubSub connector used by the job.
3170 &quot;subscription&quot;: &quot;A String&quot;, # Subscription used in the connection.
3171 &quot;topic&quot;: &quot;A String&quot;, # Topic accessed in the connection.
3172 },
3173 ],
3174 &quot;bigqueryDetails&quot;: [ # Identification of a BigQuery source used in the Dataflow job.
3175 { # Metadata for a BigQuery connector used by the job.
3176 &quot;dataset&quot;: &quot;A String&quot;, # Dataset accessed in the connection.
3177 &quot;projectId&quot;: &quot;A String&quot;, # Project accessed in the connection.
3178 &quot;query&quot;: &quot;A String&quot;, # Query used to access data in the connection.
3179 &quot;table&quot;: &quot;A String&quot;, # Table accessed in the connection.
3180 },
3181 ],
3182 &quot;fileDetails&quot;: [ # Identification of a File source used in the Dataflow job.
3183 { # Metadata for a File connector used by the job.
3184 &quot;filePattern&quot;: &quot;A String&quot;, # File Pattern used to access files by the connector.
3185 },
3186 ],
3187 &quot;datastoreDetails&quot;: [ # Identification of a Datastore source used in the Dataflow job.
3188 { # Metadata for a Datastore connector used by the job.
3189 &quot;namespace&quot;: &quot;A String&quot;, # Namespace used in the connection.
3190 &quot;projectId&quot;: &quot;A String&quot;, # ProjectId accessed in the connection.
3191 },
3192 ],
3193 &quot;spannerDetails&quot;: [ # Identification of a Spanner source used in the Dataflow job.
3194 { # Metadata for a Spanner connector used by the job.
3195 &quot;instanceId&quot;: &quot;A String&quot;, # InstanceId accessed in the connection.
3196 &quot;databaseId&quot;: &quot;A String&quot;, # DatabaseId accessed in the connection.
3197 &quot;projectId&quot;: &quot;A String&quot;, # ProjectId accessed in the connection.
3198 },
3199 ],
3200 },
3201 &quot;location&quot;: &quot;A String&quot;, # The [regional endpoint]
3202 # (https://cloud.google.com/dataflow/docs/concepts/regional-endpoints) that
3203 # contains this job.
3204 &quot;transformNameMapping&quot;: { # The map of transform name prefixes of the job to be replaced to the
3205 # corresponding name prefixes of the new job.
3206 &quot;a_key&quot;: &quot;A String&quot;,
3207 },
3208 &quot;startTime&quot;: &quot;A String&quot;, # The timestamp when the job was started (transitioned to JOB_STATE_PENDING).
3209 # Flexible resource scheduling jobs are started with some delay after job
3210 # creation, so start_time is unset before start and is updated when the
3211 # job is started by the Cloud Dataflow service. For other jobs, start_time
3212 # always equals create_time and is immutable and set by the Cloud Dataflow
3213 # service.
3214 &quot;clientRequestId&quot;: &quot;A String&quot;, # The client&#x27;s unique identifier of the job, re-used across retried attempts.
3215 # If this field is set, the service will ensure its uniqueness.
3216 # The request to create a job will fail if the service has knowledge of a
3217 # previously submitted job with the same client&#x27;s ID and job name.
3218 # The caller may use this field to ensure idempotence of job
3219 # creation across retried attempts to create a job.
3220 # By default, the field is empty and, in that case, the service ignores it.
3221 &quot;executionInfo&quot;: { # Additional information about how a Cloud Dataflow job will be executed that
3222 # isn&#x27;t contained in the submitted job. # Deprecated.
3223 &quot;stages&quot;: { # A mapping from each stage to the information about that stage.
3224 &quot;a_key&quot;: { # Contains information about how a particular
3225 # google.dataflow.v1beta3.Step will be executed.
3226 &quot;stepName&quot;: [ # The steps associated with the execution stage.
3227 # Note that stages may have several steps, and that a given step
3228 # might be run by more than one stage.
3229 &quot;A String&quot;,
3230 ],
3231 },
3232 },
3233 },
Bu Sun Kimd059ad82020-07-22 17:02:09 -07003234 &quot;type&quot;: &quot;A String&quot;, # The type of Cloud Dataflow job.
Bu Sun Kim65020912020-05-20 12:08:20 -07003235 &quot;createTime&quot;: &quot;A String&quot;, # The timestamp when the job was initially created. Immutable and set by the
3236 # Cloud Dataflow service.
Bu Sun Kimd059ad82020-07-22 17:02:09 -07003237 &quot;tempFiles&quot;: [ # A set of files the system should be aware of that are used
3238 # for temporary storage. These temporary files will be
3239 # removed on job completion.
3240 # No duplicates are allowed.
3241 # No file patterns are supported.
3242 #
3243 # The supported files are:
3244 #
3245 # Google Cloud Storage:
3246 #
3247 # storage.googleapis.com/{bucket}/{object}
3248 # bucket.storage.googleapis.com/{object}
3249 &quot;A String&quot;,
3250 ],
3251 &quot;id&quot;: &quot;A String&quot;, # The unique ID of this job.
3252 #
3253 # This field is set by the Cloud Dataflow service when the Job is
3254 # created, and is immutable for the life of the job.
Bu Sun Kim65020912020-05-20 12:08:20 -07003255 &quot;requestedState&quot;: &quot;A String&quot;, # The job&#x27;s requested state.
Bu Sun Kimd059ad82020-07-22 17:02:09 -07003256 #
Bu Sun Kim65020912020-05-20 12:08:20 -07003257 # `UpdateJob` may be used to switch between the `JOB_STATE_STOPPED` and
3258 # `JOB_STATE_RUNNING` states, by setting requested_state. `UpdateJob` may
3259 # also be used to directly set a job&#x27;s requested state to
3260 # `JOB_STATE_CANCELLED` or `JOB_STATE_DONE`, irrevocably terminating the
3261 # job if it has not already reached a terminal state.
Bu Sun Kimd059ad82020-07-22 17:02:09 -07003262 &quot;replaceJobId&quot;: &quot;A String&quot;, # If this job is an update of an existing job, this field is the job ID
3263 # of the job it replaced.
3264 #
3265 # When sending a `CreateJobRequest`, you can update a job by specifying it
3266 # here. The job named here is stopped, and its intermediate state is
3267 # transferred to this job.
3268 &quot;createdFromSnapshotId&quot;: &quot;A String&quot;, # If this is specified, the job&#x27;s initial state is populated from the given
3269 # snapshot.
3270 &quot;currentState&quot;: &quot;A String&quot;, # The current state of the job.
3271 #
3272 # Jobs are created in the `JOB_STATE_STOPPED` state unless otherwise
3273 # specified.
3274 #
3275 # A job in the `JOB_STATE_RUNNING` state may asynchronously enter a
3276 # terminal state. After a job has reached a terminal state, no
3277 # further state updates may be made.
3278 #
3279 # This field may be mutated by the Cloud Dataflow service;
3280 # callers cannot mutate it.
3281 &quot;name&quot;: &quot;A String&quot;, # The user-specified Cloud Dataflow job name.
3282 #
3283 # Only one Job with a given name may exist in a project at any
3284 # given time. If a caller attempts to create a Job with the same
3285 # name as an already-existing Job, the attempt returns the
3286 # existing Job.
3287 #
3288 # The name must match the regular expression
3289 # `[a-z]([-a-z0-9]{0,38}[a-z0-9])?`
3290 &quot;currentStateTime&quot;: &quot;A String&quot;, # The timestamp associated with the current state.
3291 }
3292
3293 x__xgafv: string, V1 error format.
3294 Allowed values
3295 1 - v1 error format
3296 2 - v2 error format
3297
3298Returns:
3299 An object of the form:
3300
3301 { # Defines a job to be run by the Cloud Dataflow service.
3302 &quot;pipelineDescription&quot;: { # A descriptive representation of the submitted pipeline as well as the executed
3303 # form. This data is provided by the Dataflow service for ease of visualizing
3304 # the pipeline and interpreting Dataflow-provided metrics. # Preliminary field: The format of this data may change at any time.
3305 # A description of the user pipeline and stages through which it is executed.
3306 # Created by the Cloud Dataflow service. Only retrieved with
3307 # JOB_VIEW_DESCRIPTION or JOB_VIEW_ALL.
3308 &quot;displayData&quot;: [ # Pipeline level display data.
3309 { # Data provided with a pipeline or transform to provide descriptive info.
3310 &quot;url&quot;: &quot;A String&quot;, # An optional full URL.
3311 &quot;javaClassValue&quot;: &quot;A String&quot;, # Contains value if the data is of java class type.
3312 &quot;timestampValue&quot;: &quot;A String&quot;, # Contains value if the data is of timestamp type.
3313 &quot;durationValue&quot;: &quot;A String&quot;, # Contains value if the data is of duration type.
3314 &quot;label&quot;: &quot;A String&quot;, # An optional label to display in a dax UI for the element.
3315 &quot;key&quot;: &quot;A String&quot;, # The key identifying the display data.
3316 # This is intended to be used as a label for the display data
3317 # when viewed in a dax monitoring system.
3318 &quot;namespace&quot;: &quot;A String&quot;, # The namespace for the key. This is usually a class name or programming
3319 # language namespace (e.g. a Python module) which defines the display data.
3320 # This allows a dax monitoring system to specially handle the data
3321 # and perform custom rendering.
3322 &quot;floatValue&quot;: 3.14, # Contains value if the data is of float type.
3323 &quot;strValue&quot;: &quot;A String&quot;, # Contains value if the data is of string type.
3324 &quot;int64Value&quot;: &quot;A String&quot;, # Contains value if the data is of int64 type.
3325 &quot;boolValue&quot;: True or False, # Contains value if the data is of a boolean type.
3326 &quot;shortStrValue&quot;: &quot;A String&quot;, # A possible additional shorter value to display.
3327 # For example a java_class_name_value of com.mypackage.MyDoFn
3328 # will be stored with MyDoFn as the short_str_value and
3329 # com.mypackage.MyDoFn as the java_class_name_value.
3330 # short_str_value can be displayed and java_class_name_value
3331 # will be displayed as a tooltip.
3332 },
3333 ],
3334 &quot;originalPipelineTransform&quot;: [ # Description of each transform in the pipeline and collections between them.
3335 { # Description of the type, names/ids, and input/outputs for a transform.
3336 &quot;outputCollectionName&quot;: [ # User names for all collection outputs to this transform.
3337 &quot;A String&quot;,
3338 ],
3339 &quot;displayData&quot;: [ # Transform-specific display data.
3340 { # Data provided with a pipeline or transform to provide descriptive info.
3341 &quot;url&quot;: &quot;A String&quot;, # An optional full URL.
3342 &quot;javaClassValue&quot;: &quot;A String&quot;, # Contains value if the data is of java class type.
3343 &quot;timestampValue&quot;: &quot;A String&quot;, # Contains value if the data is of timestamp type.
3344 &quot;durationValue&quot;: &quot;A String&quot;, # Contains value if the data is of duration type.
3345 &quot;label&quot;: &quot;A String&quot;, # An optional label to display in a dax UI for the element.
3346 &quot;key&quot;: &quot;A String&quot;, # The key identifying the display data.
3347 # This is intended to be used as a label for the display data
3348 # when viewed in a dax monitoring system.
3349 &quot;namespace&quot;: &quot;A String&quot;, # The namespace for the key. This is usually a class name or programming
3350 # language namespace (e.g. a Python module) which defines the display data.
3351 # This allows a dax monitoring system to specially handle the data
3352 # and perform custom rendering.
3353 &quot;floatValue&quot;: 3.14, # Contains value if the data is of float type.
3354 &quot;strValue&quot;: &quot;A String&quot;, # Contains value if the data is of string type.
3355 &quot;int64Value&quot;: &quot;A String&quot;, # Contains value if the data is of int64 type.
3356 &quot;boolValue&quot;: True or False, # Contains value if the data is of a boolean type.
3357 &quot;shortStrValue&quot;: &quot;A String&quot;, # A possible additional shorter value to display.
3358 # For example a java_class_name_value of com.mypackage.MyDoFn
3359 # will be stored with MyDoFn as the short_str_value and
3360 # com.mypackage.MyDoFn as the java_class_name_value.
3361 # short_str_value can be displayed and java_class_name_value
3362 # will be displayed as a tooltip.
3363 },
3364 ],
3365 &quot;id&quot;: &quot;A String&quot;, # SDK generated id of this transform instance.
3366 &quot;inputCollectionName&quot;: [ # User names for all collection inputs to this transform.
3367 &quot;A String&quot;,
3368 ],
3369 &quot;name&quot;: &quot;A String&quot;, # User provided name for this transform instance.
3370 &quot;kind&quot;: &quot;A String&quot;, # Type of transform.
3371 },
3372 ],
3373 &quot;executionPipelineStage&quot;: [ # Description of each stage of execution of the pipeline.
3374 { # Description of the composing transforms, names/ids, and input/outputs of a
3375 # stage of execution. Some composing transforms and sources may have been
3376 # generated by the Dataflow service during execution planning.
3377 &quot;componentSource&quot;: [ # Collections produced and consumed by component transforms of this stage.
3378 { # Description of an interstitial value between transforms in an execution
3379 # stage.
3380 &quot;userName&quot;: &quot;A String&quot;, # Human-readable name for this transform; may be user or system generated.
3381 &quot;name&quot;: &quot;A String&quot;, # Dataflow service generated name for this source.
3382 &quot;originalTransformOrCollection&quot;: &quot;A String&quot;, # User name for the original user transform or collection with which this
3383 # source is most closely associated.
3384 },
3385 ],
3386 &quot;inputSource&quot;: [ # Input sources for this stage.
3387 { # Description of an input or output of an execution stage.
3388 &quot;userName&quot;: &quot;A String&quot;, # Human-readable name for this source; may be user or system generated.
3389 &quot;originalTransformOrCollection&quot;: &quot;A String&quot;, # User name for the original user transform or collection with which this
3390 # source is most closely associated.
3391 &quot;sizeBytes&quot;: &quot;A String&quot;, # Size of the source, if measurable.
3392 &quot;name&quot;: &quot;A String&quot;, # Dataflow service generated name for this source.
3393 },
3394 ],
3395 &quot;name&quot;: &quot;A String&quot;, # Dataflow service generated name for this stage.
3396 &quot;componentTransform&quot;: [ # Transforms that comprise this execution stage.
3397 { # Description of a transform executed as part of an execution stage.
3398 &quot;name&quot;: &quot;A String&quot;, # Dataflow service generated name for this source.
3399 &quot;userName&quot;: &quot;A String&quot;, # Human-readable name for this transform; may be user or system generated.
3400 &quot;originalTransform&quot;: &quot;A String&quot;, # User name for the original user transform with which this transform is
3401 # most closely associated.
3402 },
3403 ],
3404 &quot;id&quot;: &quot;A String&quot;, # Dataflow service generated id for this stage.
3405 &quot;outputSource&quot;: [ # Output sources for this stage.
3406 { # Description of an input or output of an execution stage.
3407 &quot;userName&quot;: &quot;A String&quot;, # Human-readable name for this source; may be user or system generated.
3408 &quot;originalTransformOrCollection&quot;: &quot;A String&quot;, # User name for the original user transform or collection with which this
3409 # source is most closely associated.
3410 &quot;sizeBytes&quot;: &quot;A String&quot;, # Size of the source, if measurable.
3411 &quot;name&quot;: &quot;A String&quot;, # Dataflow service generated name for this source.
3412 },
3413 ],
3414 &quot;kind&quot;: &quot;A String&quot;, # Type of transform this stage is executing.
3415 },
3416 ],
3417 },
3418 &quot;labels&quot;: { # User-defined labels for this job.
3419 #
3420 # The labels map can contain no more than 64 entries. Entries of the labels
3421 # map are UTF8 strings that comply with the following restrictions:
3422 #
3423 # * Keys must conform to regexp: \p{Ll}\p{Lo}{0,62}
3424 # * Values must conform to regexp: [\p{Ll}\p{Lo}\p{N}_-]{0,63}
3425 # * Both keys and values are additionally constrained to be &lt;= 128 bytes in
3426 # size.
3427 &quot;a_key&quot;: &quot;A String&quot;,
3428 },
3429 &quot;projectId&quot;: &quot;A String&quot;, # The ID of the Cloud Platform project that the job belongs to.
3430 &quot;environment&quot;: { # Describes the environment in which a Dataflow Job runs. # The environment for the job.
3431 &quot;flexResourceSchedulingGoal&quot;: &quot;A String&quot;, # Which Flexible Resource Scheduling mode to run in.
3432 &quot;workerRegion&quot;: &quot;A String&quot;, # The Compute Engine region
3433 # (https://cloud.google.com/compute/docs/regions-zones/regions-zones) in
3434 # which worker processing should occur, e.g. &quot;us-west1&quot;. Mutually exclusive
3435 # with worker_zone. If neither worker_region nor worker_zone is specified,
3436 # default to the control plane&#x27;s region.
3437 &quot;userAgent&quot;: { # A description of the process that generated the request.
3438 &quot;a_key&quot;: &quot;&quot;, # Properties of the object.
3439 },
3440 &quot;serviceAccountEmail&quot;: &quot;A String&quot;, # Identity to run virtual machines as. Defaults to the default account.
3441 &quot;version&quot;: { # A structure describing which components and their versions of the service
3442 # are required in order to run the job.
3443 &quot;a_key&quot;: &quot;&quot;, # Properties of the object.
3444 },
3445 &quot;serviceKmsKeyName&quot;: &quot;A String&quot;, # If set, contains the Cloud KMS key identifier used to encrypt data
3446 # at rest, AKA a Customer Managed Encryption Key (CMEK).
3447 #
3448 # Format:
3449 # projects/PROJECT_ID/locations/LOCATION/keyRings/KEY_RING/cryptoKeys/KEY
3450 &quot;experiments&quot;: [ # The list of experiments to enable.
3451 &quot;A String&quot;,
3452 ],
3453 &quot;workerZone&quot;: &quot;A String&quot;, # The Compute Engine zone
3454 # (https://cloud.google.com/compute/docs/regions-zones/regions-zones) in
3455 # which worker processing should occur, e.g. &quot;us-west1-a&quot;. Mutually exclusive
3456 # with worker_region. If neither worker_region nor worker_zone is specified,
3457 # a zone in the control plane&#x27;s region is chosen based on available capacity.
3458 &quot;workerPools&quot;: [ # The worker pools. At least one &quot;harness&quot; worker pool must be
3459 # specified in order for the job to have workers.
3460 { # Describes one particular pool of Cloud Dataflow workers to be
3461 # instantiated by the Cloud Dataflow service in order to perform the
3462 # computations required by a job. Note that a workflow job may use
3463 # multiple pools, in order to match the various computational
3464 # requirements of the various stages of the job.
3465 &quot;onHostMaintenance&quot;: &quot;A String&quot;, # The action to take on host maintenance, as defined by the Google
3466 # Compute Engine API.
3467 &quot;sdkHarnessContainerImages&quot;: [ # Set of SDK harness containers needed to execute this pipeline. This will
3468 # only be set in the Fn API path. For non-cross-language pipelines this
3469 # should have only one entry. Cross-language pipelines will have two or more
3470 # entries.
3471 { # Defines an SDK harness container for executing Dataflow pipelines.
3472 &quot;containerImage&quot;: &quot;A String&quot;, # A Docker container image that resides in Google Container Registry.
3473 &quot;useSingleCorePerContainer&quot;: True or False, # If true, recommends that the Dataflow service use only one core per SDK
3474 # container instance with this image. If false (or unset), recommends using
3475 # more than one core per SDK container instance with this image for
3476 # efficiency. Note that the Dataflow service may choose to override this property
3477 # if needed.
3478 },
3479 ],
3480 &quot;zone&quot;: &quot;A String&quot;, # Zone to run the worker pools in. If empty or unspecified, the service
3481 # will attempt to choose a reasonable default.
3482 &quot;kind&quot;: &quot;A String&quot;, # The kind of the worker pool; currently only `harness` and `shuffle`
3483 # are supported.
3484 &quot;metadata&quot;: { # Metadata to set on the Google Compute Engine VMs.
3485 &quot;a_key&quot;: &quot;A String&quot;,
3486 },
3487 &quot;diskSourceImage&quot;: &quot;A String&quot;, # Fully qualified source image for disks.
3488 &quot;dataDisks&quot;: [ # Data disks that are used by a VM in this workflow.
3489 { # Describes the data disk used by a workflow job.
3490 &quot;sizeGb&quot;: 42, # Size of disk in GB. If zero or unspecified, the service will
3491 # attempt to choose a reasonable default.
3492 &quot;diskType&quot;: &quot;A String&quot;, # Disk storage type, as defined by Google Compute Engine. This
3493 # must be a disk type appropriate to the project and zone in which
3494 # the workers will run. If unknown or unspecified, the service
3495 # will attempt to choose a reasonable default.
3496 #
3497 # For example, the standard persistent disk type is a resource name
3498 # typically ending in &quot;pd-standard&quot;. If SSD persistent disks are
3499 # available, the resource name typically ends with &quot;pd-ssd&quot;. The
3500 # actual valid values are defined by the Google Compute Engine API,
3501 # not by the Cloud Dataflow API; consult the Google Compute Engine
3502 # documentation for more information about determining the set of
3503 # available disk types for a particular project and zone.
3504 #
3505 # Google Compute Engine Disk types are local to a particular
3506 # project in a particular zone, and so the resource name will
3507 # typically look something like this:
3508 #
3509 # compute.googleapis.com/projects/project-id/zones/zone/diskTypes/pd-standard
3510 &quot;mountPoint&quot;: &quot;A String&quot;, # Directory in a VM where disk is mounted.
3511 },
3512 ],
            &quot;packages&quot;: [ # Packages to be installed on workers.
              { # The packages that must be installed in order for a worker to run the
                  # steps of the Cloud Dataflow job that will be assigned to its worker
                  # pool.
                  #
                  # This is the mechanism by which the Cloud Dataflow SDK causes code to
                  # be loaded onto the workers. For example, the Cloud Dataflow Java SDK
                  # might use this to install jars containing the user&#x27;s code and all of the
                  # various dependencies (libraries, data files, etc.) required in order
                  # for that code to run.
                &quot;name&quot;: &quot;A String&quot;, # The name of the package.
                &quot;location&quot;: &quot;A String&quot;, # The resource to read the package from. The supported resource type is:
                    #
                    # Google Cloud Storage:
                    #
                    #   storage.googleapis.com/{bucket}
                    #   bucket.storage.googleapis.com/
              },
            ],
            &quot;teardownPolicy&quot;: &quot;A String&quot;, # Sets the policy for determining when to tear down the worker pool.
                # Allowed values are: `TEARDOWN_ALWAYS`, `TEARDOWN_ON_SUCCESS`, and
                # `TEARDOWN_NEVER`.
                # `TEARDOWN_ALWAYS` means workers are always torn down regardless of whether
                # the job succeeds. `TEARDOWN_ON_SUCCESS` means workers are torn down
                # if the job succeeds. `TEARDOWN_NEVER` means the workers are never torn
                # down.
                #
                # If the workers are not torn down by the service, they will
                # continue to run and use Google Compute Engine VM resources in the
                # user&#x27;s project until they are explicitly terminated by the user.
                # Because of this, Google recommends using the `TEARDOWN_ALWAYS`
                # policy except for small, manually supervised test jobs.
                #
                # If unknown or unspecified, the service will attempt to choose a reasonable
                # default.
            &quot;network&quot;: &quot;A String&quot;, # Network to which VMs will be assigned. If empty or unspecified,
                # the service will use the network &quot;default&quot;.
            &quot;ipConfiguration&quot;: &quot;A String&quot;, # Configuration for VM IPs.
            &quot;diskSizeGb&quot;: 42, # Size of root disk for VMs, in GB. If zero or unspecified, the service will
                # attempt to choose a reasonable default.
            &quot;autoscalingSettings&quot;: { # Settings for WorkerPool autoscaling. # Settings for autoscaling of this WorkerPool.
              &quot;maxNumWorkers&quot;: 42, # The maximum number of workers to cap scaling at.
              &quot;algorithm&quot;: &quot;A String&quot;, # The algorithm to use for autoscaling.
            },
            &quot;poolArgs&quot;: { # Extra arguments for this worker pool.
              &quot;a_key&quot;: &quot;&quot;, # Properties of the object. Contains field @type with type URL.
            },
            &quot;subnetwork&quot;: &quot;A String&quot;, # Subnetwork to which VMs will be assigned, if desired. Expected to be of
                # the form &quot;regions/REGION/subnetworks/SUBNETWORK&quot;.
            &quot;numWorkers&quot;: 42, # Number of Google Compute Engine workers in this pool needed to
                # execute the job. If zero or unspecified, the service will
                # attempt to choose a reasonable default.
            &quot;numThreadsPerWorker&quot;: 42, # The number of threads per worker harness. If empty or unspecified, the
                # service will choose a number of threads (according to the number of cores
                # on the selected machine type for batch, or 1 by convention for streaming).
            &quot;workerHarnessContainerImage&quot;: &quot;A String&quot;, # Required. Docker container image that executes the Cloud Dataflow worker
                # harness, residing in Google Container Registry.
                #
                # Deprecated for the Fn API path. Use sdk_harness_container_images instead.
            &quot;taskrunnerSettings&quot;: { # Taskrunner configuration settings. # Settings passed through to Google Compute Engine workers when
                # using the standard Dataflow task runner. Users should ignore
                # this field.
              &quot;dataflowApiVersion&quot;: &quot;A String&quot;, # The API version of the endpoint, e.g. &quot;v1b3&quot;.
              &quot;oauthScopes&quot;: [ # The OAuth2 scopes to be requested by the taskrunner in order to
                  # access the Cloud Dataflow API.
                &quot;A String&quot;,
              ],
              &quot;baseUrl&quot;: &quot;A String&quot;, # The base URL for the taskrunner to use when accessing Google Cloud APIs.
                  #
                  # When workers access Google Cloud APIs, they logically do so via
                  # relative URLs. If this field is specified, it supplies the base
                  # URL to use for resolving these relative URLs. The normative
                  # algorithm used is defined by RFC 1808, &quot;Relative Uniform Resource
                  # Locators&quot;.
                  #
                  # If not specified, the default value is &quot;http://www.googleapis.com/&quot;
              &quot;workflowFileName&quot;: &quot;A String&quot;, # The file to store the workflow in.
              &quot;logToSerialconsole&quot;: True or False, # Whether to send taskrunner log info to Google Compute Engine VM serial
                  # console.
              &quot;baseTaskDir&quot;: &quot;A String&quot;, # The location on the worker for task-specific subdirectories.
              &quot;taskUser&quot;: &quot;A String&quot;, # The UNIX user ID on the worker VM to use for tasks launched by
                  # taskrunner; e.g. &quot;root&quot;.
              &quot;vmId&quot;: &quot;A String&quot;, # The ID string of the VM.
              &quot;alsologtostderr&quot;: True or False, # Whether to also send taskrunner log info to stderr.
              &quot;parallelWorkerSettings&quot;: { # Provides data to pass through to the worker harness. # The settings to pass to the parallel worker harness.
                &quot;shuffleServicePath&quot;: &quot;A String&quot;, # The Shuffle service path relative to the root URL, for example,
                    # &quot;shuffle/v1beta1&quot;.
                &quot;tempStoragePrefix&quot;: &quot;A String&quot;, # The prefix of the resources the system should use for temporary
                    # storage.
                    #
                    # The supported resource type is:
                    #
                    # Google Cloud Storage:
                    #
                    #   storage.googleapis.com/{bucket}/{object}
                    #   bucket.storage.googleapis.com/{object}
                &quot;reportingEnabled&quot;: True or False, # Whether to send work progress updates to the service.
                &quot;servicePath&quot;: &quot;A String&quot;, # The Cloud Dataflow service path relative to the root URL, for example,
                    # &quot;dataflow/v1b3/projects&quot;.
                &quot;baseUrl&quot;: &quot;A String&quot;, # The base URL for accessing Google Cloud APIs.
                    #
                    # When workers access Google Cloud APIs, they logically do so via
                    # relative URLs. If this field is specified, it supplies the base
                    # URL to use for resolving these relative URLs. The normative
                    # algorithm used is defined by RFC 1808, &quot;Relative Uniform Resource
                    # Locators&quot;.
                    #
                    # If not specified, the default value is &quot;http://www.googleapis.com/&quot;
                &quot;workerId&quot;: &quot;A String&quot;, # The ID of the worker running this pipeline.
              },
              &quot;harnessCommand&quot;: &quot;A String&quot;, # The command to launch the worker harness.
              &quot;logDir&quot;: &quot;A String&quot;, # The directory on the VM to store logs.
              &quot;streamingWorkerMainClass&quot;: &quot;A String&quot;, # The streaming worker main class name.
              &quot;languageHint&quot;: &quot;A String&quot;, # The suggested backend language.
              &quot;taskGroup&quot;: &quot;A String&quot;, # The UNIX group ID on the worker VM to use for tasks launched by
                  # taskrunner; e.g. &quot;wheel&quot;.
              &quot;logUploadLocation&quot;: &quot;A String&quot;, # Indicates where to put logs. If this is not specified, the logs
                  # will not be uploaded.
                  #
                  # The supported resource type is:
                  #
                  # Google Cloud Storage:
                  #   storage.googleapis.com/{bucket}/{object}
                  #   bucket.storage.googleapis.com/{object}
              &quot;commandlinesFileName&quot;: &quot;A String&quot;, # The file to store preprocessing commands in.
              &quot;continueOnException&quot;: True or False, # Whether to continue taskrunner if an exception is hit.
              &quot;tempStoragePrefix&quot;: &quot;A String&quot;, # The prefix of the resources the taskrunner should use for
                  # temporary storage.
                  #
                  # The supported resource type is:
                  #
                  # Google Cloud Storage:
                  #   storage.googleapis.com/{bucket}/{object}
                  #   bucket.storage.googleapis.com/{object}
            },
            &quot;diskType&quot;: &quot;A String&quot;, # Type of root disk for VMs. If empty or unspecified, the service will
                # attempt to choose a reasonable default.
            &quot;defaultPackageSet&quot;: &quot;A String&quot;, # The default package set to install. This allows the service to
                # select a default set of packages which are useful to worker
                # harnesses written in a particular language.
            &quot;machineType&quot;: &quot;A String&quot;, # Machine type (e.g. &quot;n1-standard-1&quot;). If empty or unspecified, the
                # service will attempt to choose a reasonable default.
          },
        ],
        &quot;tempStoragePrefix&quot;: &quot;A String&quot;, # The prefix of the resources the system should use for temporary
            # storage. The system will append the suffix &quot;/temp-{JOBNAME}&quot; to
            # this resource prefix, where {JOBNAME} is the value of the
            # job_name field. The resulting bucket and object prefix is used
            # as the prefix of the resources used to store temporary data
            # needed during the job execution. NOTE: This will override the
            # value in taskrunner_settings.
            # The supported resource type is:
            #
            # Google Cloud Storage:
            #
            #   storage.googleapis.com/{bucket}/{object}
            #   bucket.storage.googleapis.com/{object}
        &quot;internalExperiments&quot;: { # Experimental settings.
          &quot;a_key&quot;: &quot;&quot;, # Properties of the object. Contains field @type with type URL.
        },
        &quot;sdkPipelineOptions&quot;: { # The Cloud Dataflow SDK pipeline options specified by the user. These
            # options are passed through the service and are used to recreate the
            # SDK pipeline options on the worker in a language agnostic and platform
            # independent way.
          &quot;a_key&quot;: &quot;&quot;, # Properties of the object.
        },
        &quot;dataset&quot;: &quot;A String&quot;, # The dataset for the current project where various workflow
            # related tables are stored.
            #
            # The supported resource type is:
            #
            # Google BigQuery:
            #   bigquery.googleapis.com/{dataset}
        &quot;clusterManagerApiService&quot;: &quot;A String&quot;, # The type of cluster manager API to use. If unknown or
            # unspecified, the service will attempt to choose a reasonable
            # default. This should be in the form of the API service name,
            # e.g. &quot;compute.googleapis.com&quot;.
      },
      &quot;stepsLocation&quot;: &quot;A String&quot;, # The GCS location where the steps are stored.
      &quot;steps&quot;: [ # Exactly one of steps or steps_location should be specified.
          #
          # The top-level steps that constitute the entire job.
        { # Defines a particular step within a Cloud Dataflow job.
            #
            # A job consists of multiple steps, each of which performs some
            # specific operation as part of the overall job. Data is typically
            # passed from one step to another as part of the job.
            #
            # Here&#x27;s an example of a sequence of steps which together implement a
            # Map-Reduce job:
            #
            #   * Read a collection of data from some source, parsing the
            #     collection&#x27;s elements.
            #
            #   * Validate the elements.
            #
            #   * Apply a user-defined function to map each element to some value
            #     and extract an element-specific key value.
            #
            #   * Group elements with the same key into a single element with
            #     that key, transforming a multiply-keyed collection into a
            #     uniquely-keyed collection.
            #
            #   * Write the elements out to some data sink.
            #
            # Note that the Cloud Dataflow service may be used to run many different
            # types of jobs, not just Map-Reduce.
          &quot;kind&quot;: &quot;A String&quot;, # The kind of step in the Cloud Dataflow job.
          &quot;properties&quot;: { # Named properties associated with the step. Each kind of
              # predefined step has its own required set of properties.
              # Must be provided on Create. Only retrieved with JOB_VIEW_ALL.
            &quot;a_key&quot;: &quot;&quot;, # Properties of the object.
          },
          &quot;name&quot;: &quot;A String&quot;, # The name that identifies the step. This must be unique for each
              # step with respect to all other steps in the Cloud Dataflow job.
        },
      ],
      &quot;stageStates&quot;: [ # This field may be mutated by the Cloud Dataflow service;
          # callers cannot mutate it.
        { # A message describing the state of a particular execution stage.
          &quot;executionStageState&quot;: &quot;A String&quot;, # Execution stage states allow the same set of values as JobState.
          &quot;executionStageName&quot;: &quot;A String&quot;, # The name of the execution stage.
          &quot;currentStateTime&quot;: &quot;A String&quot;, # The time at which the stage transitioned to this state.
        },
      ],
      &quot;replacedByJobId&quot;: &quot;A String&quot;, # If another job is an update of this job (and thus, this job is in
          # `JOB_STATE_UPDATED`), this field contains the ID of that job.
      &quot;jobMetadata&quot;: { # Metadata available primarily for filtering jobs. Will be included in the
          # ListJob response and Job SUMMARY view.
          #
          # This field is populated by the Dataflow service to support filtering jobs
          # by the metadata values provided here. Populated for ListJobs and all GetJob
          # views SUMMARY and higher.
        &quot;sdkVersion&quot;: { # The version of the SDK used to run the job. # The SDK version used to run the job.
          &quot;sdkSupportStatus&quot;: &quot;A String&quot;, # The support status for this SDK version.
          &quot;versionDisplayName&quot;: &quot;A String&quot;, # A readable string describing the version of the SDK.
          &quot;version&quot;: &quot;A String&quot;, # The version of the SDK used to run the job.
        },
        &quot;bigTableDetails&quot;: [ # Identification of a BigTable source used in the Dataflow job.
          { # Metadata for a BigTable connector used by the job.
            &quot;instanceId&quot;: &quot;A String&quot;, # InstanceId accessed in the connection.
            &quot;tableId&quot;: &quot;A String&quot;, # TableId accessed in the connection.
            &quot;projectId&quot;: &quot;A String&quot;, # ProjectId accessed in the connection.
          },
        ],
        &quot;pubsubDetails&quot;: [ # Identification of a PubSub source used in the Dataflow job.
          { # Metadata for a PubSub connector used by the job.
            &quot;subscription&quot;: &quot;A String&quot;, # Subscription used in the connection.
            &quot;topic&quot;: &quot;A String&quot;, # Topic accessed in the connection.
          },
        ],
        &quot;bigqueryDetails&quot;: [ # Identification of a BigQuery source used in the Dataflow job.
          { # Metadata for a BigQuery connector used by the job.
            &quot;dataset&quot;: &quot;A String&quot;, # Dataset accessed in the connection.
            &quot;projectId&quot;: &quot;A String&quot;, # Project accessed in the connection.
            &quot;query&quot;: &quot;A String&quot;, # Query used to access data in the connection.
            &quot;table&quot;: &quot;A String&quot;, # Table accessed in the connection.
          },
        ],
        &quot;fileDetails&quot;: [ # Identification of a File source used in the Dataflow job.
          { # Metadata for a File connector used by the job.
            &quot;filePattern&quot;: &quot;A String&quot;, # File Pattern used to access files by the connector.
          },
        ],
        &quot;datastoreDetails&quot;: [ # Identification of a Datastore source used in the Dataflow job.
          { # Metadata for a Datastore connector used by the job.
            &quot;namespace&quot;: &quot;A String&quot;, # Namespace used in the connection.
            &quot;projectId&quot;: &quot;A String&quot;, # ProjectId accessed in the connection.
          },
        ],
        &quot;spannerDetails&quot;: [ # Identification of a Spanner source used in the Dataflow job.
          { # Metadata for a Spanner connector used by the job.
            &quot;instanceId&quot;: &quot;A String&quot;, # InstanceId accessed in the connection.
            &quot;databaseId&quot;: &quot;A String&quot;, # DatabaseId accessed in the connection.
            &quot;projectId&quot;: &quot;A String&quot;, # ProjectId accessed in the connection.
          },
        ],
      },
      &quot;location&quot;: &quot;A String&quot;, # The [regional endpoint]
          # (https://cloud.google.com/dataflow/docs/concepts/regional-endpoints) that
          # contains this job.
      &quot;transformNameMapping&quot;: { # The map of transform name prefixes of the job to be replaced to the
          # corresponding name prefixes of the new job.
        &quot;a_key&quot;: &quot;A String&quot;,
      },
      &quot;startTime&quot;: &quot;A String&quot;, # The timestamp when the job was started (transitioned to JOB_STATE_PENDING).
          # Flexible resource scheduling jobs are started with some delay after job
          # creation, so start_time is unset before start and is updated when the
          # job is started by the Cloud Dataflow service. For other jobs, start_time
          # always equals create_time and is immutable and set by the Cloud Dataflow
          # service.
      &quot;clientRequestId&quot;: &quot;A String&quot;, # The client&#x27;s unique identifier of the job, re-used across retried attempts.
          # If this field is set, the service will ensure its uniqueness.
          # The request to create a job will fail if the service has knowledge of a
          # previously submitted job with the same client&#x27;s ID and job name.
          # The caller may use this field to ensure idempotence of job
          # creation across retried attempts to create a job.
          # By default, the field is empty and, in that case, the service ignores it.
      &quot;executionInfo&quot;: { # Additional information about how a Cloud Dataflow job will be executed that
          # isn&#x27;t contained in the submitted job.
          #
          # Deprecated.
        &quot;stages&quot;: { # A mapping from each stage to the information about that stage.
          &quot;a_key&quot;: { # Contains information about how a particular
              # google.dataflow.v1beta3.Step will be executed.
            &quot;stepName&quot;: [ # The steps associated with the execution stage.
                # Note that stages may have several steps, and that a given step
                # might be run by more than one stage.
              &quot;A String&quot;,
            ],
          },
        },
      },
      &quot;type&quot;: &quot;A String&quot;, # The type of Cloud Dataflow job.
      &quot;createTime&quot;: &quot;A String&quot;, # The timestamp when the job was initially created. Immutable and set by the
          # Cloud Dataflow service.
      &quot;tempFiles&quot;: [ # A set of files the system should be aware of that are used
          # for temporary storage. These temporary files will be
          # removed on job completion.
          # No duplicates are allowed.
          # No file patterns are supported.
          #
          # The supported files are:
          #
          # Google Cloud Storage:
          #
          #   storage.googleapis.com/{bucket}/{object}
          #   bucket.storage.googleapis.com/{object}
        &quot;A String&quot;,
      ],
      &quot;id&quot;: &quot;A String&quot;, # The unique ID of this job.
          #
          # This field is set by the Cloud Dataflow service when the Job is
          # created, and is immutable for the life of the job.
      &quot;requestedState&quot;: &quot;A String&quot;, # The job&#x27;s requested state.
          #
          # `UpdateJob` may be used to switch between the `JOB_STATE_STOPPED` and
          # `JOB_STATE_RUNNING` states, by setting requested_state. `UpdateJob` may
          # also be used to directly set a job&#x27;s requested state to
          # `JOB_STATE_CANCELLED` or `JOB_STATE_DONE`, irrevocably terminating the
          # job if it has not already reached a terminal state.
      &quot;replaceJobId&quot;: &quot;A String&quot;, # If this job is an update of an existing job, this field is the job ID
          # of the job it replaced.
          #
          # When sending a `CreateJobRequest`, you can update a job by specifying it
          # here. The job named here is stopped, and its intermediate state is
          # transferred to this job.
      &quot;createdFromSnapshotId&quot;: &quot;A String&quot;, # If this is specified, the job&#x27;s initial state is populated from the given
          # snapshot.
      &quot;currentState&quot;: &quot;A String&quot;, # The current state of the job.
          #
          # Jobs are created in the `JOB_STATE_STOPPED` state unless otherwise
          # specified.
          #
          # A job in the `JOB_STATE_RUNNING` state may asynchronously enter a
          # terminal state. After a job has reached a terminal state, no
          # further state updates may be made.
          #
          # This field may be mutated by the Cloud Dataflow service;
          # callers cannot mutate it.
      &quot;name&quot;: &quot;A String&quot;, # The user-specified Cloud Dataflow job name.
          #
          # Only one Job with a given name may exist in a project at any
          # given time. If a caller attempts to create a Job with the same
          # name as an already-existing Job, the attempt returns the
          # existing Job.
          #
          # The name must match the regular expression
          # `[a-z]([-a-z0-9]{0,38}[a-z0-9])?`
      &quot;currentStateTime&quot;: &quot;A String&quot;, # The timestamp associated with the current state.
    }</pre>
</div>
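<p>The job <code>name</code> constraint documented above can be checked on the client before a request is sent. The following is a minimal sketch, not part of the Dataflow client library: the helper <code>is_valid_job_name</code> is a hypothetical name introduced here, and it assumes the documented pattern is meant to match the whole string.</p>

```python
import re

# Pattern copied from the "name" field description above.
# Assumption: the service applies it to the entire job name.
JOB_NAME_PATTERN = re.compile(r"[a-z]([-a-z0-9]{0,38}[a-z0-9])?")


def is_valid_job_name(name):
    """Hypothetical helper: True if the whole string matches the documented pattern."""
    return JOB_NAME_PATTERN.fullmatch(name) is not None


if __name__ == "__main__":
    for candidate in ("wordcount-2020", "Wordcount", "job-"):
        print(candidate, is_valid_job_name(candidate))
```

<p>Under this reading, a name starts with a lowercase letter, cannot end with a hyphen, and is at most 40 characters long.</p>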

</body></html>