VictoriaMetrics/vendor/cloud.google.com/go/storage/doc.go

415 lines
16 KiB
Go
Raw Normal View History

// Copyright 2016 Google LLC
//
// Licensed under the Apache License, Version 2.0 (the "License");
// you may not use this file except in compliance with the License.
// You may obtain a copy of the License at
//
// http://www.apache.org/licenses/LICENSE-2.0
//
// Unless required by applicable law or agreed to in writing, software
// distributed under the License is distributed on an "AS IS" BASIS,
// WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
// See the License for the specific language governing permissions and
// limitations under the License.
/*
Package storage provides an easy way to work with Google Cloud Storage.
Google Cloud Storage stores data in named objects, which are grouped into buckets.
More information about Google Cloud Storage is available at
https://cloud.google.com/storage/docs.
2022-01-27 11:16:16 +00:00
See https://pkg.go.dev/cloud.google.com/go for authentication, timeouts,
connection pooling and similar aspects of this package.
2022-08-14 21:53:41 +00:00
# Creating a Client
2022-09-26 12:44:55 +00:00
To start working with this package, create a [Client]:
2022-08-14 21:53:41 +00:00
ctx := context.Background()
client, err := storage.NewClient(ctx)
if err != nil {
// TODO: Handle error.
}
2020-09-01 14:43:21 +00:00
The client will use your default application credentials. Clients should be
2022-09-26 12:44:55 +00:00
reused instead of created as needed. The methods of [Client] are safe for
2020-09-01 14:43:21 +00:00
concurrent use by multiple goroutines.
2023-03-15 20:24:12 +00:00
You may configure the client by passing in options from the [google.golang.org/api/option]
package. You may also use options defined in this package, such as [WithJSONReads].
If you only wish to access public data, you can create
an unauthenticated client with
2022-08-14 21:53:41 +00:00
client, err := storage.NewClient(ctx, option.WithoutAuthentication())
To use an emulator with this library, you can set the STORAGE_EMULATOR_HOST
environment variable to the address at which your emulator is running. This will
send requests to that address instead of to Cloud Storage. You can then create
and use a client as usual:
2022-08-14 21:53:41 +00:00
// Set STORAGE_EMULATOR_HOST environment variable.
err := os.Setenv("STORAGE_EMULATOR_HOST", "localhost:9000")
if err != nil {
// TODO: Handle error.
}
2022-08-14 21:53:41 +00:00
// Create client as usual.
client, err := storage.NewClient(ctx)
if err != nil {
// TODO: Handle error.
}
2022-08-14 21:53:41 +00:00
// This request is now directed to http://localhost:9000/storage/v1/b
// instead of https://storage.googleapis.com/storage/v1/b
if err := client.Bucket("my-bucket").Create(ctx, projectID, nil); err != nil {
// TODO: Handle error.
}
Please note that there is no official emulator for Cloud Storage.
2022-08-14 21:53:41 +00:00
# Buckets
A Google Cloud Storage bucket is a collection of objects. To work with a
bucket, make a bucket handle:
2022-08-14 21:53:41 +00:00
bkt := client.Bucket(bucketName)
A handle is a reference to a bucket. You can have a handle even if the
bucket doesn't exist yet. To create a bucket in Google Cloud Storage,
2022-09-26 12:44:55 +00:00
call [BucketHandle.Create]:
2022-08-14 21:53:41 +00:00
if err := bkt.Create(ctx, projectID, nil); err != nil {
// TODO: Handle error.
}
Note that although buckets are associated with projects, bucket names are
global across all projects.
Each bucket has associated metadata, represented in this package by
2022-09-26 12:44:55 +00:00
[BucketAttrs]. The third argument to [BucketHandle.Create] allows you to set
the initial [BucketAttrs] of a bucket. To retrieve a bucket's attributes, use
[BucketHandle.Attrs]:
2022-08-14 21:53:41 +00:00
attrs, err := bkt.Attrs(ctx)
if err != nil {
// TODO: Handle error.
}
fmt.Printf("bucket %s, created at %s, is located in %s with storage class %s\n",
attrs.Name, attrs.Created, attrs.Location, attrs.StorageClass)
2022-08-14 21:53:41 +00:00
# Objects
An object holds arbitrary data as a sequence of bytes, like a file. You
refer to objects using a handle, just as with buckets, but unlike buckets
you don't explicitly create an object. Instead, the first time you write
2022-09-26 12:44:55 +00:00
to an object it will be created. You can use the standard Go [io.Reader]
and [io.Writer] interfaces to read and write object data:
2022-08-14 21:53:41 +00:00
obj := bkt.Object("data")
// Write something to obj.
// w implements io.Writer.
w := obj.NewWriter(ctx)
// Write some text to obj. This will either create the object or overwrite whatever is there already.
if _, err := fmt.Fprintf(w, "This object contains text.\n"); err != nil {
// TODO: Handle error.
}
// Close, just like writing a file.
if err := w.Close(); err != nil {
// TODO: Handle error.
}
// Read it back.
r, err := obj.NewReader(ctx)
if err != nil {
// TODO: Handle error.
}
defer r.Close()
if _, err := io.Copy(os.Stdout, r); err != nil {
// TODO: Handle error.
}
// Prints "This object contains text."
2022-09-26 12:44:55 +00:00
Objects also have attributes, which you can fetch with [ObjectHandle.Attrs]:
2022-08-14 21:53:41 +00:00
objAttrs, err := obj.Attrs(ctx)
if err != nil {
// TODO: Handle error.
}
fmt.Printf("object %s has size %d and can be read using %s\n",
objAttrs.Name, objAttrs.Size, objAttrs.MediaLink)
2022-08-14 21:53:41 +00:00
# Listing objects
2019-11-19 19:29:35 +00:00
2022-09-26 12:44:55 +00:00
Listing objects in a bucket is done with the [BucketHandle.Objects] method:
2019-11-19 19:29:35 +00:00
2022-08-14 21:53:41 +00:00
query := &storage.Query{Prefix: ""}
var names []string
it := bkt.Objects(ctx, query)
for {
attrs, err := it.Next()
if err == iterator.Done {
break
}
if err != nil {
log.Fatal(err)
}
names = append(names, attrs.Name)
}
2019-11-19 19:29:35 +00:00
2020-09-23 11:23:39 +00:00
Objects are listed lexicographically by name. To filter objects
2022-09-26 12:44:55 +00:00
lexicographically, [Query.StartOffset] and/or [Query.EndOffset] can be used:
2020-09-23 11:23:39 +00:00
2022-08-14 21:53:41 +00:00
query := &storage.Query{
Prefix: "",
StartOffset: "bar/", // Only list objects lexicographically >= "bar/"
EndOffset: "foo/", // Only list objects lexicographically < "foo/"
}
2020-09-23 11:23:39 +00:00
2022-08-14 21:53:41 +00:00
// ... as before
2020-09-23 11:23:39 +00:00
2019-11-19 19:29:35 +00:00
If only a subset of object attributes is needed when listing, specifying this
2022-09-26 12:44:55 +00:00
subset using [Query.SetAttrSelection] may speed up the listing process:
2019-11-19 19:29:35 +00:00
2022-08-14 21:53:41 +00:00
query := &storage.Query{Prefix: ""}
query.SetAttrSelection([]string{"Name"})
2019-11-19 19:29:35 +00:00
2022-08-14 21:53:41 +00:00
// ... as before
2019-11-19 19:29:35 +00:00
2022-08-14 21:53:41 +00:00
# ACLs
Both objects and buckets have ACLs (Access Control Lists). An ACL is a list of
ACLRules, each of which specifies the role of a user, group or project. ACLs
are suitable for fine-grained control, but you may prefer using IAM to control
2022-09-26 12:44:55 +00:00
access at the project level (see [Cloud Storage IAM docs].
2022-09-26 12:44:55 +00:00
To list the ACLs of a bucket or object, obtain an [ACLHandle] and call [ACLHandle.List]:
2022-08-14 21:53:41 +00:00
acls, err := obj.ACL().List(ctx)
if err != nil {
// TODO: Handle error.
}
for _, rule := range acls {
fmt.Printf("%s has role %s\n", rule.Entity, rule.Role)
}
You can also set and delete ACLs.
2022-08-14 21:53:41 +00:00
# Conditions
Every object has a generation and a metageneration. The generation changes
whenever the content changes, and the metageneration changes whenever the
2022-09-26 12:44:55 +00:00
metadata changes. [Conditions] let you check these values before an operation;
the operation only executes if the conditions match. You can use conditions to
prevent race conditions in read-modify-write operations.
For example, say you've read an object's metadata into objAttrs. Now
you want to write to that object, but only if its contents haven't changed
since you read it. Here is how to express that:
2022-08-14 21:53:41 +00:00
w = obj.If(storage.Conditions{GenerationMatch: objAttrs.Generation}).NewWriter(ctx)
// Proceed with writing as above.
2022-08-14 21:53:41 +00:00
# Signed URLs
You can obtain a URL that lets anyone read or write an object for a limited time.
2022-04-12 09:51:54 +00:00
Signing a URL requires credentials authorized to sign a URL. To use the same
2022-09-26 12:44:55 +00:00
authentication that was used when instantiating the Storage client, use
[BucketHandle.SignedURL].
2022-04-12 09:51:54 +00:00
2022-08-14 21:53:41 +00:00
url, err := client.Bucket(bucketName).SignedURL(objectName, opts)
if err != nil {
// TODO: Handle error.
}
fmt.Println(url)
2022-04-12 09:51:54 +00:00
2022-09-26 12:44:55 +00:00
You can also sign a URL without creating a client. See the documentation of
[SignedURL] for details.
2022-08-14 21:53:41 +00:00
url, err := storage.SignedURL(bucketName, "shared-object", opts)
if err != nil {
// TODO: Handle error.
}
fmt.Println(url)
2022-08-14 21:53:41 +00:00
# Post Policy V4 Signed Request
2020-05-15 12:03:06 +00:00
A type of signed request that allows uploads through HTML forms directly to Cloud Storage with
temporary permission. Conditions can be applied to restrict how the HTML form is used and exercised
by a user.
2022-09-26 12:44:55 +00:00
For more information, please see the [XML POST Object docs] as well
as the documentation of [BucketHandle.GenerateSignedPostPolicyV4].
2020-05-15 12:03:06 +00:00
2022-08-14 21:53:41 +00:00
pv4, err := client.Bucket(bucketName).GenerateSignedPostPolicyV4(objectName, opts)
if err != nil {
// TODO: Handle error.
}
fmt.Printf("URL: %s\nFields; %v\n", pv4.URL, pv4.Fields)
2020-05-15 12:03:06 +00:00
2022-09-26 12:44:55 +00:00
# Credential requirements for signing
If the GoogleAccessID and PrivateKey option fields are not provided, they will
be automatically detected by [BucketHandle.SignedURL] and
[BucketHandle.GenerateSignedPostPolicyV4] if any of the following are true:
- you are authenticated to the Storage Client with a service account's
downloaded private key, either directly in code or by setting the
GOOGLE_APPLICATION_CREDENTIALS environment variable (see [Other Environments]),
- your application is running on Google Compute Engine (GCE), or
- you are logged into [gcloud using application default credentials]
with [impersonation enabled].
Detecting GoogleAccessID may not be possible if you are authenticated using a
token source or using [option.WithHTTPClient]. In this case, you can provide a
service account email for GoogleAccessID and the client will attempt to sign
the URL or Post Policy using that service account.
To generate the signature, you must have:
- iam.serviceAccounts.signBlob permissions on the GoogleAccessID service
account, and
- the [IAM Service Account Credentials API] enabled (unless authenticating
with a downloaded private key).
2022-08-14 21:53:41 +00:00
# Errors
2022-09-26 12:44:55 +00:00
Errors returned by this client are often of the type [googleapi.Error].
These errors can be introspected for more information by using [errors.As]
with the richer [googleapi.Error] type. For example:
2021-10-11 18:51:32 +00:00
var e *googleapi.Error
if ok := errors.As(err, &e); ok {
if e.Code == 409 { ... }
}
2022-01-27 11:16:16 +00:00
2022-08-14 21:53:41 +00:00
# Retrying failed requests
2022-01-27 11:16:16 +00:00
Methods in this package may retry calls that fail with transient errors.
Retrying continues indefinitely unless the controlling context is canceled, the
client is closed, or a non-transient error is received. To stop retries from
continuing, use context timeouts or cancellation.
The retry strategy in this library follows best practices for Cloud Storage. By
default, operations are retried only if they are idempotent, and exponential
backoff with jitter is employed. In addition, errors are only retried if they
2022-09-26 12:44:55 +00:00
are defined as transient by the service. See the [Cloud Storage retry docs]
for more information.
2022-01-27 11:16:16 +00:00
Users can configure non-default retry behavior for a single library call (using
2022-09-26 12:44:55 +00:00
[BucketHandle.Retryer] and [ObjectHandle.Retryer]) or for all calls made by a
client (using [Client.SetRetry]). For example:
2022-01-27 11:16:16 +00:00
o := client.Bucket(bucket).Object(object).Retryer(
// Use WithBackoff to change the timing of the exponential backoff.
storage.WithBackoff(gax.Backoff{
Initial: 2 * time.Second,
}),
// Use WithPolicy to configure the idempotency policy. RetryAlways will
// retry the operation even if it is non-idempotent.
storage.WithPolicy(storage.RetryAlways),
)
// Use a context timeout to set an overall deadline on the call, including all
// potential retries.
ctx, cancel := context.WithTimeout(ctx, 5*time.Second)
defer cancel()
// Delete an object using the specified strategy and timeout.
if err := o.Delete(ctx); err != nil {
// Handle err.
}
2022-09-26 12:44:55 +00:00
2023-08-29 11:12:56 +00:00
# Sending Custom Headers
You can add custom headers to any API call made by this package by using
[callctx.SetHeaders] on the context which is passed to the method. For example,
to add a [custom audit logging] header:
ctx := context.Background()
ctx = callctx.SetHeaders(ctx, "x-goog-custom-audit-<key>", "<value>")
// Use client as usual with the context and the additional headers will be sent.
client.Bucket("my-bucket").Attrs(ctx)
2024-11-29 12:48:50 +00:00
# gRPC API
2023-10-02 19:49:16 +00:00
2024-11-29 12:48:50 +00:00
This package includes support for the Cloud Storage gRPC API. The
implementation uses gRPC rather than the Default
JSON & XML APIs to make requests to Cloud Storage.
The Go Storage gRPC client is generally available.
The Notifications, Serivce Account HMAC
and GetServiceAccount RPCs are not supported through the gRPC client.
2023-10-02 19:49:16 +00:00
To create a client which will use gRPC, use the alternate constructor:
ctx := context.Background()
client, err := storage.NewGRPCClient(ctx)
if err != nil {
// TODO: Handle error.
}
// Use client as usual.
2024-11-29 12:48:50 +00:00
Using the gRPC API inside GCP with a bucket in the same region can allow for
[Direct Connectivity] (enabling requests to skip some proxy steps and reducing
response latency). A warning is emmitted if gRPC is not used within GCP to
warn that Direct Connectivity could not be initialized. Direct Connectivity
is not required to access the gRPC API.
2023-10-02 19:49:16 +00:00
2024-11-29 12:48:50 +00:00
Dependencies for the gRPC API may slightly increase the size of binaries for
applications depending on this package. If you are not using gRPC, you can use
the build tag `disable_grpc_modules` to opt out of these dependencies and
reduce the binary size.
The gRPC client emits metrics by default and will export the
gRPC telemetry discussed in [gRFC/66] and [gRFC/78] to
[Google Cloud Monitoring]. The metrics are accessible through Cloud Monitoring
API and you incur no additional cost for publishing the metrics. Google Cloud
Support can use this information to more quickly diagnose problems related to
GCS and gRPC.
Sending this data does not incur any billing charges, and requires minimal
CPU (a single RPC every minute) or memory (a few KiB to batch the
telemetry).
To access the metrics you can view them through Cloud Monitoring
[metric explorer] with the prefix `storage.googleapis.com/client`. Metrics are emitted
every minute.
You can disable metrics using the following example when creating a new gRPC
client using [WithDisabledClientMetrics].
The metrics exporter uses Cloud Monitoring API which determines
project ID and credentials doing the following:
* Project ID is determined using OTel Resource Detector for the environment
otherwise it falls back to the project provided by [google.FindCredentials].
* Credentials are determined using [Application Default Credentials]. The
principal must have `roles/monitoring.metricWriter` role granted. If not a
logged warning will be emitted. Subsequent are silenced to prevent noisy logs.
2023-10-02 19:49:16 +00:00
2024-05-22 19:58:38 +00:00
# Storage Control API
Certain control plane and long-running operations for Cloud Storage (including Folder
and Managed Folder operations) are supported via the autogenerated Storage Control
client, which is available as a subpackage in this module. See package docs at
[cloud.google.com/go/storage/control/apiv2] or reference the [Storage Control API] docs.
2024-11-29 12:48:50 +00:00
[Application Default Credentials]: https://cloud.google.com/docs/authentication/application-default-credentials
[google.FindCredentials]: https://pkg.go.dev/golang.org/x/oauth2/google#FindDefaultCredentials
[gRFC/66]: https://github.com/grpc/proposal/blob/master/A66-otel-stats.md
[gRFC/78]: https://github.com/grpc/proposal/blob/master/A78-grpc-metrics-wrr-pf-xds.md
[Google Cloud Monitoring]: https://cloud.google.com/monitoring/docs
2022-09-26 12:44:55 +00:00
[Cloud Storage IAM docs]: https://cloud.google.com/storage/docs/access-control/iam
[XML POST Object docs]: https://cloud.google.com/storage/docs/xml-api/post-object
[Cloud Storage retry docs]: https://cloud.google.com/storage/docs/retry-strategy
[Other Environments]: https://cloud.google.com/storage/docs/authentication#libauth
[gcloud using application default credentials]: https://cloud.google.com/sdk/gcloud/reference/auth/application-default/login
[impersonation enabled]: https://cloud.google.com/sdk/gcloud/reference#--impersonate-service-account
[IAM Service Account Credentials API]: https://console.developers.google.com/apis/api/iamcredentials.googleapis.com/overview
2023-08-29 11:12:56 +00:00
[custom audit logging]: https://cloud.google.com/storage/docs/audit-logging#add-custom-metadata
2024-05-22 19:58:38 +00:00
[Storage Control API]: https://cloud.google.com/storage/docs/reference/rpc/google.storage.control.v2
2024-11-29 12:48:50 +00:00
[metric explorer]: https://console.cloud.google.com/projectselector/monitoring/metrics-explorer
[Direct Connectivity]: https://cloud.google.com/vpc-service-controls/docs/set-up-private-connectivity#direct-connectivity
*/
package storage // import "cloud.google.com/go/storage"