Add option to use S3 accelerated endpoint for faster transfers #3675

pchoisel · 2025-05-26T14:26:57Z

No description provided.

manthey · 2025-05-27T13:48:43Z

In a quick read about the use_accelerate_endpoint, it sounds like it only has benefit when the client is in a different region from the bucket and the files are largish. Should there be any sort of check for either of these conditions, since using use_accelerate_endpoint incurs higher transfer costs?

pchoisel · 2025-05-28T10:21:46Z

> In a quick read about the use_accelerate_endpoint, it sounds like it only has benefit when the client is in a different region from the bucket and the files are largish. Should there be any sort of check for either of these conditions, since using use_accelerate_endpoint incurs higher transfer costs?

That sounds good, but I'm not sure how to implement this.
I can easily find the location of the S3 bucket, but it's harder to get the location of the client. Maybe using its IP address? But that wouldn't work if Girder is reverse proxied.
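
To illustrate why the client IP is hard to rely on behind a reverse proxy, here is a minimal sketch, not Girder code. The function name `client_ip` and the `trusted_proxies` argument are hypothetical. It assumes the proxy sets an `X-Forwarded-For` header whose leftmost entry is the original client.

```python
def client_ip(remote_addr, forwarded_for=None, trusted_proxies=frozenset()):
    """Best-effort guess at the real client address (hypothetical helper).

    remote_addr:     address of the peer that connected to the server
    forwarded_for:   raw X-Forwarded-For header value, if any
    trusted_proxies: addresses of proxies whose header we accept
    """
    # Only believe the header when the direct peer is a proxy we control;
    # otherwise any client could spoof its own location.
    if remote_addr in trusted_proxies and forwarded_for:
        # The leftmost entry is the address the first proxy saw.
        return forwarded_for.split(',')[0].strip()
    return remote_addr


# Behind a proxy at 10.0.0.1, the socket peer is the proxy, not the user.
seen = client_ip('10.0.0.1', '203.0.113.7, 10.0.0.1', {'10.0.0.1'})
```

Without the header, or without knowing which proxies to trust, the server only sees the proxy's address, so any geolocation would describe the proxy rather than the user.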

zachmullen · 2025-05-28T12:01:57Z

@pchoisel it might help to explain the problem you're trying to solve, then we may be able to provide better input on design.

pchoisel · 2025-05-30T08:40:10Z

@zachmullen Thanks.
A customer wants to enable S3 transfer acceleration for his users that are not in the US to speed up their uploads/downloads to a Girder-based application using an S3 assetstore.
You can enable that by changing the domain name used to connect to S3, but Boto does that automatically if you set use_accelerate_endpoint in the config.

To make this more optimized, I could perhaps use this accelerated endpoint only when Girder sends a transfer link to a user so their client can transfer data to S3, and not when Girder directly transfers data to S3?
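
As a rough illustration of what "changing the domain name" means, the accelerated endpoint is a different hostname for the same bucket. The sketch below is neither Girder nor Boto code: `s3_host` and its arguments are made-up names, and it only covers virtual-hosted-style addressing.

```python
def s3_host(bucket, region, accelerate=False):
    """Return the hostname a client would talk to (illustrative only)."""
    if accelerate:
        # Transfer acceleration requires a bucket name without periods.
        if '.' in bucket:
            raise ValueError('accelerated endpoint needs a bucket name without dots')
        # The accelerated hostname is not region-specific.
        return f'{bucket}.s3-accelerate.amazonaws.com'
    return f'{bucket}.s3.{region}.amazonaws.com'


# Placeholder bucket and region names.
regular = s3_host('example-bucket', 'eu-west-3')
accelerated = s3_host('example-bucket', 'eu-west-3', accelerate=True)
```

With Boto the hostname does not need to be built by hand: the same switch is made through the client config, e.g. `Config(s3={'use_accelerate_endpoint': True})`.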

zachmullen · 2025-05-30T12:47:38Z

> To make this more optimized, I could perhaps use this accelerated endpoint only when Girder sends a transfer link to a user so their client can transfer data to S3, and not when Girder directly transfers data to S3?

I do think this is the right place to make the choice -- at the point of building the presigned URLs rather than bound to the lifetime of the assetstore. I see two possibilities:

1. We attempt to infer the right decision based on properties of the client, e.g. geolocation based on the client IP, and make the choice for them.
2. We allow the caller of the REST endpoint(s) to declare that they want to use accelerated transfer, moving the decision point to the client-side or the end users themselves (e.g. a checkbox to enable or disable it under "advanced options" or something).

I think option 2 is a much better idea, but am open to discussion.

pchoisel · 2025-06-02T12:03:42Z

I agree that option 2 is the best. However, there are some clients that I cannot change easily #girderwebcomponents
But that's fine, I'll just patch them a bit more. Inferring the location of the client using its IP address seems really shaky.

Do you think I should still add an assetstore option to enable accelerated transfer? That would make the endpoints return an error if it's disabled but the client requested an accelerated transfer.

zachmullen · 2025-06-02T12:40:43Z

Yes, we should probably only allow accelerated transfer if the assetstore explicitly allows it.
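
The rule agreed on above (the client asks, the assetstore must allow) could be sketched roughly as follows. This is not the code from this PR: the `allowAcceleration` field, the exception class, and the function name are all hypothetical, and the assetstore is modelled as a plain dict.

```python
class AccelerationNotAllowed(Exception):
    """The client asked for acceleration that the assetstore does not permit."""


def resolve_acceleration(assetstore, requested):
    """Decide whether a presigned URL should use the accelerated endpoint.

    assetstore: dict-like assetstore document (hypothetical shape)
    requested:  True if the REST caller asked for accelerated transfer
    """
    # Treat a missing flag as "not allowed" so existing assetstores keep
    # their current behaviour and costs.
    allowed = bool(assetstore.get('allowAcceleration', False))
    if requested and not allowed:
        raise AccelerationNotAllowed(
            'Accelerated transfer is not enabled on this assetstore.')
    # Never accelerate unless the caller explicitly asked for it.
    return requested


use_fast = resolve_acceleration({'allowAcceleration': True}, requested=True)
```

Making the decision each time a URL is built, rather than storing it once, matches the idea of choosing at the point of presigning instead of binding the choice to the lifetime of the assetstore.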

pchoisel · 2025-06-05T13:01:24Z

Here it is. I originally wanted to store the fact that an upload should be made with acceleration in the upload document, but I was worried about S3 usage being restricted in between the upload init and the chunk upload.

Also, using acceleration just for uploading chunks and using the regular URL for the rest of the requests (completion for example) works fine.

I changed the extraParameters arg of the download API from a param to a jsonParam. I couldn't find anything using it and I think it made more sense. Let me know if I should revert that back.

S3 buckets transfer acceleration is an AWS feature that speeds up data transfer from a client to an S3 bucket using CloudFront.

pchoisel · 2025-06-20T08:00:13Z

@manthey If you have some time, could you have a look?

manthey · 2025-06-20T19:43:46Z

girder/api/v1/file.py

@@ -254,8 +262,8 @@ def readChunk(self, upload, offset, params):
        .param('contentDisposition', 'Specify the Content-Disposition response '
               'header disposition-type value.', required=False,
               enum=['inline', 'attachment'], default='attachment')
-        .param('extraParameters', 'Arbitrary data to send along with the download request.',
-               required=False)
+        .jsonParam('extraParameters', 'Arbitrary data to send along with the download request.',


I'm not sure what the ramifications of changing from param to jsonParam will be. This switches the extra parameters from a string to a parsed JSON object. The only place I see it used before this is in downloads.
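
To make the concern concrete: code that used to receive a raw string would now receive an already-parsed object. One defensive option is a small normalizer that accepts both. It is a sketch under assumptions, not part of this PR; the function name and the `raw` fallback key are hypothetical.

```python
import json


def normalize_extra_parameters(value):
    """Return extraParameters as a dict whatever form it arrived in."""
    if value is None:
        return {}
    if isinstance(value, dict):
        # Already parsed, as a JSON-typed parameter would deliver it.
        return value
    if isinstance(value, str):
        try:
            parsed = json.loads(value)
        except json.JSONDecodeError:
            # Legacy callers may have sent arbitrary, non-JSON text.
            return {'raw': value}
        # Valid JSON that is not an object is kept as opaque text too.
        return parsed if isinstance(parsed, dict) else {'raw': value}
    raise TypeError(f'unsupported extraParameters type: {type(value).__name__}')


params = normalize_extra_parameters('{"useAcceleration": true}')
```

The main behavioural difference to watch for is a caller that sends a plain, non-JSON string: it was accepted as a string parameter but would fail JSON parsing.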

pchoisel force-pushed the add_option_to_use_S3_accelerated_endpoint branch from 3164587 to 8e3966b Compare May 26, 2025 14:29

pchoisel force-pushed the add_option_to_use_S3_accelerated_endpoint branch 2 times, most recently from e42bb5c to 04d9618 Compare June 5, 2025 12:57

Allow clients to request S3 accelerated transfers

a902a13

S3 buckets transfer acceleration is an AWS feature that speeds up data transfer from a client to an S3 bucket using CloudFront.

pchoisel force-pushed the add_option_to_use_S3_accelerated_endpoint branch from 04d9618 to a902a13 Compare June 12, 2025 07:57

manthey reviewed Jun 20, 2025
