43

I am trying to find a solution to stream files to Amazon S3 from a Node.js server, with these requirements:

  • Don't store a temp file on the server or the complete file in memory; buffering up to some limit (but not the whole file) is acceptable during upload.
  • No restriction on uploaded file size.
  • Don't block the server until the complete file is uploaded, because with a heavy file upload the waiting time of other requests would increase unexpectedly.

I don't want to use direct file upload from the browser, because in that case the S3 credentials would need to be shared with the client. Another reason to upload the file from the Node.js server is that some authentication may also need to be applied before uploading.

I tried to achieve this using node-multiparty, but it was not working as expected. You can see my solution and the issue at https://github.com/andrewrk/node-multiparty/issues/49. It works fine for small files but fails for a file of size 15 MB.

Any solution or alternative?

6 Answers

46

You can now use streaming with the official Amazon SDK for Node.js; see the section "Uploading a File to an Amazon S3 Bucket" in its documentation, or see their example on GitHub.

What's even more awesome, you can finally do so without knowing the file size in advance. Simply pass the stream as the Body:

var AWS = require('aws-sdk');
var fs = require('fs');
var zlib = require('zlib');

// Stream the file through gzip; the total size is not known in advance.
var body = fs.createReadStream('bigfile').pipe(zlib.createGzip());
var s3obj = new AWS.S3({params: {Bucket: 'myBucket', Key: 'myKey'}});
s3obj.upload({Body: body})
  .on('httpUploadProgress', function(evt) { console.log(evt); })
  .send(function(err, data) { console.log(err, data); });

6 Comments

This isn't working with my output stream from a yazl zip object?
Brilliant! You can also pipe Buffers to zlib.createGzip() by transforming them into a Stream: const { Duplex } = require('stream');
Does anyone know how this works? If each part is a fixed size, how do they fill in the last part if it doesn't exactly match the full size?
Can you update the link Johann? It appears to have changed.
@anon58192932 thanks for catching that, the link is now updated!
8

For your information, the v3 SDK was published with a dedicated module to handle this use case: https://www.npmjs.com/package/@aws-sdk/lib-storage

Took me a while to find it.
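
For reference, a minimal sketch of how this module is typically used (the bucket, key, region, and file name below are placeholders, not values from this thread):

const { S3Client } = require('@aws-sdk/client-s3');
const { Upload } = require('@aws-sdk/lib-storage');
const fs = require('fs');

async function uploadStream() {
  const upload = new Upload({
    client: new S3Client({ region: 'us-east-1' }), // placeholder region
    params: {
      Bucket: 'myBucket',
      Key: 'myKey',
      Body: fs.createReadStream('bigfile'), // any readable stream works; size not needed
    },
    queueSize: 4,              // number of parts uploaded concurrently
    partSize: 5 * 1024 * 1024, // 5 MB, the S3 minimum part size
  });

  upload.on('httpUploadProgress', (progress) => console.log(progress));

  return upload.done();
}

uploadStream().then(console.log).catch(console.error);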

1 Comment

I ran into issues with this where the stream passed in is transformed into a GeoJSON feature collection.
2

Give https://www.npmjs.org/package/streaming-s3 a try.

I used it for uploading several big files in parallel (> 500 MB), and it worked very well. It is very configurable and also allows you to track upload statistics. You don't need to know the total size of the object, and nothing is written to disk.


1

If it helps anyone, I was able to stream from the client to S3 successfully (without memory or disk storage):

https://gist.github.com/mattlockyer/532291b6194f6d9ca40cb82564db9d2a

The server endpoint assumes req is a stream object. I sent a File object from the client, which modern browsers can send as binary data, with the file info set in the headers.

const fileUploadStream = (req, res) => {
  //get "body" args from header
  const { id, fn } = JSON.parse(req.get('body'));
  const Key = id + '/' + fn; //upload to s3 folder "id" with filename === fn
  const params = {
    Key,
    Bucket: bucketName, //set somewhere
    Body: req, //req is a stream
  };
  s3.upload(params, (err, data) => {
    if (err) {
      res.send('Error Uploading Data: ' + JSON.stringify(err) + '\n' + JSON.stringify(err.stack));
    } else {
      res.send(Key);
    }
  });
};

Yes, putting the file info in the headers breaks convention, but if you look at the gist it's much cleaner than anything else I found using streaming libraries, multer, busboy, etc.
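
For illustration, a hypothetical client-side counterpart could look like the sketch below; the endpoint path and the "body" header layout are assumptions inferred from the server code above, not taken from the gist:

// Hypothetical client-side upload (endpoint path is an assumption).
async function uploadToServer(file, id) {
  const res = await fetch('/upload', {
    method: 'POST',
    // File info goes in a custom "body" header, matching req.get('body') on the server.
    headers: { body: JSON.stringify({ id, fn: file.name }) },
    body: file, // browsers send a File/Blob as the binary request body
  });
  return res.text(); // the endpoint responds with the S3 Key on success
}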

+1 for pragmatism and thanks to @SalehenRahman for his help.


0

I'm using the s3-upload-stream module in a working project here.
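
For anyone who wants a starting point, here is a minimal sketch of how s3-upload-stream is typically wired up (the bucket, key, and file name are placeholders):

var AWS = require('aws-sdk');
var fs = require('fs');
var s3Stream = require('s3-upload-stream')(new AWS.S3());

// upload() returns a writable stream that performs a multipart upload to S3.
var upload = s3Stream.upload({ Bucket: 'myBucket', Key: 'myKey' });

upload.on('error', function (err) { console.log(err); });
upload.on('part', function (details) { console.log(details); });     // progress per uploaded part
upload.on('uploaded', function (details) { console.log(details); }); // multipart upload complete

fs.createReadStream('bigfile').pipe(upload);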

There are also some good examples from @raynos in his http-framework repository.


0

Alternatively, you can look at https://github.com/minio/minio-js. It has a minimal set of abstracted APIs implementing the most commonly used S3 calls.

Here is an example of a streaming upload.

$ npm install minio
$ cat >> put-object.js << EOF

var Minio = require('minio')
var fs = require('fs')

// find out your s3 end point here:
// http://docs.aws.amazon.com/general/latest/gr/rande.html#s3_region

var s3Client = new Minio({
  url: 'https://<your-s3-endpoint>',
  accessKey: 'YOUR-ACCESSKEYID',
  secretKey: 'YOUR-SECRETACCESSKEY'
})

// Read the local file as a stream and pass its size along with the stream.
var file = 'your_localfile.zip'
var fileStream = fs.createReadStream(file)

fs.stat(file, function(e, stat) {
  if (e) {
    return console.log(e)
  }
  s3Client.putObject('mybucket', 'hello/remote_file.zip', 'application/octet-stream', stat.size, fileStream, function(e) {
    return console.log(e) // should be null
  })
})
EOF

putObject() here is a fully managed single function call; for file sizes over 5 MB it automatically does a multipart upload internally. You can resume a failed upload as well, and it will start from where it left off by verifying the previously uploaded parts.

Additionally, this library is isomorphic and can be used in browsers as well.

2 Comments

Can this library stream-upload a file from an uploading user, instead of me having to buffer it on my server first (whether in memory or on disk)?
It takes an input stream, which can be a file stream or any other stream. It will upload to the server automatically until the stream closes.
