make producer.Kinesis respect api limits #140
relud wants to merge 1 commit into trivago:v0.4.x-dev from relud:kinesis_respect_hard_limits
Conversation
If I remember correctly, the rate limits were indirectly enforced by the chosen batch size.
it's true that rate limits can be enforced using batch size, but if gollum message size is unpredictable, and RecordMaxMessages is high enough, then some kinesis records may exceed the 1MB record size limit
Ah, ok - that's correct.
```go
recordMaxMessages: recordMaxMessages,
recordMaxSize:     1 << 20, // 1 MB per record is the api limit
requestMaxSize:    5 << 20, // 5 MB per request is the api limit
requestMaxRecords: 500,     // 500 records per request is the api limit
```
Are those negotiable?
If not they should be constants.
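A minimal sketch of what declaring them as package-level constants could look like (the constant names here are hypothetical, not from the PR; the limit values are the documented Kinesis API limits):

```go
package main

import "fmt"

// Hypothetical constant names; the limits themselves are fixed by the
// Kinesis PutRecords API and cannot be raised.
const (
	kinesisMaxBytesPerRecord    = 1 << 20 // 1 MB per record
	kinesisMaxBytesPerRequest   = 5 << 20 // 5 MB per PutRecords request
	kinesisMaxRecordsPerRequest = 500     // 500 records per PutRecords request
)

func main() {
	fmt.Println(kinesisMaxBytesPerRecord, kinesisMaxBytesPerRequest, kinesisMaxRecordsPerRequest)
}
```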
```diff
-	content  *kinesis.PutRecordsInput
-	original [][]*core.Message
+	content  []*kinesis.PutRecordsInput
+	original [][][]*core.Message
```
It is very confusing to have a 3D array here.
If I see this correctly you have a relationship between requests and originals.
So why don't you batch this together to get a clearer context?
- StreamData may have n requests, limited by requestMaxRecords
- each request may have n originals (and n records) limited by requestMaxSize
- each message is limited by recordMaxSize
So it should be something like

```go
type streamRequest struct {
	input    *kinesis.PutRecordsInput
	original []core.Message
}

type streamData struct {
	// ...
	requests []streamRequest
}
```

This should also make the AddRecord function much clearer, as the constraints now get a typed context.
```
@@ -115,9 +115,101 @@ const (
)

type streamData struct {
```
This type and related function should be moved to a separate file.
```go
func (sd *streamData) AddMessage(delimiter []byte, data []byte, streamID core.MessageStreamID, msg *core.Message) {
	record := sd.GetRecord()
	if len(record.Data) > 0 {
```
A benefit from encapsulating the request in an extra type is that you can store the size for each request in there, too. This will make the check for "am I too large" a simple

```go
func (sr streamRequest) OverLimit(data, delimiter []byte) bool {
	dataSize := len(data)
	if len(sr.records) > 0 {
		dataSize += len(delimiter)
	}
	return sr.currentSize+dataSize > sr.limit
}

// ...
// func (sd *streamData) AddMessage ... {
record := sd.GetCurrentRecord()
if record.OverLimit(data, delimiter) {
	record = sd.AddNewRecord(/* ... */)
}
record.AddMessage(/* ... */)
```

This will make it a lot clearer to understand and a lot less error-prone, as you don't have to expose internal state.
E.g. it was very confusing for me why you check record.Data twice, even though it wasn't directly visible where it is modified.
I'm no longer able to work on this.
@relud Thanks for letting us know. Do you still think it would make sense to integrate into Gollum?
yes, I definitely think it makes sense. these limits are hard limits that aws doesn't allow limit increases for.
Okay, thank you. I reopened the PR. I can't promise anything in terms of timeline, but we will look into it when we can. Thank you for your effort.
The Kinesis API has hard limits: no more than 500 records or 5 MB per PutRecords call, and no more than 1 MB per record.
This change makes sure that gollum respects those three limits by making multiple PutRecords calls when needed, and splitting data into more records when needed.
The result is that BatchMaxMessages and RecordMaxMessages can safely be set to higher values that would previously have caused errors.
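The three limits described above can be sketched as a standalone batching routine. This is a simplified illustration, not the PR's actual implementation: the function name and data layout are assumptions, and a single message larger than the record limit is deliberately not handled here (the real producer would have to drop or reroute it).

```go
package main

import "fmt"

const (
	maxRecordSize     = 1 << 20 // 1 MB per record (API limit)
	maxRequestSize    = 5 << 20 // 5 MB per PutRecords request (API limit)
	maxRequestRecords = 500     // records per PutRecords request (API limit)
)

// batchMessages packs delimiter-joined messages into records of at most
// maxRecordSize bytes, and records into requests of at most
// maxRequestRecords records and maxRequestSize bytes.
func batchMessages(msgs [][]byte, delimiter []byte) [][][]byte {
	var requests [][][]byte // each request is a slice of records
	var request [][]byte
	var record []byte
	requestSize := 0

	flushRecord := func() {
		if len(record) == 0 {
			return
		}
		// Start a new request if this record would break either request limit.
		if len(request) >= maxRequestRecords || requestSize+len(record) > maxRequestSize {
			requests = append(requests, request)
			request, requestSize = nil, 0
		}
		request = append(request, record)
		requestSize += len(record)
		record = nil
	}

	for _, msg := range msgs {
		needed := len(msg)
		if len(record) > 0 {
			needed += len(delimiter)
		}
		// Close the current record if this message would exceed the record limit.
		if len(record)+needed > maxRecordSize {
			flushRecord()
		}
		if len(record) > 0 {
			record = append(record, delimiter...)
		}
		record = append(record, msg...)
	}
	flushRecord()
	if len(request) > 0 {
		requests = append(requests, request)
	}
	return requests
}

func main() {
	// Twelve 600 KB messages: each fills its own record (two would exceed
	// 1 MB), and at most eight 600 KB records fit under the 5 MB request limit.
	msgs := make([][]byte, 12)
	for i := range msgs {
		msgs[i] = make([]byte, 600000)
	}
	for _, req := range batchMessages(msgs, []byte("\n")) {
		fmt.Println(len(req), "records")
	}
}
```

The same splitting decisions apply regardless of how large BatchMaxMessages or RecordMaxMessages are set, which is what makes raising those options safe.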