Add check for degenerate padded case in decode #33

emilypi · 2020-04-24T15:51:04Z

This removes the odd inconsistency between failure modes for URL where degenerate inputs like ZE= pass decode, but not decodeUnpadded and decodePadded. Addresses #35

before:

П> U.decode "ZE="
Right "d"
П> U.decodeUnpadded  "ZE="
Left "Base64-encoded bytestring required to be unpadded"
П> U.decodePadded  "ZE="
Left "Base64-encoded bytestring required to be padded"

after (note the subtlety in the messages returned):

П> U.decode "ZE="
Left "Base64-encoded bytestring has invalid padding"
П> U.decodeUnpadded  "ZE="
Left "Base64-encoded bytestring required to be unpadded"
П> U.decodePadded  "ZE="
Left "Base64-encoded bytestring has invalid padding"
П> U.decode "ZE=="
Right "d"
П> U.decodeUnpadded  "ZE=="
Left "Base64-encoded bytestring required to be unpadded"
П> U.decodePadded  "ZE=="
Right "d"
П> U.decode "ZE"
Right "d"
П> U.decodeUnpadded  "ZE"
Right "d"
П> U.decodePadded  "ZE"
Left "Base64-encoded bytestring required to be padded"

TODO:

property tests for interplay between decodes

Data/ByteString/Base64/Internal.hs

add better padding validation + note about validation strategy

emilypi · 2020-05-29T18:21:42Z

@hvr @23Skidoo i think i've settled on an error msg reporting scheme I'm okay with. Could you guys give a review?

emilypi · 2020-05-29T18:35:57Z

Data/ByteString/Base64/Internal.hs

-       | otherwise -> err "Base64-encoded bytestring has invalid size"
+       | r == 0 -> validateLastPad bs noPad $ go bs
+       | r == 2 -> validateLastPad bs noPad $ go (B.append bs (B.replicate 2 0x3d))
+       | r == 3 -> validateLastPad bs noPad $ go (B.append bs (B.replicate 1 0x3d))


It's all written down in the $Validation note, but just to recap:

Let bs be a bytestring of length l. Then the following properties hold:

l == 0 mod 4: The input bytestring is assumed to be well-formed. This will always be the expected case for padded Base64 and Base64url values, or for unpadded Base64url values that happen to have a pre-encoded length multiple of 6. In any case, these will go through the standard decode routine, and any existing padding chars will be validated in the final quanta (see: finalChunk).

l == 1 mod 4: This is never a valid length for Base64 or Base64url-encoded values. The specification requires that the unpadded length of the encoded string be l == 0 mod 4, l == 2 mod 4, or l == 3 mod 4. There will never be a valid unpadded input of length l == 1 mod 4 as a result. This can be rejected outright.

l == 2 mod 4: In this case, two padding chars must appear in the final quanta. If any additional padding chars exist in the string, then they will fail as final quanta, as we require the final four bytes (say, (a b '=' '=')) to have that a /= '=' and b /= '='. Additional pads will fail that clause of finalChunk. Thus, it's safe to add 2 padding chars to the end of a supposedly unpadded input of length l == 2 mod 4, since the addition will never form a well-formed input if the unpadded string is already malformed.

l == 3 mod 4: This is the only tricky case. When inputs have this length, then we expect that that adding padding chars will result in the form (a b c '='). However, if the unpadded input has '=' in the c position, it is possible that adding padding chars to the string "completes" the input in the sense that it forms a valid input where the unpadded fragment can be seen as a bytestring of length l == 2 mod 4. This could potentially be an attack vector, and constitutes a security risk. Thankfully, this is also easy to check, since, we only need to validate that the last char of an unpadded bytestring of length l == 3 mod 4 is not '='. If any additional padding chars are present, then there is no risk that they will contribute to a well-formed input, since they will fail as final quanta in the a and b positions. So really, the requirement with padding bytestrings of length l == 3 mod 4 is that they are of the form (a b c '='), c /= '=' after padding.

emilypi · 2020-06-04T23:17:08Z

@23Skidoo and @hvr thoughts?

23Skidoo

As far as I can tell, changes in this PR look good.

emilypi · 2020-06-05T02:08:09Z

Thanks @23Skidoo, merging.

emilypi changed the title ~~Add check for degenerate padded case in decode~~ [WIP] Add check for degenerate padded case in decode Apr 24, 2020

emilypi changed the title ~~[WIP] Add check for degenerate padded case in decode~~ Add check for degenerate padded case in decode May 14, 2020

emilypi requested review from 23Skidoo and hvr May 28, 2020 19:59

emilypi commented May 28, 2020

View reviewed changes

Data/ByteString/Base64/Internal.hs Show resolved Hide resolved

add check for degenerate padded case in decode

5d36588

add better padding validation + note about validation strategy

emilypi force-pushed the emily/check-decode-padding branch from d2402e1 to 5d36588 Compare May 28, 2020 22:31

emilypi added 4 commits May 28, 2020 22:30

small amendment

d5e4407

fix messages

0f6fd65

fix messages

204d2ae

nopad in the unpadded case

d8dc4c3

emilypi commented May 29, 2020

View reviewed changes

23Skidoo reviewed Jun 5, 2020

View reviewed changes

emilypi merged commit 593ed26 into master Jun 5, 2020

emilypi deleted the emily/check-decode-padding branch June 5, 2020 02:08

This was referenced Jun 5, 2020

Add Head validations for correct padding #35

Closed

Refactor and Expand Test Coverage #34

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add check for degenerate padded case in decode #33

Add check for degenerate padded case in decode #33

Uh oh!

emilypi commented Apr 24, 2020 •

edited

Loading

Uh oh!

Uh oh!

emilypi commented May 29, 2020

Uh oh!

emilypi May 29, 2020 •

edited

Loading

Uh oh!

emilypi commented Jun 4, 2020

Uh oh!

23Skidoo left a comment

Uh oh!

emilypi commented Jun 5, 2020

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Add check for degenerate padded case in decode #33

Add check for degenerate padded case in decode #33

Uh oh!

Conversation

emilypi commented Apr 24, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

emilypi commented May 29, 2020

Uh oh!

emilypi May 29, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

emilypi commented Jun 4, 2020

Uh oh!

23Skidoo left a comment

Choose a reason for hiding this comment

Uh oh!

emilypi commented Jun 5, 2020

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

emilypi commented Apr 24, 2020 •

edited

Loading

emilypi May 29, 2020 •

edited

Loading