"base64 decode assumes input lines are in multiples of 4 #96" #99

vandys · 2023-02-22T18:52:39Z

Convert the base64 implementation from stateless to stateful, so you can feed in successive lines and get back the original binary. The previous stateless implementation assumed that no residual input is left over after processing a single line, which the base64-in-email standards do not guarantee (and this was, in fact, found while processing exports from Tutanota).
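
To illustrate the idea of carrying residual bits from one line to the next, here is a minimal sketch of a stateful decoder. It is not the code from this patch: `struct b64_state`, `b64_init`, `b64_value`, and `b64_feed` are hypothetical names chosen for the example, and the sketch simply skips any byte outside the base64 alphabet.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical decoder state: bits carried over between calls. */
struct b64_state {
    unsigned int acc;   /* accumulated bits not yet emitted */
    int nbits;          /* number of valid bits in acc (0, 2, 4, or 6 between calls) */
    int done;           /* set once '=' padding has been seen */
};

static void b64_init(struct b64_state *st)
{
    assert(st != NULL);
    memset(st, 0, sizeof(*st));
}

/* Map one base64 character to its 6-bit value, or -1 if it is not in the alphabet. */
static int b64_value(int c)
{
    if (c >= 'A' && c <= 'Z') return c - 'A';
    if (c >= 'a' && c <= 'z') return c - 'a' + 26;
    if (c >= '0' && c <= '9') return c - '0' + 52;
    if (c == '+') return 62;
    if (c == '/') return 63;
    return -1;
}

/*
 * Decode one input line into out and return the number of bytes written.
 * out must have room for at least strlen(line) bytes.
 */
static size_t b64_feed(struct b64_state *st, const char *line, unsigned char *out)
{
    size_t n = 0;

    assert(st != NULL && line != NULL && out != NULL);
    for (; *line != '\0'; line++) {
        int v;

        if (st->done)
            break;                      /* ignore anything after padding */
        if (*line == '=') {
            st->done = 1;
            break;
        }
        v = b64_value((unsigned char)*line);
        if (v < 0)
            continue;                   /* skip CR, LF, and other non-alphabet bytes */
        st->acc = (st->acc << 6) | (unsigned int)v;
        st->nbits += 6;
        if (st->nbits >= 8) {
            st->nbits -= 8;
            out[n++] = (unsigned char)((st->acc >> st->nbits) & 0xFFu);
            st->acc &= (1u << st->nbits) - 1u;   /* keep only the unconsumed bits */
        }
    }
    return n;
}
```

With this sketch, feeding `"SGVsb"` and then `"G8="` through the same state produces `Hel` followed by `lo`, even though neither line is a multiple of 4 characters long.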

jkbzh · 2023-05-15T16:25:12Z

@vandys I'm reviewing your code; you don't need to update anything.

According to the man page, bzero has been deprecated in favor of memset, so I reverted that specific change and removed the strings.h include.

Since base64 is so common in messages and headers, and hypermail does not process message parts or headers in parallel, I wonder whether it's worth allocating and freeing the state structure each time we get a new part. Wouldn't it be better to use a single variable in fixed memory instead of a pointer to allocated memory?

What do you think?

vandys · 2023-05-15T18:23:31Z

Yes, bcopy/bzero are bad old habits from my BSD days!
If you keep a single state, I'd still recommend making sure to reset it at the points where its use starts and ends. I'm sure at some point a malformed base64 part will try to bleed from one attachment into another.
OTOH, it's hard to imagine the processing of an average base64 attachment not entirely swamping the small overhead of an allocation and a free. Either seems fine to me!
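
The single-state option discussed here could look roughly like the following sketch, which resets one fixed state at both the start and the end of each part. All names (`g_b64`, `b64_reset`, `part_begin`, `part_end`) are hypothetical and are not taken from hypermail.

```c
#include <assert.h>
#include <string.h>

/* Hypothetical carry-over state of a streaming base64 decoder. */
struct b64_state {
    unsigned int acc;   /* accumulated bits not yet emitted */
    int nbits;          /* number of valid bits in acc */
    int done;           /* set once '=' padding has been seen */
};

/* One fixed instance instead of a heap allocation per MIME part. */
static struct b64_state g_b64;

static void b64_reset(struct b64_state *st)
{
    assert(st != NULL);
    memset(st, 0, sizeof(*st));
}

/* Call when a base64 part starts, so no residue leaks in from an earlier part. */
static void part_begin(void)
{
    b64_reset(&g_b64);
}

/*
 * Call when the part ends. Returns how many undecoded bits were discarded,
 * then clears the state so nothing leaks into the next part.
 */
static int part_end(void)
{
    int leftover = g_b64.nbits;

    b64_reset(&g_b64);
    return leftover;
}
```

Resetting on both sides is redundant when every part is well formed, but it keeps a part that ends mid-group from affecting whatever is decoded next.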

jkbzh · 2023-05-15T20:30:52Z

src/parse.c

 		    free(data);	/* this was allocatd by mdecodeQP() */
+		}
+		/*


As you saw, lines 3435-3440 cause a SIGSEGV with your base64 changes because they free the memory right after it is allocated.

That free there is only meant for the QP decoder. It decodes everything in the single call to the decoder and the data variable is freed afterwards.

No need to change anything.
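
The ownership rule described in this comment can be sketched as follows: a buffer is freed only when it came from a one-shot decoder call. This is an illustration under assumed names (`enum encoding`, `decode_one_shot`, `release_if_owned`); it does not reproduce the code in src/parse.c or the signature of mdecodeQP().

```c
#include <assert.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical encoding tags for the example. */
enum encoding { ENC_QP, ENC_BASE64 };

/* Hypothetical one-shot decoder: returns a fresh heap buffer owned by the caller. */
static char *decode_one_shot(const char *line)
{
    size_t len = strlen(line) + 1;
    char *copy = malloc(len);

    if (copy != NULL)
        memcpy(copy, line, len);
    return copy;
}

/*
 * Free *data only when the one-shot path allocated it.
 * Returns 1 if the buffer was freed, 0 if it was left alone.
 */
static int release_if_owned(enum encoding enc, char **data)
{
    assert(data != NULL);
    if (enc != ENC_QP || *data == NULL)
        return 0;
    free(*data);
    *data = NULL;   /* guard against a later double free */
    return 1;
}
```

Tying the free to the encoding keeps a streaming decoder's buffer alive across calls while still releasing the one-shot result.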

jkbzh · 2023-05-16T14:12:39Z

@vandys Thank you for your excellent patch. All tests I wrote for it worked well.

I edited it a bit (function and state structure names, fields, ...) and manually merged it into the 3.0 branch (2bd8ed5).

I made one small fix: the place where you were freeing the state structure only worked in some cases, so I added that release code in the two places where it should have gone.

Other than those small changes, I didn't need to change anything.

jkbzh reviewed May 15, 2023

jkbzh closed this May 16, 2023

Brunojoes69 approved these changes Jun 4, 2024
