How do I read an email log that captures bodies in different combinations of encodings plus underlying format?

I have a log of spam emails, in JSON, collected from various sources. The goal is to convert it all to plaintext to serve as a training corpus for a machine learning exercise. The mail subject, etc., is plaintext. Bodies, however, are encoded…
As…

Samsung’s Leap Month Bug Teaches Not to Skimp On Testing

Date and time handling is hard; that’s an ugly truth about software development we’ll all learn the hard way one day. Sure, it might seem like some trivial everyday thing that you can easily implement yourself without relying on a third-party library. I mean, it’s basically just adding seconds on …


Day Clock Monitors Air Quality of the Great Indoors

As the world settles into this pandemic, some things are still difficult to mentally reckon, such as the day of the week. We featured a printed day clock a few months ago that used a large pointer to provide this basic psyche-grounding information. In the years since then, [Jeff Thieleke] …


MessagePack Is A More Efficient JSON

It is an age-old problem: you have some data you want to store somewhere and later bring back. How do you format the data? Custom file formats are not that hard, but if you use an existing format you can probably steal code from a library to help …
