Mark Rober, who is great at explaining and demonstrating math and engineering to a wide audience, gets into the gist of machine learning in his latest video:
-
The day-to-day changes a lot when you have kids. However, it seems to change more for women than it does for men.
-
I have a feeling we’re in for a lot of manipulated videos as we get closer to the election. The Washington Post provides a guide for the different types. I hope they keep building on this with a guide on how to spot the fakes, but as they say, knowing is half the battle.
-
Members Only
-
In some public places, such as schools and hospitals, microphones installed with software listen for noise that sounds like aggression. The systems alert the authorities. It sounds useful, but in practice, the detection algorithms might not be ready yet. For ProPublica, Jack Gillum and Jeff Kao did some testing:
Yet ProPublica’s analysis, as well as the experiences of some U.S. schools and hospitals that have used Sound Intelligence’s aggression detector, suggest that it can be less than reliable. At the heart of the device is what the company calls a machine learning algorithm. Our research found that it tends to equate aggression with rough, strained noises in a relatively high pitch, like D’Anna’s coughing. A 1994 YouTube clip of abrasive-sounding comedian Gilbert Gottfried (“Is it hot in here or am I crazy?”) set off the detector, which analyzes sound but doesn’t take words or meaning into account. Although a Louroe spokesman said the detector doesn’t intrude on student privacy because it only captures sound patterns deemed aggressive, its microphones allow administrators to record, replay and store those snippets of conversation indefinitely.
Marvelous.
-
In the biggest crossover event of the century, Tom Lum used the Wikipedia API to chart the number of views for every reference in Billy Joel’s We Didn’t Start the Fire. Yes. [via @waxpancake]
-
Sonja Kuijpers used abstract imagery to represent some sobering numbers:
You might be wondering what you are viewing here. This landscape, each element in it represents a person who committed suicide in the Netherlands in the year 2017.
The cityscape leads into more traditional views, which in turn feel much more heavy.
-
For The New York Times, Kevin Roose on the possibility of machines becoming your boss:
The goal of automation has always been efficiency, but in this new kind of workplace, A.I. sees humanity itself as the thing to be optimized. Amazon uses complex algorithms to track worker productivity in its fulfillment centers, and can automatically generate the paperwork to fire workers who don’t meet their targets, as The Verge uncovered this year. (Amazon has disputed that it fires workers without human input, saying that managers can intervene in the process.) IBM has used Watson, its A.I. platform, during employee reviews to predict future performance and claims it has a 96 percent accuracy rate.
Splendid.
-
Ben Fry using the “tropiest of design tropes”, describes his goals for visualization. On communication:
Communication is the most basic part: the table stakes of information design. If the piece doesn’t communicate, then it’s useless. A lot of time, attention, and effort goes into creating PDF documents that few people will ever read. In the past this was even more expensive, because the result was elegant printed reports that would go straight to someone’s shelf. If you create something that nobody uses, you’ve failed to communicate.
-
You know those sped up videos where there’s a long line for something and someone walks the length of it? The New York Times did the scrolly equivalent for the recent Hong Kong protest, using snaps from aerial video and stringing them together geographically. A lot of people showed up.
-
Members Only
-
A small discrepancy in a couple of shapefiles led to a misclassification of land. Wealthy investors are taking advantage. For ProPublica, Jeff Ernsthausen and Justin Elliott:
They have President Donald Trump’s 2017 tax overhaul law to thank. The new law has a provision meant to spur investment into underdeveloped areas, called “opportunity zones.” The idea is to grant lucrative tax breaks to encourage new investment in poor areas around the country, carefully selected by each state’s governor.
But Port Covington, an ambitious development geared to millennials to feature offices, a hotel, apartments, and shopping, is not in a census tract that is poor. It’s not a new investment. And the census tract only became eligible to be an opportunity zone thanks to a mapping error.
-
The single-family home. It’s part of the American dream, but it can be awfully expensive when land grows scarce. Emily Badger and Quoctrung Bui for The Upshot map and discuss the current approaches of major cities:
A reckoning with single-family zoning is necessary, they say, amid mounting crises over housing affordability, racial inequality and climate change. But take these laws away, many homeowners fear, and their property values and quality of life will suffer. The changes, opponents in Minneapolis have warned, amount to nothing less than an effort to “bulldoze” their neighborhoods.
Using building footprints instead of just colored polygons provide nice depth and detail.
-
D3.js is one of the most popular JavaScript libraries for creating web-based data visualisations. It has a unique and powerful way of working but a steep learning curve. Most charting libraries provide a selection of ready made charts such as bar, line and pie charts while D3 offers building blocks from which you construct a bespoke chart.
This tutorial will get you started with D3 and shows you how a custom chart can be made. Along the way you’ll learn some of the key features of D3 such as loading CSV data into the browser, adding SVG elements, data-driven styling and user interaction.
-
Researchers went around rural Pennsylvania, showing people the charts above and asking what they thought about them. Evan Peck with one of the main findings from the study:
As we analyzed and coded our interviews, we were reminded of something that we often forget — data can be intimate and personal. If someone found a personal connection to any graph, it didn’t matter the color, the style or the technique. For the people we talked to, charts with personal connections superseded all other design dimensions.
Small data caveat: there were only 42 interviews.
But still, ever since I got into visualization for an audience, this was always my thought. Some say people are self-centered so readers want to know about themselves first. Maybe that’s partially true, but I also think we just like to start with what’s familiar.
So, if the data allows for it, make that familiarity and quick connection to the data easy to access.
-
Financial Times, in an effort to streamline a part of the data journalism process, developed templates for data stories. They call it the Story Playbook:
The Playbook is also an important driver of culture change in the newsroom. We have a rich and familiar vocabulary for print: The basement (A sometimes light-hearted, 350-word story that sits below the fold on the front page), for example, or the Page 3 (a 900–1200 word story at the top of the third page that is the day’s most substantive analysis article). For FT journalists, catflaps, birdcages, and skylines need no explanation.
The story playbook creates the equivalent for online stories, by introducing a vocabulary that provides a shared point of reference for everyone in the newsroom.
-
I compared time use for those with children under 18 against those without. Here’s where the minutes go.
-
Got a chuckle out of me:
Me explaining why standardizing your variables is important: pic.twitter.com/mQKj0nEJ0G
— Chelsea Parlett Pelleriti (@ChelseaParlett) June 12, 2019
-
Members Only
-
For The New York Times, Kevin Litman-Navarro plotted the length and readability of privacy policies for large companies:
To see exactly how inscrutable they have become, I analyzed the length and readability of privacy policies from nearly 150 popular websites and apps. Facebook’s privacy policy, for example, takes around 18 minutes to read in its entirety – slightly above average for the policies I tested.
The comparison is between websites with a focus on Facebook, but the main takeaway I think is that almost all privacy policies are complex, because they’re not there for the users.