Data Science Central Weekly Digest, 7 Dec 2020

Author: Kurt Cagle

Real Users Rate Top 8 Data Prep Tools
Data Science Central Weekly Digest, 7 Dec 2020
Welcome to the DSC Weekly Digest, a production of Data Science Central. Every week, we pick out the newest and brightest articles on the topic of data … and how it informs our lives, powers our applications, and provides insights into our world.

An old fashioned editor's desk with DSC logos
From the Editor’s Desk
One of the key concepts in data science (and indeed in physics) is the nature of noise and randomness. The two are connected, though not the same thing. Randomness is an abstract concept that can be described in any number of ways, but a useful way of thinking about is that when two variables are completely uncorrelated to one another, a phase space diagram of the two will ultimately fill a unit square if the variables are normalized.
True randomness is surprisingly rare. Noise, on the other hand, is not. Noise is seldom truly random, but instead, can be thought of as the signals of the universe that are not of interest to you. Sometimes that noise can be treated as random (television static is a good example, as that signal may be everything from the cosmic background radiation to interactions of the Earth’s electromagnetic fields to … who knows, ghosts in the machine, so to speak. A surprising amount of such noise comes in fact from recursive fractals that seem to fill nature, and may in fact be artifacts of any complex system. There’s a growing amount of evidence to suggest that contexts are themselves fractals, which is perhaps why AI keeps bumping up against the fractal walls.
One of the roles of most data scientists and data analysts is to extract the signal from the noise, to understand what is relevant from what is not. Neural Networks, when you get right down to it, exist primarily to boost signals – to both find and enhance the signals from noise, and to then use that to predict or verify behavior based upon known signals.
Vincent Granville writes about randomness in this issue of the DSC Weekly Digest. Ajit Jaokar attempts to parse the signal from the noise as well in the first of several explorations of what 2021 looks likely to bring in the field of AI, while Kirsten Lloyd digs into the rise of Model Ops , a synthesis of DevOps and Modeling, as a way to make navigating through that noise easier for modern companies. These are just a few of the articles new this week in Data Science Central.  
Data Science Central is your community. It is a chance to learn from other practitioners, and a chance to communicate what you know to the data science community overall. I encourage you to submit original articles and to make your name known to the people that are going to be hiring in the coming year. As always let us know what you think.

Announcements

With data analytics, consistency is key. Alarmingly, 64% percent of survey respondents said their analytic results are inconsistent, mainly because of an inability to reuse and share work across teams. Aginity solves this challenge by connecting leading analytic databases to help teams share and reuse SQL with consistent results. Join thousands of analysts using Aginity Pro. Download it free today.


DSC Featured Articles

    Tech Target Articles

    Go to Source