What today’s machine learning and AI is and is not

Today’s artificial intelligence methods are built around machine learning models and use some form of sophisticated correlation or association, which can be likened to brute-force robot learning. They reverse engineer existing features and patterns to provide useful “forward engineering” solutions such as:

  • Self-driving cars
  • Detecting diseases from X-rays/MRIs
  • Robots in manufacturing
  • Chatbots for customer service
  • Insert your future application here

The logic is that if event X follows event Y, or occurs together with Y, in historical or simulated data, we can build automated models around that relationship and use them to predict an unknown object, variable or situation, and even to prescribe actions. We are also making progress in explaining, to some extent, how the models arrive at their predictions. Still, today’s machine learning is about figuring out the “what” in images, speech, numbers, translation and text. It doesn’t address the “why” question. The “what” can work if the environment in which training happened also occurs, at least in context and to some extent, during prediction. The question so far has been, “Can we figure out all the ‘whats’ in our models?” Yes, to some extent, if we can train on lots and lots of data within the same context. Examples of context would be playing chess or Pokemon Go, driving on streets, website browsing etc. Techniques such as deep learning and reinforcement learning, with GPU hardware, big cluster farms and days or weeks of training, make it possible.
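The dependence on context can be sketched with a toy example. This is only an illustration under stated assumptions, not a real AI system: the "model" is a straight line fitted by least squares, the hidden process (`true_process`) is assumed to be a simple square function, and the names `fit_line` and `mean_abs_error` are made up for this sketch.

```python
# Toy sketch: a model learns the "what" inside one context and is then
# asked to predict in a context it never saw during training.
from statistics import mean


def fit_line(xs, ys):
    """Ordinary least squares fit of y = slope * x + intercept."""
    x_bar, y_bar = mean(xs), mean(ys)
    slope = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys)) / sum(
        (x - x_bar) ** 2 for x in xs
    )
    return slope, y_bar - slope * x_bar


def mean_abs_error(model, xs, ys):
    """Average absolute gap between the model's predictions and the truth."""
    slope, intercept = model
    return mean(abs(slope * x + intercept - y) for x, y in zip(xs, ys))


def true_process(x):
    # Assumed ground truth for this sketch; the model never sees this rule.
    return x * x


train_x = [i / 100 for i in range(101)]        # training context: 0 to 1
shifted_x = [3 + i / 100 for i in range(101)]  # unseen context: 3 to 4

model = fit_line(train_x, [true_process(x) for x in train_x])

error_in_context = mean_abs_error(
    model, train_x, [true_process(x) for x in train_x]
)
error_out_of_context = mean_abs_error(
    model, shifted_x, [true_process(x) for x in shifted_x]
)

print(f"error inside training context:  {error_in_context:.3f}")
print(f"error outside training context: {error_out_of_context:.3f}")
```

The fitted line tracks the data closely where it was trained and drifts far from the truth elsewhere, because it captured the pattern without the rule that produced it.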

The idea of machine learning or deep learning is to mimic through feature extraction, memorizing and generalizing instances of interest. Human brains also memorize and generalize, but in addition they “creatively infer causation.” This is yet to be seen in AI algorithms today and, as you can imagine, it may either move us toward general artificial intelligence or start a new AI winter.
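The gap between association and causation can be sketched with simulated data. This is a hedged illustration, not a method from any particular library: the variables, noise levels and the helper `pearson` are all assumptions chosen for the example. A hidden cause drives both `x` and `y`, so they look strongly related in observed data, yet setting `x` by hand does nothing to `y`.

```python
# Toy sketch: strong correlation in observed data, no causal effect.
import random
from statistics import mean


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length samples."""
    x_bar, y_bar = mean(xs), mean(ys)
    cov = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    var_x = sum((x - x_bar) ** 2 for x in xs)
    var_y = sum((y - y_bar) ** 2 for y in ys)
    return cov / (var_x * var_y) ** 0.5


rng = random.Random(0)  # fixed seed so the run is repeatable
n = 5000

# Observed world: a hidden cause drives both x and y; x never influences y.
hidden = [rng.gauss(0, 1) for _ in range(n)]
x_observed = [z + rng.gauss(0, 0.3) for z in hidden]
y_observed = [z + rng.gauss(0, 0.3) for z in hidden]

# Intervention: we set x ourselves, independently of the hidden cause,
# while y is still produced by the hidden cause alone.
x_forced = [rng.gauss(0, 1) for _ in range(n)]
y_after = [z + rng.gauss(0, 0.3) for z in hidden]

corr_observed = pearson(x_observed, y_observed)
corr_intervened = pearson(x_forced, y_after)

print(f"correlation in observed data:   {corr_observed:.2f}")
print(f"correlation after intervention: {corr_intervened:.2f}")
```

A model trained only on the observed data would happily predict `y` from `x`, and it would be right as long as nobody intervenes. Knowing that `x` is not the cause requires reasoning about how the data was generated, which is the “why” that correlation alone does not supply.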

Today’s AI cannot creatively infer causation on its own

Here are some challenges in today’s AI world:

  1. The models can tell that the sunset at a beach will be red or yellow, but they cannot tell why on their own. They will not know that sunlight scatters differently at red wavelengths, or that pollutants in the atmosphere also play a part.
  2. The models can tell that an X-ray image shows a cancerous polyp, but they cannot tell why. They will not know that the polyp was caused by a DNA mutation, dietary factors and an external factor, trigger or environment from six months earlier.
  3. The models can tell that an umbrella in a picture is for either rain or hot sun, but they cannot tell why it was designed that way in the first place. They will not know that the length and colour of the umbrella were chosen to reflect the rays of the sun while keeping it from flying off at average wind speeds.
  4. The models can tell that a behaviour is fraudulent or suspect, but they cannot really explain why the fraudster is targeting this particular business and using this technique.
  5. The models can power chatbots that answer questions intelligently by learning from a large corpus of past chats, text or Q&A. They will, however, miss sarcasm, humour or the main intent at the outset.

Using circumstantial evidence (WHAT) is different from finding probable cause (WHY)

The above examples assume that AI bots don’t have access to the internet to look up recorded facts in Wikipedia. That would be cheating: it is again just working with what’s already recorded somewhere, instead of the bot doing research on its own and coming to new conclusions, or opening doors and discovering new facts.

Sadly, data scientists today are playing down the “why” question and focusing instead on how we arrive at the “what” in our models. Clearly, human intelligence, creativity and ingenuity are not going to be replaced, at least in the near future. Computer mimicry will only automate results for known or learned situations given enough data; it is not yet able to get into the depths of reasoning and creative thinking. That’s what cognitive computing is really all about!

That said, in today’s state of AI, lots of useful applications are possible for business and life using deep learning, which is sophisticated machine learning mimicry. There are thousands of use cases that can benefit from machine learning and deep learning. It’s still important to know what today’s AI is and is not.

