1DES logo
Cognitrade
arrow_forward_ios

How to Design a Machine Learning Trading Bot - Part 2: Data Analysis

Bahman

Published on Aug 19, 2020

Previously on “How to design a machine learning trading bot”

We have started with “Collecting Data”.

We found out what OHLCV data is, and we learned why we need both historical and online data.

In this part, we dive into the process of preparing and transforming raw financial data into clean, structured input ready for machine learning models—a crucial step that determines the accuracy and reliability of your trading bot.

So, let’s start the second step, considering we already collected the data and have it.

This article is the second episode in the series “How to design a machine learning trading bot”.

Step 2: Data Analysis

Cleaning data, filtering, and feature engineering are essential steps in any machine learning approach.

To explain its importance: imagine splitting development time into ten frames—“Analyzing Data” alone should take five of them. You are working with historical data to provide a structured, usable version for the machine.

Think of this as nutrition for machines: if you feed them with healthy, high-quality data, they perform well. If you feed poor-quality data, even the most powerful machine in the world won’t work properly.

If you have a mathematics or statistics background, this section will feel natural. But if not—don’t worry! Basic knowledge is enough to reach your goals.

Visualizing Data
Visualizing Data

Visualizing Data

Visualization makes data easier to understand and analyze. At this stage, you’ll work with plots and charts to spot insights that raw numbers can hide.

Key Data Analysis Terminology

Two key terms in data analysis you must know before going further: Labels and Features.

For a detailed explanation, see the Google Machine Learning Crash Course .

Last but not least

Many ML tutorials provide pre-prepared datasets, so you start straight with training. But in reality, data is rarely ready-made—you must prepare it yourself. Keep in mind that resources and references can be limited.

In our next season (development), we will cover how to do feature engineering on OHLCV data to prepare it for training.

TRADING MACHINE LEARNING DATA ANALYSIS