MATLAB - Analyze Your CAN Bus Data [Parquet, MDF]

MATLAB CAN bus MF4 MDF

Need to analyze your CAN bus data in MATLAB?

In this overview, you'll learn how to load your CAN bus data into MATLAB - either as MF4 files (MDF) or DBC decoded Parquet files. We also outline key features of MATLAB's Vehicle Network Toolbox and Predictive Maintenance Toolbox.



How to log & interpret CAN bus data?

The CAN protocol is today used in practically all automotive vehicles (cars, trucks, buses, forklifts, ...) and in many industrial machinery applications. This makes CAN data important to many use cases, such as vehicle diagnostics, fleet management, predictive maintenance, academic research and more.

Further, several 'higher layer protocols' exist based on CAN, such as J1939 (heavy duty), OBD2 (cars), CANopen (industrial), NMEA 2000 (maritime), ISOBUS (agriculture) and more.

To record CAN bus data (incl. higher layer protocols), you can use a CAN data logger. For example, the CANedge1 lets you log CAN data to an SD card, while the CANedge2/CANedge3 also enable upload of the data to your own server via WiFi/LTE.

The devices record 'raw' CAN frames, which need to be decoded to physical values for analysis - typically via a DBC file. Once the CAN bus data is DBC decoded, you can perform e.g. graphical visualization and statistical analysis.


In some cases you may simply want to open a specific log file and analyze the data via a software GUI tool. For example, you may need to analyze a specific event or issue in a vehicle.

For such use cases, various GUI software tools can be used. For the CANedge, you can for example use the free asammdf GUI - or convert & load data in your favorite GUI tools:

asammdf gui free CAN bus software tool open source

asammdf GUI

This is a general-purpose GUI tool that lets you load MF4 log files, DBC decode them and plot the physical values.

asammdf intro
ASAM MDF4 Measurement Data Format Converters ASC TRC CSV

MF4 converters

If you prefer using software like CANalyzer, PCAN-Explorer etc. you can convert your MF4 files to the relevant format.

mf4 converters intro
CAN bus data acquisition and DBC conversion




Why use MATLAB for CAN bus data processing?

MATLAB is popular tool for analyzing CAN bus data amongst automotive OEMs (Original Equipment Manufacturers). In particular, MATLAB is popular for large-scale automotive fleet testing, algorithm development and predictive modelling.

Looking at a typical data analytics workflow, MATLAB offers several benefits:

MATLAB Load MF4 MDF Data

Load and explore CAN bus data

Load CAN data from various log file formats (Parquet, MF4, MAT, ...) and locations (local, S3, ...). Data can be loaded as e.g. single file, datastores or tall arrays - for exploration via tables, stats & plots

DBC Decode MF4 MDF Data in MATLAB

DBC decode and preprocess data

DBC decode raw CAN data to physical values via the Vehicle Network Toolbox. Further, easily clean and pre-process your decoded CAN bus data via smoothing, filling, outlier handling, rescaling etc.

MATLAB Predictive Maintenance Automotive CAN bus

Develop models & analyses

Datastores and tall tables let you easily switch between analyzing a single device, trip or signal - to performing statistical analysis across fleets or months of data, incl. via e.g. machine learning

MATLAB predictive model vehicle fleet telematics

Deploy & integrate in your system

MATLAB offers the expertise and tools for deploying your CAN bus analytics, from development to production - for example to implement predictive maintenance models into your fleet operations




Why use the CANedge with MATLAB?

The CANedge is ideal for OEM engineers using MATLAB:

The hardware is low cost, meaning several devices can be deployed for e.g. prototype vehicle fleet tests. At the same time, it's extremely easy to work with your CANedge data in MATLAB - either by directly loading the MF4 log files, or by connecting MATLAB to your Parquet data lake of DBC decoded data.

learn about the CANedge






How to load your CAN bus data into MATLAB

Directly load unfinalized MF4 logs from the CANedge

With MATLAB and the Vehicle Network Toolbox, you can directly load MF4 log files from the CANedge with raw CAN/LIN data. The MF4 file can then be finalized in MATLAB via a simple function call. This approach makes it easy to load specific log files in MATLAB for e.g. vehicle diagnostics and automotive testing purposes.

docs script example

MATLAB Load Unfinalized MF4 CAN bus

Load 'finalized' MF4 files - incl. via datastores

You can also use the Vehicle Network Toolbox to load 'finalized & sorted' MF4 files. To do this, you can use our MF4 converter to perform the finalization & sorting step prior to loading your data (a process which can be easily automated). You can also load finalized MF4 files into MATLAB datastores to easily work across large numbers of log files.

docs mf4 datastores script example

MATLAB Vehicle Network Toolbox load MDF MF4

Load DBC decoded MF4 files - incl. via tall arrays

The VNT also lets you load MF4 log files that have been DBC decoded to physical values outside MATLAB. The DBC decoding can be done via the asammdf GUI or automated API scripts as per our examples. In this case, you can also use tall arrays, ideal for "big data" use cases - like fleet testing across 100+ devices where the terabytes of data do not fit into memory.

MF4 tall arrays many MF4 files

Load DBC decoded data in MATLAB J1939 OBD2 CAN

Load DBC decoded Parquet data lake [recommended]

You can use our MF4 decoders to easily create a Parquet data lake of DBC decoded CAN/LIN data - which can natively be processed within MATLAB (no toolbox required). MATLAB also supports Parquet tall arrays (stored locally or on S3) for out-of-memory operations - as well as powerful time resampling functions. This is our recommended integration method.

Learn more

MATLAB Load CAN Bus Data from Parquet Data Lake

Easily load data from S3

If you're using the CANedge2/CANedge3, you may wish to load log files directly from your S3 server. This can be done in multiple ways: Within MATLAB, you can create a datastore from an S3 bucket based on DBC decoded MF4 or Parquet files. Alternatively, you can mount your S3 bucket as a local drive using e.g. TntDrive.

Mount S3 as drive

MATLAB CAN Bus MDF MF4 S3 MinIO AWS




MATLAB's Vehicle Network Toolbox

MATLAB's Vehicle Network Toolbox offers powerful features for working with your CAN data:

MATLAB LOAD MDF MF4 LOG FILE MF4 SUPPORT

Easily load MF4 log files from the CANedge

MATLAB Datastores MDF MF4 CAN bus DATASTORES

Load folders / buckets and work with 'datalakes'

MATLAB Tall Arrays Vehicle Fleet Testing TALL ARRAYS

Easily analyze huge datasets (beyond memory)

MATLAB MDF MF4 S3 CAN Bus LOCAL / S3

Load files / datastores directly from S3 servers

DBC decode MF4 MDF in MATLAB DBC DECODING

Decode raw CAN bus data using DBC files incl. J1939

MATLAB integrate CAN bus data telematics INTEGRATE

Easily integrate with your existing systems


The Vehicle Network Toolbox makes it easy to DBC decode raw CAN bus data to physical values. You can either extract tables containing all decoded signals - or extract only specific signals for further analysis.

You can also perform decoding of J1939 data using e.g. our J1939 DBC file (with some minor tweaks as per our script examples).

docs


MATLAB does not currently support transport protocol decoding e.g. for J1939 multiframe messages, NMEA 2000 fast packets or UDS (Unified Diagnostic Services). If this form of decoding is required, we recommend that you use our Python API.



Example: MF4 CAN bus tall arrays

One of the powerful features of the Vehicle Network Toolbox is ability to load MF4 log files into MDF tall arrays. Effectively, this lets you analyze an unlimited number of log files with a few lines of code - regardless of the total file size.

Further, tall arrays can be plotted directly via MATLAB's visualization tools like plot, histogram and histogram2 - enabling quick visualization of e.g. months of data.

docs script example






CAN bus WiFi Telematics & Dashboards

Case study: Truck telematics

Cummins logo case study

Learn how Cummins uses the CANedge2 and CANmod.gps to collect J1939 + GPS/IMU data to their own AWS S3 server for analysis in MATLAB's Vehicle Network Toolbox.

"Excellent functionality. Outstanding support. Continuous improvements. The CANedge2 is the best you can get!"

full case study



Predictive maintenance with the CANedge and MATLAB

In recent years we've seen an increasing demand amongst our users for deploying predictive maintenance across various assets in the field - e.g. cars, trucks, machines, battery systems etc.

In this section we briefly outline how you can approach this topic using the CANedge2/CANedge3 and MATLAB.

CANedge2 for predictive maintenance

The CANedge2 is an ideal device for predictive maintenance as it lets you easily record CAN/LIN data - and automatically push this to your own S3 server for processing.

The raw CAN data is recorded to an SD card, meaning that a periodic loss of connectivity does not result in loss of data (e.g. if vehicles operate in mines, tunnels, outside of WiFi range etc). Further, data can be uploaded via stationary WiFi access points using the CANedge2 or via 3G/4G using the CANedge3 (for on-the-road upload of data). Log files can be split in configurable chunks based on size or time - e.g. in 1 MB splits, 10 second splits, 30 min splits etc.

Vehicle Telematics Downtime Breakdowns Data Logger SD card

We encounter two common ways to utilize the CANedge in predictive maintenance:



MATLAB collect CAN data for predictive maintenance

#1 Predictive model training

When training predictive models (e.g. via machine learning), it is vital that the data sets are "rich" and "complete". Here, the CANedge2 is ideally suited as you can record "everything" from 2 x CAN/LIN - sometimes comprising 4000+ frames/second. This amount of data can correspond to 2 GB+ per day - making it too costly for 3G/4G transfer, but often fine for periodic offload via WiFi routers in e.g. a garage. Via the device configuration, you can of course optimize your logging via filters, prescalers, triggers and compression.

In this type of use case, the CANedge2 may be deployed in e.g. 100-300 vehicles over a period of 6-24 months. By collecting operational data (such as speed, RPM, temperatures) along with "failures" (e.g. diagnostic trouble codes), you can collect many TB of 'failure data' directly in S3 for e.g. machine learning models



#2 Operational deployment

The CANedge2/CANedge3 can also be used in production deployments of predictive maintenance models. Here, the upload frequency is typically increased (e.g. to 1 minute or 5 minute intervals) and the incoming data is often automatically decoded, analyzed and reacted upon through event triggered scripts (using e.g. AWS Lambda functions).

Use cases often start with basic predictive models based on threshold analyses: "Does parameter X go above limit Y for Z seconds, while parameter A is below limit B?".

This type of "practical" predictive maintenance is typically a good way to start and many of our users successfully predict e.g. battery fires in forklifts, tire issues etc. via such methods.

See also our separate intro to automotive predictive maintenance and MATLAB's "event detection" MDF example.

CAN bus predictive maintenance vehicle fleet


MATLAB's Predictive Maintenance Toolbox

As use cases scale and the sophistication requirements increase, it becomes increasingly important to train and deploy more advanced predictive models. Here, tools like the MATLAB Predictive Maintenance Toolbox can be extremely useful:

Estimate remaining useful life RUL MATLAB ESTIMATE RUL

Estimate an asset's time to failure via Remaining Useful Life (RUL) predictions

MATLAB Machine Learning CAN bus predictive maintenance MODELLING

Use survival, similarity, trend based or ML models to estimate the RUL

Vehicle Network Toolbox MF4 Predictive Maintenance INTEGRATE MF4

Easily integrate CAN data MF4 log files via the Vehicle Network Toolbox

MATLAB Predictive Maintenance Reference Model QUICKSTART

Take outset in existing reference models for e.g. gearboxes, batteries, motors

Predict vehicle breakdowns MATLAB DETECT ISSUES

Detect system anomalies and faults via e.g. control charts or Kalman filters

Deploy predictive maintenance vehicle fleet DEPLOY

Deploy predictive models to edge devices or cloud e.g. via generated C/C++ code

You can learn more about the Predictive Maintenance Toolbox here. MATLAB also provides a video intro to predictive maintenance.


As indicated above, predictive maintenance for CAN bus data can sound simple enough: Evaluate signal values vs. defined threshold limits to determine if an issue is about to occur. If an issue occurs, trigger an alert.

While simple models can be set up this way, practical deployment across a fleet of vehicles or machines will typically require more advanced modelling. Even basic things like proper data preprocessing can be make-or-break to avoid countless "false alert" situations.

Here, using the CANedge2 while partnering with MATLAB provides an excellent combination of low cost hardware and access to vital support and useful tools.


See e.g. practical examples for RUL estimation in a Wind Turbine Bearing, fault detection in a DC motor or the estimation of battery degradation via Kalman filters.





FAQ


We provide basic support in regards to loading your CAN bus data into MATLAB (see our script examples). Further, we can typically answer basic questions and we're always happy to provide high level sparring on your use cases.

For more technical support on your MATLAB script, we recommend to reach out to the MATLAB support team directly, as their experts will be better suited to diagnose any issues and help you quickly progress.


You can load MF4 log files from the CANedge and replay them within the MATLAB/Simulink environment, see this article for details.


As per our Python API article, we also offer a Python API for use with the CANedge CAN bus data loggers. The API lets you do many of the same things as MATLAB and Python and our API are 100% open source and free.

However, if you're going to perform advanced analyses or if you're already familiar with MATLAB, you'll probably be better off going with MATLAB. In particular, this route has the major benefit of enabling support and expertise from the MATLAB team. When it comes to e.g. data preprocessing and predictive modelling, it's not only a matter of 'whether' something can be done in e.g. Python or MATLAB - but rather 'how' it can be done. Implementing advanced models in the best way can be hard to do on your own - and here MATLAB's team of experts can help provide outside-in sparring based on similar experiences across other automotive end users.

We also have users that start with simple use cases in Python and later on move to MATLAB. At the end of the day, you can decide on your own as the CANedge and related MF4 data is designed to be interoperable - letting you manage your data in the way you prefer.




Ready to process your CAN data in MATLAB?

Get your CANedge today!







Recommended for you