Site Search

The key is at the "edge"! "Collection" and "analysis" are essential for future data utilization

*This article is based on a lecture given at the Macnica Data・AI Forum 2024 Autumn held in October 2024.

1.First of all

In the modern business environment, it is widely recognized that utilizing data is essential for corporate growth and strengthening competitiveness. However, huge amounts of data exist everywhere, and collecting and analyzing it remains a major challenge. The key to solving these challenges is the "edge." The edge refers to a location that is extremely close to the data source. This data collection and analysis at the edge is an important means of solving the challenges of modern data utilization. In this article, we will explain the challenges of modern data utilization, the importance of the edge, and specific solutions.

2. Current issues in data utilization

Many companies are now using data to grow and improve their business efficiency. The market size for data utilization is growing steadily, and is predicted to reach 1.5 trillion yen in 2022 and 2 trillion yen in 2024. However, as the volume and variety of data increases, several challenges have emerged.

Current issues in data utilization

Data Collection Challenges

The first challenge companies face is collecting the necessary data. Data sources are becoming more diverse, including some that are difficult to collect, such as physical data and human body data. This means that traditional data collection methods are increasingly unable to handle the demand. Specific challenges include:

  • Data diversification: Not only IT assets but also physical data such as factory equipment and human bodies are increasing, making collection methods more complex.
  • Limited network connectivity: Data sources that are not connected to the internet are difficult to collect.
Data Collection Challenges

Data Analysis Challenges

While increasing amounts of data may seem like a good thing, in practice it brings many problems, the main challenges being:

  • Processing load: The load on the analysis platform increases as large amounts of data are processed, causing performance to degrade.
  • Data quality: The presence of noisy or poor quality data reduces the accuracy of analysis results.

To overcome these challenges, we need to rethink how we collect and analyze data.

Data Analysis Challenges

When considering the use of AI, learning from poor-quality information or information that contains noise can reduce the quality of the answers given, and processing large amounts of data can take time, potentially compromising real-time performance.

Data Analysis Challenges

3. The Importance of "Edge"

The key to solving the problems of data utilization is the "edge." The edge refers to a location that is as close as possible to the data source, and data collection and analysis at the edge has the potential to solve the problems of data utilization up to now.

Data Collection at the Edge

Traditionally, data has been collected primarily in central analytics platforms.

Data Collection at the Edge

However, by providing data collection functionality to the edge, it becomes possible to efficiently collect data that was previously very difficult to collect. Specific solution examples include the following:

  • IoT Gateway: Collects data from sensors and cameras and sends it to an analysis platform via the Internet.
  • Wearable devices: Collect physical data using devices such as smartwatches.

For example, Macnica is solving data collection issues by using a solution called Splunk Edge Hub. This solution is palm-sized and easy to install, and supports multiple protocols, especially OT protocols, which are considered difficult to collect.

Data Collection at the Edge

Data Analytics at the Edge

It is also very important to perform data analysis at the edge. By eliminating unnecessary data and performing preprocessing such as labeling before sending the data to the central analysis platform, it is expected that the accuracy of analysis will improve and the processing load will be reduced.

For example, a solution called Cribl optimizes the analysis process by preprocessing data collected from data sources at the edge and providing high-quality data to the analysis platform.

Data Analytics at the Edge

4. Summary

In today's world, where the amount and variety of data is increasing, traditional methods are reaching their limits in collecting and analyzing data. The key to solving today's problems lies in the "edge," and the solution is data "collection" and "analysis" at the edge. By utilizing the edge, it is possible to efficiently collect data that was previously difficult to collect and improve the accuracy of analysis. We have introduced Splunk Edge Hub and Cribl as concrete solutions to achieve this.

Data collection and analysis using the edge will be at the core of data utilization in the future. If you are interested, please feel free to contact us.

Macnica, Inc.
Data & Application Division, 1st Technology Department, 2nd Section, Deputy Section Manager
Tomoki Kawamura

His main duties are proposing data utilization related ideas and planning and developing services. He is skilled at problem solving and planning based on his wide range of experience, and in the past he has been involved in activities such as discovering new products and planning and managing cyber attack/defense exercises for companies.