*This article is based on a lecture given at the Macnica Data・AI Forum 2024 Autumn held in October 2024.
1.First of all
In the modern business environment, it is widely recognized that utilizing data is essential for corporate growth and strengthening competitiveness. However, huge amounts of data exist everywhere, and collecting and analyzing it remains a major challenge. The key to solving these challenges is the "edge." The edge refers to a location that is extremely close to the data source. This data collection and analysis at the edge is an important means of solving the challenges of modern data utilization. In this article, we will explain the challenges of modern data utilization, the importance of the edge, and specific solutions.
2. Current issues in data utilization
Many companies are now using data to grow and improve their business efficiency. The market size for data utilization is growing steadily, and is predicted to reach 1.5 trillion yen in 2022 and 2 trillion yen in 2024. However, as the volume and variety of data increases, several challenges have emerged.
Data Collection Challenges
The first challenge companies face is collecting the necessary data. Data sources are becoming more diverse, including some that are difficult to collect, such as physical data and human body data. This means that traditional data collection methods are increasingly unable to handle the demand. Specific challenges include:
- Data diversification: Not only IT assets but also physical data such as factory equipment and human bodies are increasing, making collection methods more complex.
- Limited network connectivity: Data sources that are not connected to the internet are difficult to collect.
Data Analysis Challenges
While increasing amounts of data may seem like a good thing, in practice it brings many problems, the main challenges being:
- Processing load: The load on the analysis platform increases as large amounts of data are processed, causing performance to degrade.
- Data quality: The presence of noisy or poor quality data reduces the accuracy of analysis results.
To overcome these challenges, we need to rethink how we collect and analyze data.
When considering the use of AI, learning from poor-quality information or information that contains noise can reduce the quality of the answers given, and processing large amounts of data can take time, potentially compromising real-time performance.
3. The Importance of "Edge"
The key to solving the problems of data utilization is the "edge." The edge refers to a location that is as close as possible to the data source, and data collection and analysis at the edge has the potential to solve the problems of data utilization up to now.
Data Collection at the Edge
Traditionally, data has been collected primarily in central analytics platforms.
However, by providing data collection functionality to the edge, it becomes possible to efficiently collect data that was previously very difficult to collect. Specific solution examples include the following:
- IoT Gateway: Collects data from sensors and cameras and sends it to an analysis platform via the Internet.
- Wearable devices: Collect physical data using devices such as smartwatches.
For example, Macnica is solving data collection issues by using a solution called Splunk Edge Hub. This solution is palm-sized and easy to install, and supports multiple protocols, especially OT protocols, which are considered difficult to collect.
Data Analytics at the Edge
It is also very important to perform data analysis at the edge. By eliminating unnecessary data and performing preprocessing such as labeling before sending the data to the central analysis platform, it is expected that the accuracy of analysis will improve and the processing load will be reduced.
For example, a solution called Cribl optimizes the analysis process by preprocessing data collected from data sources at the edge and providing high-quality data to the analysis platform.
4. Summary
In today's world, where the amount and variety of data is increasing, traditional methods are reaching their limits in collecting and analyzing data. The key to solving today's problems lies in the "edge," and the solution is data "collection" and "analysis" at the edge. By utilizing the edge, it is possible to efficiently collect data that was previously difficult to collect and improve the accuracy of analysis. We have introduced Splunk Edge Hub and Cribl as concrete solutions to achieve this.
Data collection and analysis using the edge will be at the core of data utilization in the future. If you are interested, please feel free to contact us.

Macnica, Inc.
Data & Application Division, 1st Technology Department, 2nd Section, Deputy Section Manager
Tomoki Kawamura
His main duties are proposing data utilization related ideas and planning and developing services. He is skilled at problem solving and planning based on his wide range of experience, and in the past he has been involved in activities such as discovering new products and planning and managing cyber attack/defense exercises for companies.