Model training of AI agrifood-related algorithms

Artificial intelligence model training in TEF infrastructure for agrifood applications. 

Interested in this service? Contact us at servicios@gradiant.org

Overview

The AI model training service in TEF infrastructure offers a powerful platform for developing cutting-edge artificial intelligence solutions in the agrifood sector. Leveraging high-performance computing resources, this service enables innovators to train AI models using diverse tabular data types. These include sensor readings (such as soil moisture, temperature, and nutrient levels), machine-generated data (from agricultural equipment and IoT devices), crop yield statistics, weather patterns, and supply chain metrics. Users can either provide their own datasets or utilise existing data within the TEF infrastructure (see Related Services). The service supports a wide range of AI frameworks and model architectures, allowing for flexible experimentation and rapid iteration. From predictive maintenance of farming equipment to optimising crop management practices, this platform accelerates the development of AI-driven solutions that address critical challenges in agriculture and food science, fostering innovation and efficiency in the industry. 

 

More about the service

Discover more about our service, including how it can benefit you, the delivery process, and the options for customisation tailored to your specific needs!

The AI model training service in TEF infrastructure addresses critical needs in the agrifood sector: it enables the improvement of existing AI models and provides the computational power to run existing algorithms at scale. Before using this service, customers may have developed algorithms but lack the computational capacity to train or run them effectively on large datasets. After utilizing this service, they can overcome these limitations.

The AI model training service in TEF infrastructure is already available to execute. Customer requirements are initially captured in an interview (online or face to face) between the service provider and the customer. Then, we can set the time needed to execute it (from approximately one month to several weeks). The service is executed remotely in digital infrastructure located in Vigo (Spain) so the customers could access the service remotely regardless of their location. As a result, the customer will receive documentation about the training process, the AI model itself and a user manual to know how to use it. The customer should provide the data to train the AI model and, optionally, a model prototype that the customer has already trained. If the customer lacks the data needed, the TEF could provide it through a dataset provision service, like these ones (see Related Services).

As a generic example, a company that have developed a predictive system to assist farmers in some agrifood process can improve the system with new data using TEF infrastructure.

The customer should provide the data to train the model. In case the customer provides a model prototype, it should be trained with one of the main machine learning development technologies in python language, like sckit-learn, catboost, xgboost, lightgbm, pytorch, keras, tensorflow, etc.

As a more specific example, the company VineGuard Analytics has developed a predictive system that helps wine producers optimize their grape harvesting schedule by forecasting the optimal harvest window 2 weeks in advance. The system analyses data from:
- Soil moisture sensors.
- Local weather station readings (temperature, humidity, rainfall).
- Historical harvest dates and quality ratings.
- Grape sugar content measurements.
- Satellite imagery of vineyard health.

To improve their model's accuracy, VineGuard wants to incorporate new data from their clients' vineyards. Each client provides:
- Three years of historical data including:
- Daily sensor readings in CSV format.
- Harvest dates and corresponding wine quality scores.
- Weekly grape analysis results.

A pre-trained XGBoost model (Python 3.8+) that:
- Takes 14 input features (weather, soil, and grape metrics).
- Outputs a harvest readiness score (0-100).
- Uses standard scikit-learn preprocessing pipelines.
- Is version controlled through Git.

The model will be re-trained on TEF infrastructure using the combined historical datasets from multiple vineyards while maintaining data privacy between clients. The improved model aims to reduce harvest timing errors from ±5 days to ±3 days.
Location
Remote
Type of Sector
Arable farming
Food processing
Greenhouse
Horticulture
Livestock farming
Tree Crops
Viticulture
Type of service
AI model training
Accepted type of products
Software or AI model