Agrolati – Plant Stalking Project

Solution Architecture Documentation

1. Overview of the Solution

Agrolati is an embedded AI-driven plant monitoring and conversational assistant designed for smallholder farmers. The system combines hardware sensors with custom machine-learning models and an LLM-powered response engine to diagnose crop conditions and deliver voice-based guidance in real time.

The initial version of Agrolati is optimized for maize crops, using synthetic, Nigeria-specific environmental data to ensure contextual relevance.


2. AI Models and Techniques Used

2.1 Custom Crop Condition Classification Model

2.2 LLM-Based Insight Generator

2.3 Text-to-Speech (TTS) System


3. Training Datasets and Sources

3.1 Dataset Used

3.2 Rationale for Synthetic Data


4. Solution Architecture Diagram (Text Description)

USER → Agrolati Hardware - Microphone (captures speech) - Sensors (Soil Moisture Sensor, Photoresistor/LDR) Hardware → Software Backend - Audio sent to speech-to-text engine - Sensor data transmitted to classification pipeline AI Core Layer 1. XGBoost Classification Model (Proprietary) - Input: Light + Moisture readings - Output: Diagnostic Condition (1 of 9 states) 2. LLM (Grok API) - Input: Diagnostic text - Output: Natural-language explanation & actionable advice 3. Text-to-Speech Engine - Converts LLM output into audio Software Backend → Hardware - Audio response returned to device - User hears Agrolati's voice feedback

5. Rationale for Architecture / Model Choices

Component Rationale Licensing
XGBoost High performance for structured data, fast computation, works well on embedded systems Open Source (Apache 2.0)
Custom Crop Condition Model Purpose-built for Nigerian maize conditions; ensures localized accuracy Proprietary
Synthetic Dataset (sammaz.csv) Eliminates data scarcity; enables domain-specific training Proprietary
Grok AI LLM High reasoning capability; suitable for generating detailed agronomic advice Proprietary
TTS Engine Provides natural audio output to the hardware device Licensed / Proprietary
Embedded Hardware Sensors Lightweight, low-power components suitable for rural environments Licensed hardware components

6. Resources Utilized

6.1 Open-Source Components

6.2 Proprietary Components

6.3 Licensed or Third-Party Resources


7. Risks & Mitigation Plans

Risk Impact Mitigation
Slow response time Poor user experience Optimizing pipeline, exploring MCP for faster inference
Limited sensor accuracy Misdiagnosis Calibration routines + data smoothing
Overreliance on synthetic data Potential performance gaps Field-data expansion in future updates
Connectivity issues LLM unavailability Local fallback responses in future versions

8. Future Roadmap

Download Model Output Example Download Synthetic Data (sammaz.csv)