CV: Cargill Agriculture Image Classifier

EXECUTIVE SUMMARY

Developed and compared multiple convolutional neural network (CNN) models to classify plant seedling images for Cargill, delivering a high-accuracy proof-of-concept for automating agricultural identification.

Goal:

Automate the classification of 12 different plant seedling species to improve efficiency and support precision agriculture.

Approach:

Built a baseline CNN and implemented a transfer learning solution with ResNet50. Applied data augmentation (random rotations) and used class weights to create a robust model that generalizes well.

Outcome:

The final transfer learning model achieved ~95% validation accuracy. This POC demonstrates a powerful and scalable solution for reducing manual labor and enhancing crop management.

THE CHALLENGE

The manual identification of plant seedlings is a time-consuming and labor-intensive process in agriculture. Cargill sought a proof-of-concept to demonstrate that computer vision could automate this task accurately and efficiently, paving the way for data-driven farming solutions.

MY APPROACH

1. Data Preparation & Augmentation:

Preprocessed a dataset of 4,750 seedling images across 12 species.
Implemented on-the-fly data augmentation to introduce rotational variance into the training set, improving model generalization.
Addressed a notable class imbalance by calculating and applying class weights during training.

2. Model Design & Training:

Baseline Model: Built a sequential CNN from scratch to establish an initial performance benchmark (~88% accuracy).
Advanced Model: Implemented a transfer learning strategy using a pre-trained ResNet50 model, adding custom classification layers on top.

3. Evaluation:

Utilized Adam optimizer and categorical cross-entropy loss.
Monitored learning curves and used callbacks like EarlyStopping and ReduceLROnPlateau.
Performed a detailed evaluation using a confusion matrix and classification report.

PERFORMANCE & VALIDATION

Validation Accuracy: ~95%
The ResNet50 model proved highly effective, demonstrating robust classification even for visually similar species.
Analysis confirmed the model's ability to overcome the initial dataset imbalance.

IMPACT & BUSINESS RELEVANCE

Accuracy: Delivers high-fidelity classification suitable for real-world agricultural applications.
Efficiency: Provides a foundation for an automated system that can classify crops instantly, reducing reliance on manual labor.
Scalability: The architecture is adaptable and can be fine-tuned for other agricultural classification tasks.

NEXT STEPS

Integrate with mobile or IoT camera systems for field deployment.
Expand dataset to include seasonal variations and multi-region crops.
Implement real-time inference for on-site decision making.