Section outline

  • Introduction to Computer Vision and OpenCV

    Computer vision is a field of artificial intelligence that enables machines to interpret and make decisions based on visual input—just like humans do using their eyes and brain. In robotics, this means equipping our robots with the ability to see, recognize, and react to objects, colors, shapes, and movements captured by a camera. This opens up powerful applications like line following, face detection, gesture control, and autonomous navigation.

    • What is OpenCV

      One of the most widely used tools in computer vision is OpenCV (Open Source Computer Vision Library). It is a highly efficient and flexible library written in C++ with Python bindings. OpenCV allows developers to perform a wide range of image processing tasks such as filtering, object detection, edge detection, motion tracking, and more.

      Before we dive into using OpenCV, it's important to understand what it does under the hood. Every image or video is essentially a matrix of pixel values. These values define colors, brightness, and edges. Computer vision processes these pixel matrices to extract useful information. For example, if we want to detect a red ball, OpenCV can isolate red-colored pixels from a video feed, find the ball's outline, and track its movement across frames.
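
      To make this concrete, the short sketch below loads an image with OpenCV and inspects the underlying pixel matrix (the filename ball.jpg is a placeholder):

      import cv2

      # Load an image from disk; 'ball.jpg' is a placeholder filename.
      img = cv2.imread('ball.jpg')

      if img is not None:
          print(img.shape)   # e.g., (480, 640, 3): height, width, BGR channels
          print(img[0, 0])   # BGR values of the top-left pixel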

    • 👓 Uses of computer vision in robotics

      Let’s look at a few key uses of computer vision in robotics:

      • Color detection – Identify specific colors and take action when they appear.
      • Object tracking – Follow a moving item like a hand, ball, or light source.
      • Gesture control – Detect hand signs or body movements to control a robot.
      • Obstacle avoidance – Use vision to detect and avoid barriers in the path.

      In this course, we will use Python and OpenCV together to implement real-world computer vision tasks for robotics. By the end, you will be able to build your own robot that sees, thinks, and reacts to visual input.

      To get started, make sure you have a working installation of Python and OpenCV on your computer. If not, you can use the command below to install OpenCV via pip:

      pip install opencv-python

      We will also use additional tools like NumPy for matrix operations and matplotlib for image display when required.
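
      If NumPy or matplotlib is missing, both can be installed the same way:

      pip install numpy matplotlib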

       This foundational understanding of what computer vision is, and why OpenCV is the tool of choice, will set the stage for all the exciting hands-on sections that follow.

  • Setting Up Your Camera Module for Vision-Based Robotics

    To work with computer vision in robotics, you need a camera module that captures real-time visual data. The most commonly used modules for small robotic projects include the Raspberry Pi Camera Module and generic USB webcams. In this section, we will focus on setting up the Pi Camera, though USB cameras can also be used similarly with OpenCV.

    • 1. Choosing the Right Camera Module
      For Raspberry Pi-based robots, the official Pi Camera Module v2 or v3 offers compact size, good frame rate, and high resolution. If you are working with a PC and Arduino or ESP32 combo, a USB webcam might be a more accessible option. Ensure the camera supports at least 640x480 resolution and 30 FPS for smoother tracking.

      2. Installing Camera Drivers and Enabling the Module
      If you are using Raspberry Pi OS, the camera interface must be enabled first:

      • Open terminal and run sudo raspi-config
      • Navigate to Interface Options > Camera and enable it
      • Reboot your Pi using sudo reboot

      Once rebooted, test your camera with the command libcamera-still -o test.jpg or raspistill -o test.jpg (based on your OS version).

      3. Accessing Camera in OpenCV
      Once the camera is set up and tested, OpenCV can be used to stream real-time video input using the following basic Python code:

      import cv2
      
      cap = cv2.VideoCapture(0)  # Use 0 for default camera
      
      while True:
          ret, frame = cap.read()
          if not ret:
              break
          cv2.imshow('Camera Feed', frame)
          if cv2.waitKey(1) & 0xFF == ord('q'):
              break
      
      cap.release()
      cv2.destroyAllWindows()

      This will open a window displaying the live video from your camera. Press q to exit the feed.
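
      If you need a specific capture mode (such as the 640x480 at 30 FPS suggested earlier), you can request it before the loop starts. A minimal sketch follows; note that many cameras silently fall back to the nearest mode they actually support:

      cap = cv2.VideoCapture(0)

      # Request a capture mode; the camera may substitute the nearest
      # supported resolution and frame rate.
      cap.set(cv2.CAP_PROP_FRAME_WIDTH, 640)
      cap.set(cv2.CAP_PROP_FRAME_HEIGHT, 480)
      cap.set(cv2.CAP_PROP_FPS, 30)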

      4. Camera Troubleshooting Tips

      • Check power supply – insufficient voltage can cause camera lag or dropout
      • Ensure no other program is using the camera in the background
      • On Raspberry Pi, check if camera is detected using vcgencmd get_camera
      • Update your OS and OpenCV to the latest versions to ensure compatibility

       

      With your camera successfully connected and streaming through OpenCV, your robot now has eyes. The next step is to make it understand what it is seeing, which we will accomplish through color detection, object tracking, and more advanced vision techniques.

  • Color Detection and Object Tracking with OpenCV

    Color detection is one of the most effective and beginner-friendly techniques for enabling vision in robotics. By detecting specific colors in the video feed, your robot can identify and track colored objects such as balls, lines, or markers. This section walks you through the entire process of setting up real-time color tracking using OpenCV, from color filtering to object localization and tracking logic.

    • 1. Understanding the HSV Color Space
      Unlike the standard RGB color format, the HSV (Hue, Saturation, Value) color space is much more suitable for color detection because it separates image brightness (Value) from color information (Hue and Saturation). This makes color filtering more reliable under varying lighting conditions.

      HSV Breakdown:

      • Hue: Type of color (e.g., red, blue, green)
      • Saturation: Intensity or purity of the color
      • Value: Brightness of the color

      To use HSV in OpenCV, convert your image frame using:

      hsv = cv2.cvtColor(frame, cv2.COLOR_BGR2HSV)

      2. Defining Color Ranges
      Each color you want to detect needs a defined lower and upper HSV range. For example, to detect a red object:

      import numpy as np

      lower_red = np.array([0, 120, 70])
      upper_red = np.array([10, 255, 255])
      mask = cv2.inRange(hsv, lower_red, upper_red)

      The mask is a binary image that highlights only the parts of the frame that fall within your defined color range.
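
      One caveat: in OpenCV the hue channel runs from 0 to 180, and red sits at both ends of that scale. A common approach is to build a second mask for the high end and merge the two; a minimal sketch:

      # Red wraps around the hue axis, so also mask the high end (170-180)
      # and merge it with the low-end mask from above.
      lower_red2 = np.array([170, 120, 70])
      upper_red2 = np.array([180, 255, 255])
      mask2 = cv2.inRange(hsv, lower_red2, upper_red2)
      mask = cv2.bitwise_or(mask, mask2)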

      3. Finding Object Contours
      Once you have a color mask, use contour detection to identify the shape and location of the object:

      contours, _ = cv2.findContours(mask, cv2.RETR_TREE, cv2.CHAIN_APPROX_SIMPLE)
      for cnt in contours:
          area = cv2.contourArea(cnt)
          if area > 500:
              x, y, w, h = cv2.boundingRect(cnt)
              cv2.rectangle(frame, (x, y), (x + w, y + h), (0, 255, 0), 2)

      This code draws a rectangle around the detected object if its area is large enough, reducing noise from small false detections.

      4. Object Tracking Logic
      To track an object, find its center and map its position relative to the frame’s center. You can then control a robot’s motors to follow the object based on its position:

      • If object is left of center – turn left
      • If object is right of center – turn right
      • If centered – move forward

      Sample logic to calculate object center:

      cx = x + w // 2
      cy = y + h // 2
      cv2.circle(frame, (cx, cy), 5, (255, 0, 0), -1)

      5. Handling Multiple Colors and Objects
      You can track multiple colors simultaneously by applying multiple masks and merging them using bitwise operations. For example:

      combined_mask = cv2.bitwise_or(mask1, mask2)

      Also, remember to handle overlapping objects and sort by area or proximity depending on the task.

      6. Optimizing Performance

      • Use smaller frame size to reduce lag (e.g., 320x240)
      • Apply Gaussian blur to reduce noise
      • Filter contours by area and shape for accuracy

       

      Color detection provides a solid foundation for computer vision in robotics. You can now identify and follow a ball, detect specific markers on the floor, or even trigger actions based on the presence of certain colors. In upcoming sections, we will expand this capability using object detection models and more complex image analysis techniques.

  • Building a Ball-Tracking Robot with OpenCV

    Now that you have a solid understanding of color detection and tracking using OpenCV, it's time to bring that vision capability into the physical world. This section focuses on building a real-time ball-tracking robot that can follow a specific colored ball using a USB or Pi camera. You'll learn how to interface the vision system with motor controls to enable autonomous movement based on camera input.

    • 1. System Overview and Flow
      The robot will use a camera module to capture live video, process each frame using OpenCV to detect the ball, calculate its position in the frame, and then control the motors accordingly. The entire loop happens continuously to maintain real-time tracking.

      Core flow of operations:

      1. Capture frame using OpenCV
      2. Convert to HSV and apply color mask
      3. Find the largest contour (assumed to be the ball)
      4. Determine the ball’s x-position in the frame
      5. Send motor commands via serial to Arduino or Raspberry Pi GPIO

      2. Hardware Setup

      • Camera: USB webcam or Pi Camera (for Raspberry Pi)
      • Processor: Raspberry Pi (preferred) or PC with Arduino for motor control
      • Motors: 2 DC motors connected via L298N motor driver
      • Chassis: Two-wheeled robot chassis with space for electronics

      For Raspberry Pi control, use GPIO pins to send forward, left, and right signals based on object position. For Arduino control, send serial commands (like 'F', 'L', 'R') from your computer or Pi based on OpenCV output.

      3. Frame Division and Control Logic
      Divide the frame into three vertical zones: Left, Center, and Right. Based on the detected object's center (cx), decide the motion:

      if cx < 100:
          ser.write(b'L')   # ball in the left zone: turn left
      elif cx > 220:
          ser.write(b'R')   # ball in the right zone: turn right
      else:
          ser.write(b'F')   # ball centered: move forward

      The ser object used above is a serial connection to the Arduino, created with Python’s serial library (pyserial).
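
      Opening that connection is a one-time setup step before the tracking loop. The sketch below is minimal; the port name and baud rate are assumptions that must match your wiring and Arduino sketch:

      import serial

      # The port name is a placeholder -- typically '/dev/ttyUSB0' or
      # '/dev/ttyACM0' on Linux, 'COM3' on Windows. The baud rate must
      # match the Serial.begin() rate in your Arduino sketch.
      ser = serial.Serial('/dev/ttyUSB0', 9600, timeout=1)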

      On the Arduino side, read these inputs and translate into motor control instructions.

      4. Stabilizing the Robot Behavior

      • Set a minimum area threshold to ignore small, irrelevant detections.
      • Use simple proportional control (the P part of PID) to smooth turning movements based on the ball’s distance from center, as sketched after this list.
      • Add a short delay or frame skip to balance processing load and motor reaction time.
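
      As a rough illustration of the proportional idea, scale the turn by how far the ball sits from the frame’s center. Every name and constant below (Kp, BASE_SPEED, the motor-speed variables) is an illustrative assumption, not part of any particular library:

      # Proportional steering sketch for a 320-pixel-wide frame: the
      # farther the ball is from center, the harder the robot turns.
      FRAME_CENTER_X = 160
      Kp = 0.5          # proportional gain -- tune experimentally
      BASE_SPEED = 60   # nominal forward speed (illustrative units)

      error = cx - FRAME_CENTER_X          # positive: ball right of center
      left_speed = BASE_SPEED + Kp * error
      right_speed = BASE_SPEED - Kp * error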

      5. Testing and Fine-Tuning
      During testing, experiment with different lighting conditions and ball colors. Adjust HSV ranges accordingly. Also calibrate the thresholds for frame zones so the robot makes timely decisions.

      For example, if the robot turns too aggressively, increase the central threshold range. If it reacts too slowly, reduce delays or tweak motor speeds.

      6. Enhancing the Project

      • Add a second camera for better perception
      • Track multiple objects with different colors
      • Combine with ultrasonic sensors to avoid obstacles while tracking

      This project demonstrates a complete cycle of visual input, real-time processing, and robotic output. It is a foundational exercise for more advanced camera-guided robots like pick-and-place systems or autonomous vehicles.

  • Edge Detection and Shape Recognition in Vision Robots

    In this section, we will explore advanced computer vision techniques that move beyond color tracking. You will learn how to detect object boundaries, recognize basic shapes, and apply these skills in robotics. These features enable your robot to interpret more complex scenes, such as following lines, identifying objects by shape, or reacting to patterns in the environment.

    • 1. Understanding Edge Detection
      Edge detection helps a computer recognize the boundaries of objects within an image. It simplifies the visual input by highlighting areas with strong contrast, usually representing object outlines.

      Key methods:

      • Grayscale conversion: Reduces color image to intensity values.
      • Gaussian blur: Smooths image to reduce noise before edge detection.
      • Canny edge detection: A popular and effective method provided by OpenCV.

      Sample code using OpenCV:

      gray = cv2.cvtColor(frame, cv2.COLOR_BGR2GRAY)
      blur = cv2.GaussianBlur(gray, (5, 5), 0)
      edges = cv2.Canny(blur, 50, 150)

      This produces a binary image where white lines represent detected edges.

    • 2. Contour Detection and Shape Analysis
      Contours are curves that follow the boundary of shapes. OpenCV allows you to extract contours and analyze their properties to identify shapes like circles, rectangles, and triangles.

      Steps:

      1. Convert to grayscale
      2. Apply thresholding or edge detection
      3. Use cv2.findContours to extract contour outlines
      4. Analyze contour area, number of edges (using polygon approximation), etc.

      For example, to detect a triangle:

      approx = cv2.approxPolyDP(contour, 0.04 * cv2.arcLength(contour, True), True)
      if len(approx) == 3:
          print("Triangle detected")

    • 3. Shape-Based Robotic Applications

      • Object classification: Identify items based on geometry (e.g., only pick up circles).
      • Obstacle differentiation: Avoid square boxes, follow circular markers.
      • Path detection: Use contour shapes to follow white lines or marked routes.

      These methods are used in robotic sorting systems, warehouse bots, and smart vehicles.

    • 4. Challenges and Improvements

      • Lighting: Strong shadows or low light can affect edge clarity.
      • Noise: Blurring and morphological operations (like erosion) can reduce noise.
      • Perspective distortion: Adjust for camera angles or use shape ratio filters.

    • 5. Experiment Ideas

      • Detect different paper cut-out shapes placed on a board.
      • Build a shape-sorting robotic arm that classifies objects by their contour.
      • Combine shape detection with color filtering for enhanced accuracy.

      Edge and shape detection elevate your robot’s ability to interact with the environment using visual logic, making it more autonomous and intelligent in real-world applications.

  • Project: Ball-Tracking Turret Using OpenCV

    In this section, we will apply all the concepts learned so far in a practical and engaging project—a camera-controlled turret that tracks and follows a moving colored ball. This project integrates hardware and software: image processing with OpenCV and physical actuation using servo motors controlled by Arduino or Raspberry Pi. It mimics how real-world vision-based robots perform tracking tasks like surveillance, sorting, or interaction.

    • 1. Project Overview and Objective
      The goal is to build a simple robotic turret that can rotate and tilt to follow a specific color object, such as a red or blue ball. The camera identifies the position of the ball in its frame, and based on the ball’s location, it adjusts the servos to center the ball on screen.

      This project involves:

      • Camera input via USB or Pi Camera
      • OpenCV for object detection
      • Arduino or Raspberry Pi for servo control
      • Servo mount for horizontal and vertical motion (2 DOF)

      2. Setting Up the Hardware

      • Camera: Mount it securely to see the tracking area.
      • Servo motors: Connect two micro servos for pan and tilt movement.
      • Microcontroller: Use Arduino Uno or connect Raspberry Pi GPIOs if you're using Python to control servos directly.
      • Power supply: Ensure servos are powered externally to avoid current issues.

      Servo connection tips:

      • Use PWM-capable pins
      • Use Servo.h library on Arduino
      • Connect camera to Raspberry Pi if using Python with OpenCV

      3. Ball Detection Using OpenCV

      To detect a colored ball, use HSV color space for robust detection under different lighting:

      hsv = cv2.cvtColor(frame, cv2.COLOR_BGR2HSV)
      mask = cv2.inRange(hsv, lower_color, upper_color)
      contours, _ = cv2.findContours(mask, cv2.RETR_TREE, cv2.CHAIN_APPROX_SIMPLE)

      Once you find the largest contour, calculate its center and use that to adjust servo angles accordingly.
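
      A minimal sketch of that step, assuming the contours list from the call above:

      # Pick the largest contour and compute its center using image moments.
      if contours:
          largest = max(contours, key=cv2.contourArea)
          M = cv2.moments(largest)
          if M["m00"] != 0:
              ball_x = int(M["m10"] / M["m00"])   # x of the ball's center
              ball_y = int(M["m01"] / M["m00"])   # y of the ball's center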

      4. Servo Movement Logic

      Compare the center of the ball with the center of the frame. If the ball is left of center, rotate the pan servo to the left; if it is above center, tilt the camera up, and so on. This real-time feedback loop allows smooth tracking.

      Example pseudocode:

      if ball_x < center_x:
          pan_angle -= 1
      elif ball_x > center_x:
          pan_angle += 1
      
      if ball_y < center_y:
          tilt_angle -= 1
      elif ball_y > center_y:
          tilt_angle += 1

      5. Testing and Calibration

      • Test detection using different colors and distances
      • Calibrate servo angles to prevent overshooting
      • Add range limits to prevent the servos from rotating too far (see the sketch below)
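
      One way to enforce those range limits is to clamp the angles before writing them to the servos; the limit values below are illustrative and depend on your turret’s mechanics:

      # Clamp pan/tilt to a safe mechanical range (placeholder limits).
      PAN_MIN, PAN_MAX = 20, 160
      TILT_MIN, TILT_MAX = 30, 150

      pan_angle = max(PAN_MIN, min(PAN_MAX, pan_angle))
      tilt_angle = max(TILT_MIN, min(TILT_MAX, tilt_angle))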

      6. Extensions and Variations

      • Track multiple objects and switch targets
      • Use voice commands to activate tracking
      • Integrate object size estimation to control zoom

      This project combines many core robotic concepts—mechanical control, image processing, real-time feedback, and calibration. It is an exciting step toward building robots that can truly sense and respond to the environment dynamically.

  • Performance Optimization and Challenges

    Vision-based robotic systems often face real-world constraints such as limited processing power, inconsistent lighting, and the need for real-time performance. This section focuses on techniques to enhance the performance of your camera and computer vision system while maintaining reliable accuracy.

    • 1. Reducing Lag and Frame Drop

      Lag and frame drops can result in delayed servo movements, causing the robot to miss its target. Here are ways to reduce these issues:

      • Lower Resolution: Reduce video frame size (e.g., from 640x480 to 320x240) to improve processing speed.
      • Limit Frame Rate: Set a fixed frame rate to reduce CPU overload and maintain consistency.
      • Efficient Loops: Avoid unnecessary processing inside your frame loop—only run essential tasks like detection and movement calculation.
      • Hardware Acceleration: Use GPU acceleration if available, especially on platforms like Jetson Nano or Raspberry Pi 4 with OpenCV compiled for hardware support.

      2. Dealing with Variable Lighting

      Lighting conditions greatly affect the accuracy of color detection and image processing. Consider the following tips:

      • Use HSV Color Space: Unlike RGB, HSV separates brightness from color, making it more robust in changing light.
      • Apply Gaussian Blur: Smooths out sharp noise before thresholding to reduce false detection.
      • Adjust Exposure Settings: Many cameras support manual exposure, which can prevent the flickering or washed-out areas that auto exposure sometimes causes.
      • Enclose Setup: For consistent results, enclose the tracking area with controlled lighting, especially for demo purposes.

      3. Balancing Accuracy vs. Speed

      A fast system may sacrifice precision, while a very accurate system may be too slow. You must balance the two based on your application:

      • Reduce Detection Area: Track only a region of interest (ROI) instead of the full frame for faster processing.
      • Approximate Shape Matching: Use bounding boxes or centroid tracking rather than complex contour analysis.
      • Skip Frames: In non-critical applications, process every second or third frame to lighten the load while maintaining responsiveness (see the sketch after this list).
      • Preprocess Smartly: Use thresholding, morphological operations, and early exits in code for fast rejection of unwanted data.
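
      Two of these ideas, frame skipping and restricting work to a region of interest, fit in a few lines. The ROI coordinates and skip factor below are illustrative assumptions:

      import cv2

      cap = cv2.VideoCapture(0)
      frame_idx = 0

      while True:
          ret, frame = cap.read()
          if not ret:
              break
          frame_idx += 1
          if frame_idx % 3 != 0:
              continue                      # process every third frame
          roi = frame[100:240, 80:320]      # rows (y) first, then columns (x)
          # ... run detection on roi instead of the full frame ...
          cv2.imshow('ROI', roi)
          if cv2.waitKey(1) & 0xFF == ord('q'):
              break

      cap.release()
      cv2.destroyAllWindows()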

      4. Real-World Testing and Iteration

      Always test your robot in various conditions—different lighting, backgrounds, distances, and speeds. Record performance, and tweak parameters such as HSV ranges, servo delay times, and control gains to optimize tracking stability.

      By systematically identifying and addressing these challenges, you can build a much more robust and responsive vision system that performs reliably in both indoor and outdoor environments. These optimizations are essential to scaling your vision-based projects to more complex or real-world applications.

  • Congratulations on completing the course on Camera & Vision-Based Robots. You have learned the fundamentals of integrating computer vision into robotics, from setting up a Pi Camera and using OpenCV to implementing real-time color detection, object tracking, and optimizing performance for real-world environments. You explored core image processing techniques and built practical projects like a ball-tracking robot. With these skills, you are now prepared to experiment with more advanced vision-based systems. Let’s assess your understanding with a quick quiz.