One year later…

Last year was my first encounter with the robotic arm. I managed to understand what it is and how to set it up. Sounds superficial? Indeed!.

This spring semester I decided to face the robotic arm again, with a bit more patience and a bit more courage. I felt confident that at least I can learn a bit more on the software side.

Goals:

The GRAND goal was to stack jenga blocks in a closed-loop visual control rather than a scripted coordinate.

To achieve this I give myself some Sub Goals:

Using the Intel RealSense D435 to build a clean data collection setup.
- Meaning that I can create a repeatable camera setup and capture images of Jenga blocks under different lighting, angles, backgrounds, and stack arrangement
Using classical OpenCv to detect jenga blocks in the image.
- edge detection
- contour detection
- rectangle fitting
- oriental estimation
Estimate block position and orientation in real space
Execute reliable pick actions
Execute reliable place actions
Add corrections and feedbacks

Outreach:

Cold email is sometimes very hot:
Response

Resources

UFACTORY Documentation — Official UFACTORY docs for manuals, APIs, support articles, and release notes.
xArm Python SDK — Official Python SDK for controlling xArm robots.
UFACTORY Vision — UFACTORY’s vision-related repository, useful for camera integration and vision-based robotic workflows.
GGCNN Kinova Grasping — Reference project for vision-guided robotic grasping using GG-CNN.

OpenCV resources

OpenCV-Python Tutorials: Image Processing in OpenCV — Main OpenCV image-processing tutorial hub.
Camera Calibration — Calibrate the camera and correct distortion.
Camera Calibration and 3D Reconstruction (calib3d) — Geometry reference for calibration and projection.
Image Thresholding — Segment blocks from the background.
Morphological Transformations — Clean up masks after thresholding.
Contours: Getting Started — Find object outlines in binary images.
Contour Features — Measure contours and get rotated rectangles.
Perspective-n-Point (solvePnP) — Estimate object pose from image points.
Template Matching — Simple baseline for locating known visual patterns.
Hough Line Transform — Detect long straight edges and estimate orientation.
LearnOpenCV: Stereo Camera Depth Estimation With OpenCV — Helpful later if you add stereo/depth.