ROS Hand Gesture Recognition

A ROS package for estimating hand pose using MediaPipe (and Python). For more details, please check out my blog post.

Installation

  1. Install all of the following dependencies using pip or conda (a quick version check is sketched after the build step):
  • mediapipe 0.8.1
  • OpenCV 3.4.2 or later
  • TensorFlow 2.3.0 or later
  • tf-nightly 2.5.0.dev or later (only when creating a TFLite model for an LSTM)
  • scikit-learn 0.23.2 or later (only if you want to display the confusion matrix)
  • matplotlib 3.3.2 or later (only if you want to display the confusion matrix)
  2. Clone this repo into the src folder of a catkin workspace:

    $ git clone https://github.com/TrinhNC/ros_hand_gesture_recognition.git
    
  3. Build this package by running catkin build or catkin_make in a terminal in the workspace folder. For example:

    $ cd ~/catkin_ws
    $ catkin build
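
If you want to double-check the Python dependencies before building, a small check like the one below works. This is a convenience sketch rather than part of the package: it uses importlib.metadata (Python 3.8+), and the pip distribution names shown are assumptions that may differ if you installed via conda.

    # Quick check of installed dependency versions (pip distribution names;
    # e.g. OpenCV installed through conda may be registered under another name).
    from importlib.metadata import version, PackageNotFoundError

    for pkg in ("mediapipe", "opencv-python", "tensorflow", "scikit-learn", "matplotlib"):
        try:
            print(f"{pkg}: {version(pkg)}")
        except PackageNotFoundError:
            print(f"{pkg}: not installed")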
    

Run a demo

  1. Source the workspace:
    $ source ~/catkin_ws/devel/setup.bash
    
  2. Launch the image publisher (the my_cam package):
    $ roslaunch my_cam my_cam.launch
    
  3. Launch the hand pose recognition (a sketch of a node that consumes its output follows these steps):
    $ roslaunch ros_hand_gesture_recognition hand_sign.launch
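
If you want another node to react to the recognized gesture, a minimal subscriber along the lines below should work. Note that the topic name /hand_sign and the std_msgs/String message type are assumptions made for illustration; check hand_sign.launch and the recognition node for the actual names.

    #!/usr/bin/env python3
    # Minimal gesture listener. The topic name and message type are assumptions;
    # check hand_sign.launch and the recognition node for the real ones.
    import rospy
    from std_msgs.msg import String

    def on_gesture(msg):
        rospy.loginfo("Detected gesture: %s", msg.data)

    if __name__ == "__main__":
        rospy.init_node("gesture_listener")
        rospy.Subscriber("/hand_sign", String, on_gesture, queue_size=1)
        rospy.spin()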
    

Train a new hand gesture

The current package classifies only six signs (classes), which I labeled Go, Stop, Forward, Backward, Turn Right, and Turn Left (see the image below). I named them this way because they will later be converted into control signals for moving a robot in this series. If you want to change or add gestures, or the trained model does not perform well with your hand, you can collect your own data and retrain the model.

(image: the six hand gesture classes)

Two Jupyter notebooks are included in the folder src/notebooks. I used only the keypoint classification model in the current ROS package because it is sufficient for this application, but feel free to adjust it to match yours.

In the example below, I will show you how to add one more sign to the recognition. Let's say we want to add this sign ✌️ and name it "Hi".

(image: the ✌️ "Hi" sign)

First, open keypoint_classifier_label.csv in the folder src/model/keypoint_classifier. It lists all the labels (currently 6 classes); add 'Hi' to the end. Next, you need to record data and append it to the file keypoint.csv in the same folder. If you open this file, you will see that it contains 6410 lines. The first number in each line is the class ID with respect to the label list, for example, "Go" has ID 0, "Stop" has ID 1, and so on. It is followed by 42 numbers, i.e. 21 pairs, which are the coordinates of each keypoint (hand knuckle) relative to the origin at the wrist. Note that the IDs in the image below are keypoint IDs, not class IDs.

(image: MediaPipe hand keypoint IDs)
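
To get a feel for this format, the sketch below reads the first row of keypoint.csv; the relative path is an example and should be adjusted to wherever the file lives in your checkout.

    # Inspect one row of keypoint.csv (path is an example; adjust as needed).
    import csv

    with open("src/model/keypoint_classifier/keypoint.csv") as f:
        row = next(csv.reader(f))

    class_id = int(row[0])                 # 0 = Go, 1 = Stop, ... per the label file
    coords = [float(v) for v in row[1:]]
    assert len(coords) == 42               # 21 keypoints x (x, y)
    keypoints = list(zip(coords[0::2], coords[1::2]))  # (x, y) pairs relative to the wrist
    print(class_id, keypoints[:3])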

In order to record data, run the script app.py:

    $ python3 app.py

Then press k on the keyboard. You should see the line MODE: Logging Key Point show up. Next, make the target sign ✌️ with your right hand so it is visible to the camera, and press and hold the number 6 (the class ID of "Hi") with your left hand. This appends new data to keypoint.csv until you release the key. You can also press and release 6 once and check the file: it should have one new line at the end starting with 6, followed by a list of coordinates. During recording, remember to move your right hand to different positions so the dataset is varied.
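
To confirm that the new samples were actually appended, you can tally the class IDs in keypoint.csv. This is a convenience sketch; the path is an example relative to the package root.

    # Count how many samples exist per class ID in keypoint.csv.
    import csv
    from collections import Counter

    with open("src/model/keypoint_classifier/keypoint.csv") as f:
        counts = Counter(int(row[0]) for row in csv.reader(f) if row)

    print(counts)  # the new class ID 6 should appear with the samples you just recorded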

After recording for about 10-15 seconds, the data should be ready and you can stop the program. Open the notebook keypoint_classification_EN.ipynb and edit dataset, model_save_path and tflite_save_path to match your paths. Change NUM_CLASSES from 6 to 7: NUM_CLASSES = 7. Then run the notebook from beginning to end. The training is executed in cell [13] and takes around 2-3 minutes. After that you can launch my_cam.launch and hand_sign.launch to see the result.
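
For reference, the values to edit in the notebook look roughly like this; the exact paths are examples and depend on where the keypoint_classifier folder sits in your clone.

    # Configuration cell in keypoint_classification_EN.ipynb (paths are examples).
    dataset = 'model/keypoint_classifier/keypoint.csv'
    model_save_path = 'model/keypoint_classifier/keypoint_classifier.hdf5'
    tflite_save_path = 'model/keypoint_classifier/keypoint_classifier.tflite'

    NUM_CLASSES = 7  # was 6; one more class for the new "Hi" sign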
