Handwritten Digit Recognition

Introduction

Every human being has unique handwriting. While some handwriting is easy to understand, others may not be as legible. In fact, there is a wide variation in how individuals write letters and digits. To address this variation, we require a system capable of recognizing handwritten digits, regardless of how differently people write them.

This project focuses on handwritten digit recognition, which involves detecting and identifying digits written by various individuals. The goal is to create a machine learning application that can accurately interpret handwritten digits.

Applications and Advantages of Handwritten Digit Recognition

Handwritten digit recognition has a wide range of applications across various industries, such as:

Finance
Retail Industry – for fast processing of documents
Insurance and Banking Sectors
Healthcare
Logistics Companies

It is a crucial tool that converts handwritten digits into machine-readable data, streamlining processes in several sectors.

Dataset

To detect handwritten digits, a large and diverse dataset is required. The MNIST dataset is commonly used for this purpose as it provides a collection of handwritten digits in various formats.

MNIST refers to the Modified National Institute of Standards and Technology database.
This dataset consists of 60,000 training examples and 10,000 testing examples.
The dataset contains 4 files:
- Training set images
- Training set labels
- Test set images
- Test set labels

Implementation

Steps to implement handwritten digit recognition:

Download the Dataset:
- First, download the MNIST dataset file, which is in ZIP format.
Extract the Dataset:
- After downloading, extract the dataset, which contains the files necessary for training and testing.
Organize the Files:
- Inside the main project folder, you will find six different folders corresponding to six machine learning algorithms (both supervised and unsupervised). These include algorithms such as:
  - SVM
  - Logistic Regression
  - Linear Regression
  - K-Nearest Neighbors (KNN)
  - Random Forest, etc.
Prepare the Dataset for Each Algorithm:
- Copy and paste the MNIST dataset into each of the six algorithm folders, as the dataset will be used for training and testing all models.
Run the K-Nearest Neighbors (KNN) Algorithm:
- Open the knn folder and run the algorithm by opening the command prompt within this folder.
- In the command prompt, run the following command: bash python knn.py
Model Execution:
- The script will execute and display the confusion matrix for the training data, validation data, and testing data. It will also display the accuracy for each dataset.
Repeat for Other Algorithms:
- Repeat steps 5 and 6 for all six algorithms (SVM, Logistic Regression, Random Forest, etc.).

Note:

If you do not want the data to display in the command prompt, you can comment out specific lines (10, 11, 103, 104) in the .py files.

Evaluate Accuracy:
- Compare the accuracy of each algorithm to determine the best one for handwritten digit recognition.

Results

Algorithm	Accuracy
K-Nearest Neighbors (KNN)	95.80%
SVM	98.74%
Random Forest	96.43%
Logistic Regression	83.15%
Linear Regression	21.99%

Analysis

By analyzing the results, we can conclude the following:

Algorithm with the highest accuracy: SVM (98.74%)
Algorithm with the lowest accuracy: Linear Regression (21.99%)
- Note: The accuracy of Linear Regression can be improved using Logistic Regression.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
1. K Nearest Neighbors		1. K Nearest Neighbors
2. SVM		2. SVM
3. Random Forest Classifier		3. Random Forest Classifier
4. Linear Regression		4. Linear Regression
5. Logistic Regression		5. Logistic Regression
MNIST_Dataset_Loader		MNIST_Dataset_Loader
README.md		README.md
dataset.zip		dataset.zip

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Handwritten Digit Recognition

Introduction

Applications and Advantages of Handwritten Digit Recognition

Dataset

Implementation

Steps to implement handwritten digit recognition:

Note:

Results

Analysis

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Handwritten Digit Recognition

Introduction

Applications and Advantages of Handwritten Digit Recognition

Dataset

Implementation

Steps to implement handwritten digit recognition:

Note:

Results

Analysis

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages