Neural network for unicode optical character recognition
Table Of Contents
- Title page – – – – – – – – iiCertification – – – – – – – – iiiApproval page – – – – – – – ivDedication – – – – – – – – vAcknowledgement – – – – – – – viAbstract – – – – – – – – – viiTable of contents – – – – – – – ixCHAPTER ONE
- 1.0INTRODUCTION – – – – – –
- 11.1 Statement of the problem – – – –
- 51.2 Purpose of the study – – – – –
- 61.3 Aims and objectives – – – – –
- 61.4 Scope of study – – – – – –
- 81.5 Limitations of the study – – – – –
- 81.6 Definition of terms.- – – – – – 9CHAPTER TWO
- 2.0LITERATURE REVIEW – – – – – 11CHAPTER THREE3.0 Methods for fact finding and details discussions on the subject matter. – – – – – –
- 153.1 Methodologies for fact finding – – –
- 153.2 Discussions – – – – – – – 16CHAPTER FOUR4.0 Futures, Implications and challenges of the subject matter for the society – – – –
- 204.1 Futures – – – – – – – –
- 204.2 Implications – – – – – – –
- 214.3 Challenges – – – – – – – 22CHAPTER FIVE5.0 SUMMARY, RECOMMENDATION AND CONCLUSION
- 245.1 Summary – – – – – – –
- 245.2 Recommendation – – – – – –
- 255.3 Conclusion – – – – – – – 28References – – – – – – – 30
Thesis Abstract
Neural networks have shown tremendous potential in the field of Optical Character Recognition (OCR) for various scripts and languages. Unicode, a standard for consistent encoding, encompasses a wide range of characters from different writing systems globally. Recognition of Unicode characters poses a significant challenge due to the vast diversity and complexity of characters. This research focuses on developing a neural network for Unicode OCR, specifically targeting the recognition of characters from different languages and scripts encoded in Unicode. The neural network architecture consists of multiple layers, including convolutional layers for feature extraction, followed by recurrent layers for sequence modeling. The use of convolutional layers allows the network to capture spatial patterns within the characters, while recurrent layers enable the model to learn the sequential dependencies within the characters. Training the neural network involves feeding it with labeled Unicode character images to learn the mapping between the input images and their corresponding Unicode labels. The network is trained using a large dataset of annotated Unicode characters to ensure robustness and generalization. Data augmentation techniques are employed to increase the diversity of the training dataset and improve the model's performance on unseen data. The proposed neural network architecture is optimized using various techniques such as dropout regularization to prevent overfitting and gradient descent optimization for efficient training. Hyperparameter tuning is performed to enhance the model's performance and convergence speed. The network is evaluated on a separate test dataset to measure its accuracy and performance metrics such as precision, recall, and F1 score. Experimental results demonstrate the effectiveness of the neural network in recognizing Unicode characters with high accuracy across different languages and scripts. The model shows robustness to variations in font styles, sizes, and noise levels, making it suitable for real-world applications where diverse Unicode characters are encountered. The use of neural networks for Unicode OCR opens up new possibilities for multilingual text recognition and document processing. By leveraging the power of deep learning, this research contributes to advancing the field of OCR for Unicode characters and paves the way for developing more sophisticated systems capable of handling complex scripts and languages with high accuracy and efficiency.
Thesis Overview
<p>
</p><p><strong>1.0 INTRODUCTION</strong></p><p>Character is the basic building block of any language that is used to build different structures of a language. Characters are the alphabets and the structures are the words, strings and sentences.</p><p>Optical character Recognition (OCR) is the process of converting an image of text, such as a scanned project character, document or electronic fax file, into computer-editable text. The text in an image is not editable. The letters/characters are made of tiny dots (pixels) that together form a picture of text. During OCR, the software analyzes an image and converts the pictures of the characters to editable text based on the patterns of the pixels in the image. After OCR, you can expert the converted text and use it with a variety of word-processing, page layout and spreadsheet applications. OCR also enables screen readers and refreshable bralle displays to read the text contained in images.</p><p>Optical character Recognition (OCR) deals with machine recognition of characters present in an input image obtained using scanning operation. It refers to the process by which scanned images are electronically processed and converted to an editable text. The need for OCR arises in the context of digitizing tamil documents from the ancient and old era to the latest, which helps in sharing the data through the internet.</p><p>A properly printed document is chosen for scanning. It is placed over the scanner, A scanner software is invoked which scans the document. The document is sent to a program that saves it in preferably TIF, JPG or GIF format, so that the image of the document can be obtained when needed. This is the first step in OCR (Vijaya Kumar, 2001), the size of the input image is as specific by the user and can be of any length but is inherently restricted by the scope of the vision and by the scanner software length.</p><p>This is the first step in the processing of scanned image. The scanned image is checked for skewing, there are possibilities of image getting skewed with either left or right orientation.</p><p>Here, the image is first brightened and binarized the function for skew detection checks for an angle of orientation between +15 degrees and if detected than a simple image rotation is carried out till the lines match with the true horizontal axis, which produce a skew corrected image.</p><p>After pre-processing, the noise free image is passed to the segmentation phase, where the image is decomposed into individual characters.</p><p><strong>Algorithm for Segmentation</strong></p>
<br><p></p>