

Optical character recognition (OCR) is the translation of optically scanned bitmaps of printed or written text characters into character codes, such as ASCII. It is an efficient way to turn hard-copy materials into data files that can be edited and otherwise manipulated on a computer. Libraries and government agencies have long used the technology to make lengthy documents quickly available electronically. Advances in OCR technology have spurred its increasing use by enterprises. For many document-input tasks, OCR is the most cost-effective and fastest method available, and each year the technology frees acres of storage space once given over to file cabinets and boxes full of paper documents.

Before OCR can be used, the source material must be scanned with an optical scanner (and sometimes a specialized circuit board in the PC) to read in the page as a bitmap (a pattern of dots). Software to recognize the images is also required.

Our software package uses neural networks to classify the isolated handwritten characters and digits of the UJI Pen Characters Data Set. The data consists of samples of 26 characters and 10 digits written by 11 writers on a tablet PC. The characters (in standard UNIPEN format) are written in both upper and lower case, and there are two complete sets of characters per writer. Each sample should therefore be assigned to one of the 35 classes.

The ultimate objective is to build a writer-independent model for each character.
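
One way to make the writer-independent objective concrete is a leave-one-writer-out split, in which the samples of the writer used for testing never appear in training. The sketch below is only an illustration built from the counts given in this section (11 writers, two sets each, 26 upper-case letters, 26 lower-case letters and 10 digits). The record layout and the names build_index and leave_one_writer_out are hypothetical, no data files are read, and the grouping of the 62 written symbols into 35 classes is not reproduced because the text does not spell it out.

```python
from itertools import product
import string

# Counts taken from the text; everything else here is an illustrative assumption.
WRITERS = range(1, 12)   # 11 writers, numbered 1..11 for convenience
SETS = (1, 2)            # two complete character sets per writer
SYMBOLS = string.ascii_uppercase + string.ascii_lowercase + string.digits  # 62 symbols


def build_index():
    """Return one placeholder (writer, set, symbol) record per expected sample."""
    return list(product(WRITERS, SETS, SYMBOLS))


def leave_one_writer_out(records):
    """Yield (held_out_writer, train, test) with no writer shared between sets."""
    writers = sorted({writer for writer, _, _ in records})
    for held_out in writers:
        train = [r for r in records if r[0] != held_out]
        test = [r for r in records if r[0] == held_out]
        yield held_out, train, test


if __name__ == "__main__":
    records = build_index()
    for writer, train, test in leave_one_writer_out(records):
        print(f"held-out writer {writer}: {len(train)} train, {len(test)} test")
```

Under these assumptions the index holds 11 x 2 x 62 = 1364 records, and each of the 11 folds has 1240 training records and 124 test records. A classifier evaluated this way is always scored on handwriting from a writer it has not seen.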
