Part 2:
Using the Instant Access OCR
Part 3: A
Sample Scan with TextBridge
Part
1: When to Use TextBridge Classic
TextBridge Classic is an
optical character recognition (or OCR) program designed to
convert scanned documents into the format necessary to edit
the document within a word processing application. For a
complete explanation of how OCR programs work, consult the
document Understanding OCR, but for
the best results using TextBridge Classic it is important to
understand the programs limitation and the most common
misconceptions.
Under optimal circumstances,
TextBridge Classic can recognize 90-95 percent of the
characters in a document. Optimal circumstances mean that the
document is a clean copy of a black and white text-only
document, using standard fonts and type sizes, printed on a
laser printer or professional printing press.
TextBridge Classics
limitations:
- It is for TEXT only. It is
unable to retain or interpret a graphic whether it is a
color picture or a simple line art drawing.
- OCR programs are not for
making exact duplicates of documents like a photocopier.
Making exact copies should be done in programs like iPhoto
Plus.
- It is able to recognize
standard fonts and type sizes. Documents written in
unusual fonts, like cursive script, or that are too large
or small will not be properly recognized.
- Many people want to scan in
forms and fill them out on their computers. Most OCR
programs, like TextBridge Classic, are unable to recognize
forms because they contain both text and graphic elements,
such as the lines and boxes used to make the form. Also,
most word processors are unable to have more than one
character occupy one space at a time. There are
specialized form programs that can be used with a
TWAIN-compliant scanner available at software retailers.
Part
2: Using the Instant Access OCR
The Instant Access OCR feature
allows TextBridge to interact directly with a word processor.
When this icon is on the Windows taskbar, TextBridge can be
accessed from any word processor that has been registered with
the Instant Access OCR program. Many popular word processors
are already registered with the Instant Access OCR, and they
can be viewed by clicking on the icon. To register a word
processor not found in the Registered Application list, follow
these steps:
- The Instant Access OCR needs
to be running on the Windows taskbar. In most cases it
appears on the taskbar when Windows is started, but if it
does not appear it can be accessed by clicking on Start
and selecting Programs and TextBridge Classic, then
clicking on the icon for Instant Access OCR.
- Once Instant Access OCR is
on the taskbar, open the word processor that is going to
be registered.
- Click the Instant Access OCR
icon on the taskbar. It will open to reveal a list of
Available Applications and Registered Applications. The
word processor and any other running applications will be
under the Available Applications heading.
- Highlight the name of the
word processor and click the Add button. The word
processor will now join the others on the list of
Registered Applications.
- Press the OK button and the
Instant Access OCR returns to the taskbar.
Part
3: A Sample Scan with TextBridge
While the sample scan with
TextBridge could be utilized on any word processor correctly
registered with the Instant Access OCR as in Part Two above,
this tutorial is going to use the Windows Wordpad. The steps
used here would be nearly identical in another word processor.
- With the Instant Access OCR
running on the Windows taskbar, click on Start, select
Programs, Accessories and Wordpad.
- Click on the File menu in
Wordpad and select TextBridge OCR. If TextBridge OCR is
not on the File menu, then the Instant Access OCR was not
running before the Wordpad was opened.
- The TextBridge welcome
screen appears and then becomes the main TextBridge
window.
- A pair of the icons depicts
a scanner and a file folder.
If the scanner is highlighted, then the source has already
been selected. Proceed to step
5. If the file folder is highlighted, then the scanner
needs to be selected now. To select the scanner:
- Click on File and choose
Select Scanner/Source.
- On the list of driver
types, choose TWAIN.
- A list of scanners will be
displayed. Choose the appropriate Mustek driver for the
scanner being used and click Select.
- It is not necessary to
Test the scanner. Click the End button to return to the
main TextBridge window.
- The scanner icon should
now be highlighted instead of the file folder.
- Click the
Preview Image button at the right-hand end of the row
of icons.
- Click the Go button. This
will launch the scanners TWAIN interface.
- Place a clean copy of a
black and white text document face down on the scanner
flush with the upper right-hand corner of the glass.
- Press the Prescan button on
the scanner interface. The document will be prescanned and
displayed in the preview window.
- Make sure the dotted line
crop frame is surrounding the document. Click on the edges
of the crop frame to drag them in/out or up/down until
they are even with the sides of the document.
- Set the scanner to the Line
Art mode (TextBridge is unable to recognize color or gray
scale) and the resolution to 300 or 400 dpi. Click the
Scan button to scan the document into TextBridge.
- TextBridges Preview Image
screen will appear with a miniature version of the
document. The magnifying glass icons will zoom in on the
document to verify that the text is legible. The Zone tool
is a blue T that is used to draw borders or
"zones" around blocks of text intended for
recognition.
After selecting the Zone tool, move the cursor to the
upper corner of the first section of text that is to be
zoned. Hold down the mouse button and move the mouse to
the right and down until an outline is drawn around the
section of text. Once the entire section is outlined,
release the mouse button. The zone will be a dark box with
a number around the text.
- Text can be zoned in any
order, which will determine the order of recognition.
Items such as graphics or unwanted text will not require
zones as shown above.
- Once the text is zoned as
desired, press the Go button to continue.
- TextBridge will prompt for
more pages. If there are multiple pages, click Continue
and repeat steps
5 through 13 until each page is complete. If this is
the final page, click the End button.
- TextBridge now recognizes
the text and converts it into a text file, which is then
displayed in the Wordpad. The new version can be edited
and saved as if it were typed from scratch. If the
document has many misspellings or odd characters, it may
not have been an appropriate document. Check Part
1 of this tutorial or Understanding
OCR.