Welcome to Mustek Home
www.mustek.com
Search  
Highlight
Our Products
 

Bitmap vs.Vector
Image Formats


Scanner Basics
H/W Installation
S/W Installation

Scanning Tips

User's Guide
Glossary

Software Tutorials

Scanner Classroom
 

Mustek Global :

Software Tutorials

TextBridge Classic OCR Tutorial for Macintosh

Part 1: When to Use TextBridge Classic

Part 2: Linking TextBridge to the Scanner’s ISIS Driver

Part 3: A Sample Scan with TextBridge


Part 1: When to Use TextBridge Classic

TextBridge Classic is an optical character recognition (or OCR) program designed to convert scanned documents into the format necessary to edit the document within a word processing application. For the best results using TextBridge Classic, it is important to understand the program’s limitations and the most common misconceptions.

Under optimal circumstances, TextBridge Classic can recognize 90-95 percent of the characters in a document. Optimal circumstances mean that the document is a clean copy of a black and white text-only document, using standard fonts and type sizes, printed on a laser printer or professional printing press.

TextBridge Classic’s limitations:

  • It is for TEXT only. It is unable to retain or interpret a graphic whether it is a color picture or a simple line art drawing.
  • OCR programs are not for making exact duplicates of documents like a photocopier. Making exact copies should be done in programs like Color It!.
  • It is able to recognize standard fonts and type sizes. Documents written in unusual fonts, like cursive script, or that are too large or small will not be properly recognized. It is also unable to recognize handwriting.
  • Many people want to scan in forms and fill them out on their computers. Most OCR programs, like TextBridge Classic, are unable to recognize forms because they contain both text and graphic elements, such as the lines and boxes used to make the form. Also, most word processors are unable to have more than one character occupy one space at a time. There are specialized form programs that can be used with scanners available at software retailers.

Part 2: Linking TextBridge with the Scanner’s ISIS Driver

Unlike the plug-ins used for image editing programs like Color It!, TextBridge interfaces with the scanner through an ISIS driver (Image and Scanner Interface Specifications). An ISIS driver is perfectly suited for OCR work, because it is an automatic black and white driver that will make a 300 dpi, line art, full page scan without prescanning or setting anything in advance.

When TextBridge is used for the first time, follow these steps to link the ISIS driver to TextBridge:

  1. Locate and open the TextBridge folder on the Macintosh hard drive. Double-click the TextBridge icon, which resembles a bridge, to launch the program.
  2. The TextBridge settings screen will appear.

The first set of options, in the outlined area called Input, have radio buttons for File or Scanner. The choice for Scanner is grayed out.

  1. The ISIS driver is an Extension that can be found in the Chooser. Click on the Apple menu in the upper left-hand corner of the screen and select Chooser to access it.
  2. Along with any installed printer drivers, the ISIS driver will appear as an icon in the Chooser. Usually the icon will look like a small scanner sitting above the scanner model name, or it will look like the Mustek logo "M" above the name of the scanner.
  3. If the ISIS driver is not in the Chooser, then it must be reloaded from the scanner installation CD, or the Macintosh Extensions have been turned off. Reload or reboot with the proper Extension settings.
  4. If the ISIS driver is present, simply click it to select it.

    Unlike most printer drivers, nothing will appear in the right hand window except the term Scanner Connection just above the window. Click the close window box in the upper-left corner to close the Chooser and return to the TextBridge settings.
  5. On the TextBridge screen, click on File and choose Select Source. A window will appear showing the selected ISIS driver with other selections grayed out. Click the OK button.
  6. The radio button in front of Scanner under the Input section will now become active. TextBridge is ready to proceed.

Part 3: A Sample Scan with TextBridge

  1. Place a clean copy of a black and white text document face down on the scanner flush with the upper right-hand corner of the glass and close the document cover.
  2. Open the hard disk and locate the TextBridge Classic folder. Double-click the folder to open it.
  3. Click on the TextBridge Classic icon, which looks like a bridge, to launch the program.
  4. The TextBridge settings window will now appear. If Scanner is grayed out under the Input section, refer to Part 2.

The first set of options is Input. This is for choosing between scanning in a document, or opening a saved document TIF file. Since the scanner is being used, the radio button is clicked for scanner.

The next section is Output. The options here allow us to choose whether we want Text, a Page Image or Both. In most cases, only Text will be wanted, but if a scanned TIF image of the document is desired, click on Page Image or Both.

The next section is Tools. These are not chosen by default, but clicking the check box in front of them selects them. The Preview tool will show us our scanned version of the text so that we may choose the parts we want to be recognized. The Verifier tool allows TextBridge to double-check certain words to make sure they are correct.

  1. With the desired options chosen, click the Go button.
  2. The Save file screen will now appear. TextBridge will need to know where to save the finished document, what to call it and what kind of text format it should be in.
  3. The top selection is the folder or disk the document will be saved to. TextBridge will use its own folder as the default. To choose a new folder or disk, select it from the pull down menu or create a new folder using the button to the right.

    Enter a name for the document in the Save Output As: line.

    Choose a text format on the Text: line. In order to edit the document, a format must be chosen that is compatible with a word processor being used on the system. TextBridge can use the following formats: XDOC, WYSIWYG Text, Ami Pro, Dbase, DisplayWrite & DCA-RFT, Microsoft Excel, Formatted ASCII, Frame Maker, ILF, Lotus 123, MacWrite 4.x-5.0, MacWrite II, Microsoft Word (RTF), MultiMate, PCL/PostScript, WordPerfect 1.0, WordPerfect 2.1, WordPerfect DOS 5.0-5.2 and WriteNow.

    When choosing a format, use the one that is most compatible with the preferred word processor. For instance, if Microsoft Word is commonly used, then choose the rich text format Microsoft Word (RTF). If there is no preferred word processor, or if unsure about compatibility between the different formats, choose Formatted ASCII. ASCII stands for American Standard Code for Information Interchange, and it is compatible with a wide range of word processors, including the Simple Text program that comes with the Macintosh OS.
  4. Once the location, filename and format are chosen, click the Continue button.
  5. The ISIS driver will now take over and the scanner will make a pass to scan the document. After it has finished scanning, the Preview window will appear.
  6. The Preview window features an image of the scanned page and a set of tools across the top. Use the magnifying glass tool with the plus sign (+) to zoom in on the image of the document to check for clarity. If the text is crooked or upside down, or if it is hard to read, adjust the original document and then click the Rescan button. If the text is legible, click on the magnifying glass with the minus sign (-) to zoom out a few steps.
  7. Use the T tool to create text zones. Zones are borders around the text that is intended for recognition. Items such as graphics or unwanted text will not require zones. To zone the text, click on the T and then move the cursor to the corner of the text to be zoned. Hold down the mouse button and drag the outline around the entire area of text and then release the mouse button. The area is now zoned with a number as shown here.

    The large dark area is a picture, which TextBridge would be unable to process, and is not zoned. Text can be zoned in any order, and that order will determine how it appears on the final copy.
  8. Once all of the desired text is zoned, click on the Continue button. TextBridge will begin the recognition process and then the Verifier window will appear.

  1. The Verifier allows us to double check words that TextBridge is unsure of, or that it wants verification of. The level of verification is set with the slide bar next to the word Words:. Moving the bar all the way to the darkest line means that TextBridge will verify every word. Moving it to the smallest line means TextBridge will only verify those words it doesn’t know at all, or that have missing characters (represented with a tilde ~).
  2. The word to be verified appears in the top text window with the corresponding copy from the original text appearing in the lower window as a highlighted block. If TextBridge has not properly recognized the word, type the correction into the upper window and click Continue. If it has recognized the word correctly, make no changes and click Continue.
  3. Once all of the questioned words have been verified, TextBridge will prompt for more pages. If there are multiple pages, click Continue and repeat steps 9-14 until each page is complete. If this is the final page, click the End button.
  4. TextBridge will now convert the text to the selected format and save the file. The TextBridge settings screen will reappear. Click the File menu and select Quit to close the TextBridge program. Locate the document and double-click it, or open it in the appropriate word processing program to begin editing the document.

Mustek, Inc.


Copyright, 2000, Mustek, Inc. All rights reserved.

Scanner Class Room Class Room