WO2000003321A1 - Method and apparatus for processing a document containing confidential information - Google Patents

Method and apparatus for processing a document containing confidential information Download PDF

Info

Publication number
WO2000003321A1
WO2000003321A1 PCT/US1999/015348 US9915348W WO0003321A1 WO 2000003321 A1 WO2000003321 A1 WO 2000003321A1 US 9915348 W US9915348 W US 9915348W WO 0003321 A1 WO0003321 A1 WO 0003321A1
Authority
WO
WIPO (PCT)
Prior art keywords
document
worker
section
file
ofthe
Prior art date
Application number
PCT/US1999/015348
Other languages
French (fr)
Inventor
Jay S. Walker
James A. Jorasch
Magdalena Mik
Original Assignee
Walker Digital, Llc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US09/112,131 external-priority patent/US7523045B1/en
Application filed by Walker Digital, Llc filed Critical Walker Digital, Llc
Priority to AU52089/99A priority Critical patent/AU5208999A/en
Publication of WO2000003321A1 publication Critical patent/WO2000003321A1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/10Office automation; Time management

Definitions

  • the present invention relates to methods and apparatus for processing documents, and more specifically to methods and apparatus that facilitate the processing of documents containing possibly confidential information.
  • OCR Optical Character Recognition
  • OCR is often inaccurate. Any character may assume a large number of shapes, especially if the character is handwritten. To circumvent this disadvantage, some OCR systems are limited to recognizing known typefaces and sizes and cannot recognize handwriting at all. Other OCR systems use sophisticated algorithms to learn new typefaces and sizes, and may thus increase recognition accuracy somewhat. However, even the best OCR system is unable to make the common sense judgments that allow humans to recognize poorly recognizable characters and words from their context. Consequently, OCR remains unable to consistently generate accurate text files. In fact, it can take so long to proofread and correct a text file created with OCR that it would be just as easy to have a human manually create a text file by typing the text.
  • a controller receives a document from a customer.
  • the document may be an image generated by scanning a paper document.
  • One or more sections of the document that are indicated by predetermined indicia are determined. Different sections may include different types of information. For example, one section of a document might include information that identifies a person, while another section of the document might include confidential information about that person.
  • the sections are transmitted to one or more workers who perform corresponding data entry. Separating the sections can thus preserve the confidential nature of the document.
  • a file is subsequently received from each worker. The file corresponds to data entry performed on the section of the document.
  • the controller determines the customer associated with the file and transmits the file to the customer. Payment is then rendered, directly or indirectly, from the customer to the workers. BRIEF DESCRIPTION OF THE DRAWINGS
  • FIG. 1 is a schematic illustration of an apparatus for processing a document in accordance with the present invention.
  • FIG. 2 is a schematic illustration of a controller of the apparatus of FIG. 1.
  • FIG. 3 is a schematic illustration of a customer database of the controller of FIG. 2.
  • FIG 4 is a schematic illustration of a worker database of the controller of FIG. 2.
  • FIG. 5 is a schematic illustration of a document database of the controller of FIG. 2.
  • FIG. 6 is a schematic illustration of a section database of the controller of FIG. 2.
  • FIG. 7 is a flow chart illustrating a method for processing a document in accordance with the present invention.
  • FIG. 8 is a data flow diagram illustrating the transfer of information within the apparatus of FIG. 1 during a portion of the method of FIG. 7.
  • FIGS. 9 A, 9B, 9C and 9D are images that include indicia for identifying sections.
  • FIG. 10 is an exemplary image including indicia in the form of boundary lines that define the perimeters of two sections.
  • FIGS. 11A and 1 IB are schematic illustrations of sections of a document.
  • FIG. 12 is a flow chart illustrating an embodiment of a portion of the method of FIG. 7.
  • FIG. 13 is a data flow diagram illustrating the transfer of information within the apparatus of FIG. 1 during another portion of the method of FIG. 7.
  • FIG. 14 is a flow chart illustrating an embodiment of another portion ofthe method of FIG. 7.
  • FIG. 15 is a schematic illustration of a form for entering data.
  • FIG. 16 is a schematic illustration of a form database.
  • FIG. 17 is a flow chart illustrating a method for permitting a section to be processed by a plurality of workers.
  • FIG. 18 is a schematic illustration of a record of another embodiment ofthe section database ofthe controller of FIG. 2.
  • FIG. 19 is a flow chart illustrating a method for processing a plurality of documents in accordance with processing criteria for the plurality.
  • the present invention allows a business or other entity to automatically locate and communicate with one or more workers who in turn create computer-based information (e.g. text files or data base entries) for the business from a paper document.
  • computer-based information e.g. text files or data base entries
  • the business need not consume resources in locating and hiring appropriate workers.
  • the business also remains unburdened by the administrative costs associated with hiring and retaining such workers in an employment capacity.
  • the present invention further allows information derivable from the paper document to remain confidential even if the document is being processed by a worker who may be untrustworthy. Thus, even sensitive documents such as medical records may be processed without compromising their confidential information.
  • the present invention likewise benefits people that can function as "workers” (described below) even without possessing specialized skills.
  • a market is created for such workers, who may work at times they choose and as much as they choose.
  • very small amounts of free time e.g. minutes
  • people can earn money during times such as lunch breaks or other periods of inactivity. Workers may furthermore maintain their anonymity, should they desire.
  • an apparatus 100 comprises a controller 110 that is in communication with a customer device 120 and with worker devices 130 and 135.
  • the controller 110 may communicate with the customer device 120 and the worker devices 130 and 135 via an appropriate network such as the Internet.
  • Each ofthe controller 110, the customer device 120, and the worker devices 130 and 135 may comprise computers, such as those based on the Intel® Pentium® microprocessor, that are adapted to communicate via the Internet (e.g. via a modem). Any number of customer devices and worker devices may be in communication with the controller 110.
  • the controller 110 facilitates communication between a customer (e.g. a business) using the customer device 120 and one or more workers using worker devices such as the worker devices 130 and 135.
  • the communication may be performed while maintaining anonymity of each party, and in fact the present invention can eliminate the need for a party to identify the other party with which it communicates. Accordingly, the parties may form a relationship that allows confidential information to be handled efficiently and without compromise.
  • the controller 110 comprises a processor 202, such as the Intel® Pentium® microprocessor.
  • the processor 202 is in communication with a data storage device 204, such as an appropriate combination of magnetic, optical and/or semiconductor memory.
  • the data storage device 204 may comprise one or more of a ROM, RAM and hard disk drive.
  • the processor 202 and the data storage device 204 may each be (i) located entirely within a single computer or other computing device; (ii) connected to each other by a remote communication medium, such as a serial port cable, telephone line or radio frequency transceiver; or (iii) a combination thereof.
  • the controller 110 may comprise one or more computers that are connected to a remote server computer for maintaining databases.
  • the data storage device 204 stores a program 210 for controlling the processor 202.
  • the processor 202 performs instructions ofthe program 210, and thereby operates in accordance with the present invention, and particularly in accordance with the methods described in detail herein.
  • the program 210 furthermore includes program elements that may be necessary, such as an operating system and "device drivers" for allowing the processor 202 to interface with computer peripheral devices. Appropriate device drivers and other necessary program elements are known to those skilled in the art, and need not be described in detail herein.
  • the storage device 204 also stores (i) a customer database 220, (ii) a worker database 230, (iii) a document database 240, and (iv) a section database 250.
  • the databases 220, 230, 240 and 250 are described in detail below and depicted with exemplary entries in the accompanying figures.
  • the schematic illustrations and accompanying descriptions of the databases presented herein are exemplary arrangements for stored representations of information. A number of other arrangements may be employed besides those suggested by the tables shown.
  • the illustrated entries ofthe databases represent exemplary information, and those skilled in the art will understand that the number and content ofthe entries can be different from those illustrated herein. Referring to FIG.
  • a table 300 represents an embodiment ofthe customer database 220 (FIG. 2).
  • the table 300 includes entries 302 and 304, each defining a customer that interacts with the controller 110 (FIG. 1) to have documents processed. It will be understood by those skilled in the art that the table 300 may include any number of entries.
  • the table 300 also defines fields for each ofthe entries 302 and 304. The fields specify (i) a customer identifier 320 that uniquely identifies the customer, (ii) an electronic mail ("email") address 322 for communication with the customer, (iii) a customer name 324, and (iv) a financial account identifier 326 that indicates a financial account the customer may use to pay for services.
  • a table 400 represents an embodiment ofthe worker database 230 (FIG. 2).
  • the table 400 includes entries 402, 404 and 406, each defining a worker that interacts with the controller 110 (FIG. 1) to process documents. It will be understood by those skilled in the art that the table 400 may include any number of entries.
  • the table 400 also defines fields for each ofthe entries 402, 404 and 406.
  • the fields specify (i) a worker identifier 420 that uniquely identifies the worker, (ii) an email address 422 for communication with the worker, (iii) a worker name 424, (iv) a financial account identifier 426 that indicates a financial account that the worker may use to receive payment, (v) job skills 428 ofthe worker, (vi) a quality rating 430 ofthe worker, (vii) a speed rating 432 ofthe worker, and (viii) a fee charged 434 by the worker for its services.
  • a worker identifier 420 that uniquely identifies the worker
  • an email address 422 for communication with the worker e.g., a email address 422 for communication with the worker
  • a worker name 424 e.g., e., a email address 422 for communication with the worker
  • a financial account identifier 426 that indicates a financial account that the worker may use to receive payment
  • job skills 428 ofthe worker e.g., a quality rating 432 of
  • a table 500 represents an embodiment ofthe document database 240 (FIG. 2).
  • the table 500 includes entries 502 and 504, each defining a document that a customer needs to have processed by a worker. It will be understood by those skilled in the art that the table 500 may include any number of entries.
  • the table 500 also defines fields for each ofthe entries 502 and 504.
  • the fields specify (i) a document identifier 520 that uniquely identifies the document, (ii) the customer 522 for which the document is to be processed, (iii) a job code 524 that identifies the type of processing to be performed on the document, (iv) a price 526 to pay for processing the document, (v) how to receive payment 528 for processing performed, (vi) a required worker rating 530, (vii) a time 532 by which the document must be processed, (viii) other required characteristics 534 ofthe worker (if any), (ix) whether sections ofthe document must be processed by different workers 536, (x) section identifiers 538 that uniquely identify sections ofthe document that are to be processed by one or more workers, and (xi) whether and when the document processing has been completed 540.
  • a table 600 represents an embodiment ofthe section database 250 (FIG. 2).
  • the table 600 includes entries 602, 604, 606, 608 and 610, each defining a section that is to be processed by a worker. It will be understood by those skilled in the art that the table 600 may include any number of entries.
  • the table 600 also defines fields for each ofthe entries 602, 604, 606, 608 and 610.
  • the fields specify (i) a section identifier 620 that uniquely identifies the section, (ii) a document identifier 622 ofthe document from which the section originated, (iii) a worker identifier 624 ofthe worker to which the section was sent, (iv) when the section was sent to the worker 626, (v) when the section was completed by the worker 628, and (vi) a price 630 the worker charges for processing the section.
  • a method 700 for processing a document includes receiving a document from a customer (step 702).
  • the step 702 may comprise receiving an electronic image, such as a JPEG ("Joint Photographic Experts Group") file or GIF ("Graphics Interchange Format"), of a paper document.
  • an image may be generated by a device known as a "scanner” (e.g. a flatbed scanner or drum scanner) and sent via electronic mail to the controller 110 (FIG. 1).
  • a scanner e.g. a flatbed scanner or drum scanner
  • Other methods for generating the image and sending the image to the controller 110 will be understood by those skilled in the art.
  • the step 702 may comprise receiving a paper document and then generating an electronic document (e.g. one or more images) from the paper document.
  • the controller 110 may include a scanner that generates an electronic document from the received paper document.
  • the document is divided into one or more sections (step 704).
  • each section comprises an image that is a portion of an image ofthe document.
  • each section represents a portion ofthe document.
  • one section of a document might include information that identifies a person, while another section ofthe document might include confidential information about that person. Providing these two sections to two workers would thus keep the person anonymous to the worker processing the section including the confidential information.
  • the sections ofthe document are transmitted to one or more workers
  • a section may be transmitted to a worker in a number of ways, such as via email or via a web site.
  • the workers receive the sections and perform corresponding data entry.
  • the term "data entry" refers to the process of writing data into computer memory.
  • Such computer memory may be volatile memory (e.g. RAM) or nonvolatile memory (e.g. disk subsystems or flash memory).
  • data entry includes (i) creating text from an image of a paper document by reading printing (e.g. printed characters) on the image and manually typing the text corresponding to the printing, (ii) proofreading and amending text, (iii) translating text (e.g.
  • the worker transmits back to the controller 110 (FIG. 1) a file that corresponds to the data entry. For example, the worker may transmit a text file that represents words that were printed in the section the worker received.
  • the controller 110 receives files from each worker (step 708) and transmits the files to the customer (step 710).
  • the controller 110 (FIG. 1) receives an amount of payment from the customer (step 712), and transmits appropriate amounts of payment to each worker (step 714). Alternatively, the controller 110 may initiate payment from the customer directly to each worker.
  • a data flow diagram 800 illustrates the transfer of information among the devices ofthe apparatus 100 (FIG. 1). The illustrated transfer occurs during a first stage (i.e. steps 702, 704 and 706) ofthe method 700.
  • the customer device 120 transmits an electronic document (e.g. an image of a paper document) to the controller 110, which in turn divides the electronic document into sections.
  • the controller 110 then transmits the sections to the worker devices 130 and 135.
  • An image of a document may be divided into sections in a number of ways. Sections ofthe document may be defined by one or more predetermined indicia.
  • the customer may include indicia in an image transmitted to the controller 110.
  • the document may be partially preprinted with the indicia, or the customer may add indicia to an electronic image ofthe document that it generates from a paper document.
  • the indicia may be added after the customer provides the document.
  • the image may be marked with the indicia by the controller 110.
  • documents may be marked with indicia in accordance with a predetermined process, and each customer may have one (or more) associated predetermined processes.
  • the controller 110 may add predetermined indicia to all documents received from a first predetermined customer.
  • all documents received from the first predetermined customer would be divided into sections in accordance with the same process. This can be especially advantageous for businesses having a large number of documents ofthe same type (e.g. medical claim forms).
  • FIGS. 9 A, 9B, 9C and 9D illustrate different ways to identify sections with indicia.
  • an image 900 includes indicia 902 that are in the form of a demarcation line.
  • the indicia 902 thereby define two sections 904 and 906.
  • it may be predetermined, for example, that the top section 904 contains information that identifies a person and the bottom section 906 contains confidential information about that person.
  • an image 920 includes indicia 922, 924 and 926, each in the form of a boundary line that defines the perimeters of one of three sections 928, 930 and 932, respectively.
  • more than one indicia can indicate a plurality of disjointed portions ofthe same section.
  • the indicia 922 and 926 may indicate two disjointed portions ofthe same section, while the indicia 924 can indicate another, different section.
  • the indicia 922 and 926 may both be ofthe same type (to denote portions ofthe same section) while the indicia 924 are of a different type.
  • the indicia 922 and 926 may both be of an identical color, width or other style, which is different from that ofthe indicia 924.
  • an image 940 includes indicia 942 and 944.
  • the indicia 942 are in the form of a demarcation line that defines two sections 946 and 948.
  • the indicia 944 may differentiate between the sections by identifying the section in which the indicia 944 is located.
  • the indicia 944 may indicate that the section 948 (which contains the indicia) contains information that identifies a person. In one embodiment, it would then be assumed that the remaining section 946 contains confidential information about that person.
  • an image 960 includes indicia 962, 964 and 966.
  • the indicia 962 are in the form of a demarcation line that defines two sections 968 and 970.
  • the section 968 includes indicia 964 in the form of a bar code
  • the section 970 includes indicia 966 that are also in the form of a bar code.
  • the indicia 964 and 966 may identify the sections 968 and 970 respectively.
  • the indicia 964 may indicate that the corresponding section (the section 968) contains information that identifies a person
  • indicia 966 may indicate that the corresponding section (the section 970) contains confidential information about that person.
  • an image 1000 of a document includes indicia 1002 and 1004, each in the form of a boundary line that defines the perimeters of two sections 1012 and 1014, respectively.
  • the section 1012 contains information that identifies a person (i.e. a name, social security number and address).
  • the section 1014 contains confidential information about that person (i.e. medical information).
  • Sections ofthe document may alternatively be identified as predetermined locations within a document.
  • the top half of a document may be a first section and the bottom half may be a second section.
  • documents may be divided into sections in accordance with a predetermined set of locations, and each customer may have one (or more) associated predetermined sets of locations.
  • all documents received from a first customer may be divided into three identically sized sections: a bottom section, a middle section and a top section.
  • Sections may be mutually exclusive so that no section overlaps another section. Conversely, sections may overlap so that a portion of a document is included in more than one section. Furthermore, a document may include one or more sections, but there may be portions of a document that are not included in any section. Such portions are thus not provided to workers. For example, a portion of a document that includes a standard letterhead logo may need no processing. In addition, as described above a section may include two disjointed portions of a document.
  • the image 1000 also includes an identifier 1020 that is in the form of a set of characters. Although in FIG. 10 the identifier 1020 is in the form of a set of characters, in other embodiments the identifier may be in another form, such as a bar code.
  • the identifier 1020 identifies processing criteria (the customer's requirements and/or preferences for processing the document).
  • the identifier may indicate, for example, an amount of payment to be paid for processing the document, billing information, a job code that specifies the type of processing required, a required rating ofthe worker(s), an indication of whether sections ofthe document can go to a single worker, a time when completion by the worker is required, and required characteristics ofthe worker.
  • the identifier 1020 on the image 1000 may indicate the document identifier described above with reference to FIG. 5.
  • the document identifier may be used to access further information from the document database 240 (FIG. 2) from which processing criteria may be derived.
  • the identifier may represent the processing criteria. For example, if the identifier is a set of characters, the first two characters may indicate billing information, the second three characters may indicate the type of processing required and the final eight characters may indicate a time when completion by the worker is required.
  • the identifier is a bar code that represents a sequence of digits, the various digits may represent various types of processing criteria, as would be apparent to one skilled in the art.
  • FIGS. 11 A and 1 IB each illustrate a section of a document that may be sent to a worker.
  • the section 1100 corresponds to the section 1012 of FIG. 10
  • the section 1150 corresponds to the section 1014 of FIG. 10.
  • the section 1100 includes a bar code 1102 that indicates worker criteria for the section 1100.
  • the section 1150 includes a bar code 1152 that indicates worker criteria for the section 1150.
  • the worker criteria of a section specify requirements and/or preferences for processing the document that are provided to the worker.
  • the worker criteria may indicate a time of completion for the worker.
  • the worker criteria are based, at least in part, on the processing criteria received from the customer.
  • FIG. 12 illustrates a method 1200 for processing a document in accordance with an embodiment ofthe present invention.
  • the method 1200 generally describes an embodiment ofthe steps 702, 704 and 706 ofthe method 700 (FIG. 7).
  • a paper document is received from a customer (step 1202) and a corresponding electronic document (e.g. an image ofthe paper document) is generated (step 1204).
  • an electronic document is received (e.g. via email).
  • Processing criteria associated with the image are also received (step 1206).
  • the processing criteria indicate the customer's requirements and/or preferences for processing the document.
  • the processing criteria may comprise a job code that identifies the type of data entry to be performed on the document.
  • the processing criteria may comprise a requirement that a section is processed by two or more workers, so that one worker can "proofread" the work of another, or so that inaccurate processing performed by any one or more workers may be more readily detected.
  • the processing criteria may be machine-readable (e.g. in the form of a bar code) to automate reading and interpreting the processing criteria.
  • the processing criteria may be text or other indicia that is not interpreted by the controller 110, but that is read by human personnel who direct the operations ofthe controller 110 accordingly.
  • the processing criteria may be text or other indicia that is merely made available to be read by workers.
  • the processing criteria may be received separately from the document, or may be incorporated into the image ofthe document (e.g. as a bar code or other indicia on the document).
  • the processing criteria may be received substantially simultaneously with the paper document. Typically, separate processing criteria would accompany each paper document.
  • processing criteria may be received from a customer, and the processing criteria are then applied to all documents received from that customer. In such an embodiment, the customer may, for example, desire that only one type of data entry ever be performed.
  • the controller 110 determines a first section ofthe image that is indicated by first predetermined indicia (step 1208), and determines a second section ofthe image that is indicated by second predetermined indicia (step 1210).
  • the first predetermined indicia may be the second predetermined indicia.
  • the indicia 902 are in the form of a demarcation line that defines two sections 904 and 906. Thus, the same indicia 902 indicate both sections 904 and 906.
  • the controller 110 selects a first worker and a second worker (steps 1212 and 1214).
  • a worker is selected if the type of work that the worker performs corresponds to the job code included in the processing criteria. For example, if the job code identifies that a document must be translated from German to English, then a worker having the requisite job skills (i.e. German language translation) is selected.
  • Various methods for selecting workers are described below.
  • the first section is transmitted to the first worker (step 1216) and the second section is transmitted to the second worker (step 1218).
  • a first identifier that identifies the document is transmitted to the first worker (step 1220), and a second identifier that also identifies the document is transmitted to the second worker (step 1222).
  • the first identifier and the second identifier may be identical but need not be.
  • the first identifier may be "Dl " and the second identifier may be "El ", yet each may refer to the same document (e.g. the last character may uniquely identify the document).
  • the first identifier and the second identifier may identify a customer instead of the document.
  • the sections would still correspond to a customer, and each corresponding file received in response to a transmitted section would correspond to a customer.
  • the worker criteria may include the above-described identifiers. Furthermore, the worker criteria may be transmitted and received in a manner similar to that described herein with regard to the processing criteria.
  • the first identifier and the second identifier may be transmitted separately, or may be incorporated into the sections themselves.
  • the identifiers may comprise a bar code, and the controller 110 could label each section with an identifier in the form of a bar code. Sections and identifiers may be transmitted in several ways, such as via email or via a site on the World Wide Web.
  • the above-described step of transmitting a first section to a first worker and a second section to a second worker may comprise transmitting the first section to a worker in a first group of workers and the second section to a worker in a second group of workers.
  • the first group of workers are qualified to receive a first type of section (e.g. a section that includes information that identifies a person) and that the second group of workers are qualified to receive a second type of section (e.g. a section that includes confidential information about that person).
  • a first type of section e.g. a section that includes information that identifies a person
  • a second type of section e.g. a section that includes confidential information about that person
  • a data flow diagram 1300 illustrates the transfer of information among the devices ofthe apparatus 100 (FIG. 1). The illustrated transfer occurs during a second stage (i.e. steps 708 and 710) of the method 700, which is after that illustrated by FIG. 8.
  • the worker devices 130 and 135 each transmit a file to the controller 110.
  • the controller 110 then transmits the files to the customer device 120.
  • the files may be transmitted to the customer device 120 separately, or the controller 110 may combine the files and transmit the combined file to the customer device 120.
  • a method 1400 for processing a document includes receiving a first file from a first worker (step 1402).
  • the first file corresponds to data entry performed on a first section that was previously transmitted to the first worker.
  • the first section is derived from a document and may be a portion ofthe document.
  • the first file may be a text file of manually typed text.
  • the first file may be data that is created by the worker via a form and stored according to various fields ofthe form.
  • the controller 110 may also receive a first identifier that identifies a first document (from which the first section was derived).
  • the first identifier may be a document identifier as illustrated in field 520 (FIG. 5).
  • the first identifier may also be a section identifier as illustrated in the field 620 (FIG. 6), which would, in turn, indicate a document, as is apparent from the field 622 (FIG. 6).
  • the first identifier may be received substantially simultaneously with the first file.
  • the first identifier may be received immediately before or immediately after the first file is received, thereby associating the first file with the first identifier.
  • the first file may also include the first identifier, thereby allowing the document to be identified from the file. Other methods for identifying the first document from which the first section was derived will be understood by those skilled in the art.
  • a second file is received from a second worker (step 1404).
  • the second file corresponds to data entry performed on a second section that was previously transmitted to the second worker.
  • a second identifier also identifies a second document from which the second section was derived.
  • the second file and the step of receiving the second file are analogous to the description above ofthe step 1402.
  • the controller 110 determines whether the first document and the second document are equivalent (step 1406). If so, then the first section and the second section are derived from the same document, and accordingly the first file and the second file represent data entry performed in connection with the same document. Consequently, in one embodiment both the first file and the second file are destined for the same customer.
  • the customer that is associated with both the first file and the second file is determined (step 1408). For example, referring again to FIG. 5, each document corresponds to a customer, as indicated by fields 520 and 522 ofthe document database 240. Thus, the customer may be determined from the document database 240.
  • the first file and the second file are transmitted to the customer (step 1410).
  • the files may be transmitted to the customer device 120 separately, or the controller 110 may combine the files and transmit the combined file to the customer device 120.
  • the workers are rated based on their work (step 1412). For example, the first worker is rated based on the first file and possibly based on a previous rating, if any. Rating a worker is described in the parent application, U.S. Patent Application No. 09/112,131, entitled “METHOD AND APPARATUS FOR A CRYPTOGRAPHICALLY-ASSISTED COMMERCIAL NETWORK SYSTEM DESIGNED TO FACILITATE AND SUPPORT EXPERT-BASED COMMERCE", filed on July 8, 1998.
  • a document may include one or more sections that are processed. Accordingly, in one embodiment, it may be desirable to determine the number of sections in a document, and all co ⁇ esponding files are transmitted only after a predetermined number of files (e.g. the number of sections) have been received. Alternatively, each file may be transmitted to the customer when it is received.
  • a form 1500 illustrates a form that may be used to allow a worker to enter data based on the section 1100 (FIG. 11) and to store that data according to fields ofthe form 1500.
  • the form 1500 includes a form field 1502 that permits entry of a name, a form field 1504 that permits entry of a social security number and a form field 1506.
  • Forms may be sent along with a corresponding document from the customer to the controller 110 (FIG. 1) and in turn to the worker.
  • Forms may also be provided to a worker via a web site. For example, a worker may be provided with a "URL" (Uniform Resource Locator) that identifies a web site or other computer-based communication channel.
  • URL Uniform Resource Locator
  • HTML forms are well known to those skilled in the art, as are proprietary form formats such as PDF (portable document format) sold by Adobe and JetForm.
  • the form may be accessible through a web site after the worker enters an access code at the web site.
  • the access code may serve as authorization that the worker is allowed to access a form.
  • the access code may additionally or alternatively uniquely identify one of a plurality of forms that are available via the web site.
  • the access code may be provided to the worker in the worker criteria discussed above.
  • the access code may be provided to the worker by other means as well, such as transmitted along with the section.
  • Data that is entered via a form may be transmitted to the controller 110 which in turn transmits the data to the customer device 120. Alternatively, the data entered via the form may be transmitted directly to the customer device 120 and/or a database specified by the customer.
  • a table 1600 represents a form database.
  • the form database may be a part ofthe section database 250 (FIG. 2).
  • the table 1600 includes entries 1602, 1604, 1606, 1608 and 1610, each defining a form to enter data in connection with processing a section. It will be understood by those skilled in the art that the table 1600 may include any number of entries.
  • the table 1600 also defines fields for each ofthe entries 1602, 1604, 1606, 1608 and 1610.
  • the fields specify (i) a section identifier 1620 that uniquely identifies the section, (ii) a form identifier 1622 that uniquely identifies the form, (iii) a URL 1624 that identifies a web site where the form may be accessed, and (iv) an access code 1626 that is required to access the web site and/or specify a particular form available at the web site.
  • One URL may be specified for more than one form.
  • the entries 1606, 1608 and 1610 all have the same URL "www.z.com”. However, at least three forms "F3", “F4" and "F5" are all accessible using that URL. In particular, different access codes distinguish among the different forms.
  • payment may be rendered either directly or indirectly from customer to worker.
  • Payment usually comprises a crediting of an account, such as a bank account or credit card account.
  • payment may also be rendered via check or other means.
  • the customer may be billed after any predetermined event, such as when the worker transmits the appropriate files to the customer. In one embodiment, the customer may be billed periodically (e.g. every month) for all fees due to workers.
  • Payment due may be defined by the customer, such as when the customer sets forth an amount to be paid for a particular document. Payment may also be defined by the worker, such as when the worker charges a time-based or job- based fee.
  • a plurality of workers may participate in a "Dutch” auction and compete for the lowest amount or rate of pay they will accept in exchange for a particular section or type of work. The lowest bidder would then earn the right to perform the specified work in exchange for the highest bid amount.
  • a plurality of workers may participate in a "reverse" auction in which bidding starts low and increases until all sections have been accepted by workers. Other types of auctions may take place among workers to specify pay rates and terms.
  • workers may compete in an auction for sellers, sellers may compete in an auction for workers.
  • a section of a document may include significant amounts of information, and processing such a section may require a significant amount of time. Consequently, it can be advantageous to allow a plurality of workers to participate in processing a single section. Similarly, it can be advantageous to allow a worker to receive a payment upon processing a portion of a section. Accordingly, the present invention permits a section to be partially processed by a first worker, and a second worker can further process the section. For example, a first worker may create a first text file representing the first page of a document, and a second worker may create a second text file representing the remaining pages of the document.
  • FIG. 17 illustrates a method 1700 for permitting a section to be processed by a plurality of workers.
  • a section is transmitted to a worker (step 1702), as discussed above.
  • a file is received from the worker (step 1704), as also discussed above.
  • the controller 110 determines whether the section has been completely processed (step 1706). In other words, the controller 110 determines whether there is a portion ofthe section on which processing was not performed. If there is a portion ofthe section on which processing was not performed, then at least that portion (possibly the entire section) is transmitted to another worker (step 1708).
  • the file from the original worker may also be sent to the subsequent worker to allow that worker to more readily determine which portions of the section have been processed. Subsequently, a file is received from the subsequent worker (step 1710).
  • the steps 1706, 1708 and 1710 are repeated until the section has been completely processed by one or more workers.
  • All received files may be combined to form a combined file (step 1712).
  • the combined file is in turn transmitted to the customer (step 1714).
  • the files may be transmitted to the customer upon receipt.
  • the customer need not know that the section has been processed by a plurality of workers.
  • a "stop indicator" may be received with, or included in, the file.
  • a stop indicator indicates where in the section the worker stopped processing the section, and therefor indicates the portion ofthe section that has not been processed.
  • the stop indicator may be a textual description ofthe portion ofthe section on which processing was not performed.
  • the stop indicator may comprise the text "Until page 3, second paragraph".
  • a file received from a worker may include the portion ofthe section that has not been processed.
  • a file received from a worker may include the section which the worker has marked with indicia that indicates a portion ofthe section on which processing was not performed. Such indicia may be considered a "stop indicator” that indicates were the worker stopped processing the section.
  • the stop indicator may comprise a boundary line that separates a portion that was processed from a portion that was not processed.
  • a table 1800 represents a record of another embodiment ofthe section database 250 (FIG. 2).
  • each section may be processed by one or more workers, and accordingly the processing performed by each worker is recorded. Accordingly, each record co ⁇ esponds to a section as indicated by the section identifier 1802.
  • the record also includes a document identifier 1804 to identify the document from which the section originated.
  • the record further includes entries 1806, 1808 and 1810, each of which defines a portion ofthe section that was processed by a worker.
  • the table 1800 also defines fields for each ofthe entries 1806, 1808 and 1810.
  • the fields specify (i) a worker identifier 1820 ofthe worker to which the portion was sent, (ii) when the portion was sent to the worker 1822, (iii) when the portion was completed by the worker 1824, (iv) a price 1826 the worker charges for processing the portion, and (v) a percentage completion 1828 ofthe section after the specified worker completes processing his respective portion.
  • FIG. 19 illustrates a method 1900 for processing a plurality of documents in accordance with processing criteria for the plurality.
  • the controller 110 receives a plurality of documents from the customer (step 1902), and also receives processing criteria applicable to the plurality of documents (step 1904).
  • the documents and processing criteria may be received in a manner similar to that described above.
  • the processing criteria may specify, for example, a number ofthe plurality of documents to process in a predetermined time period (e.g. at least five of these documents must be processed within a week), a required accuracy of processing (e.g. the documents must be at least 99% accurately transcribed), and a desired cost for processing the plurality of documents (no more than $1000 to process the plurality).
  • a plurality of workers are selected based on the processing criteria (step 1906). For example, the controller 110 may select ten workers that are qualified for the task and that also have the lowest rates. Similarly, the controller 110 may select twenty ofthe fastest workers if there is not much time. In one embodiment, work may be apportioned among a first set of workers that perform typing, and a second set of workers that proofread the files generated by the first set of workers. In such an embodiment, the controller 110 may determine whether it is more economical to hire more ofthe first set than o the second set to achieve the same overall level of accuracy.
  • the workers may also be selected based on the work loads ofthe workers. For example, if a particular worker has been assigned a large number of sections and has not processed many of those sections, the controller 110 may determine that the worker has a large work load and thus is unlikely to provide rapid service. Conversely, if a particular worker has few or no unprocessed sections, the controller 110 may determine that the worker has a small work load and thus is likely to provide rapid service. In addition, the past response rate (duration from receiving a section and completing processing for the section) of a worker may be considered in determining his future response rate.
  • the documents and/or sections of the documents are apportioned among the plurality of workers (step 1908) so each worker is assigned a section of a document and all sections are allocated.
  • the sections are transmitted to the appropriate worker (step 1910). Transmitting sections to workers and subsequently receiving files from those workers has been described above.

Abstract

A controller receives a document (702) from a customer. The document may be an image generated by scanning a paper document. One or more sections (704) of the document that are indicated by predetermined indicia are determined. Different sections may include different types of information. For example, one section of a document might include information that identifies a person, while another section of the document might include confidential information about that person. The sections are transmitted to one or more workers (706) who perform corresponding data entry. Separating the sections can thus preserve the confidential nature of the document. A file is subsequently received from each worker. The file corresponds to data entry performed on the section of the document. The controller determines the customer associated with the file and transmits the file to the customer (710). Payment is then rendered, directly or indirectly, from the customer to the workers (714).

Description

METHOD AND APPARATUS FOR PROCESSING A DOCUMENT
CONTAINING CONFIDENTIAL INFORMATION
The present application is a continuation-in-part application of co- pending U.S. Patent Application No. 09/112,131, entitled "METHOD AND APPARATUS FOR A CRYPTOGRAPHICALLY-ASSISTED COMMERCIAL NETWORK SYSTEM DESIGNED TO FACILITATE AND SUPPORT EXPERT- BASED COMMERCE", filed on July 8, 1998, which is a continuation of U.S. Patent Application No. 08/685,706, filed July 24, 1996, each incorporated herein by reference as part of the present disclosure.
FIELD OF THE INVENTION
The present invention relates to methods and apparatus for processing documents, and more specifically to methods and apparatus that facilitate the processing of documents containing possibly confidential information.
BACKGROUND OF THE INVENTION
Despite the prevalence of computers and computer-related technology, a significant amount of information processing must be performed by humans rather than by computers or similar devices. For example, humans continue to be required in proofreading and amending language-translated text, especially if the translation was performed by a computer. Similarly, humans are often required in creating computer-based information, such as a text file, from a paper document or a similar document that is not computer-based. To do so, a human must typically read printing (e.g. printed characters) on the paper document and manually type text corresponding to the printing in order to create a text file.
Computers can create text files from paper-based documents using a technique known as Optical Character Recognition ("OCR"). As is known in the art, OCR requires that an electronic device examine printed characters on the paper document, and determine the shapes of the printed characters by detecting patterns of dark and light. Then, character recognition methods are used to translate the shapes into computer text and a corresponding text file may be created and stored.
Unfortunately, OCR is often inaccurate. Any character may assume a large number of shapes, especially if the character is handwritten. To circumvent this disadvantage, some OCR systems are limited to recognizing known typefaces and sizes and cannot recognize handwriting at all. Other OCR systems use sophisticated algorithms to learn new typefaces and sizes, and may thus increase recognition accuracy somewhat. However, even the best OCR system is unable to make the common sense judgments that allow humans to recognize poorly recognizable characters and words from their context. Consequently, OCR remains unable to consistently generate accurate text files. In fact, it can take so long to proofread and correct a text file created with OCR that it would be just as easy to have a human manually create a text file by typing the text. Unfortunately, many businesses have a great need to create computer- based information from paper documents. For example, insurance companies and other businesses in the health care field must process paper documents such as claim requests, payment receipts, new member applications and medical test results. The overwhelming majority of these paper documents are processed manually which is extremely expensive and time consuming. Such documents may include confidential information, such as the medical information of patients. Consequently, it is undesirable to permit people to have access to these documents.
It would be advantageous to provide a method and apparatus that facilitates the creation of computer-based information from paper documents.
SUMMARY OF THE INVENTION
It is an object of the present invention to provide a method and apparatus that facilitates the creation of computer-based information from paper documents.
In accordance with the present invention, a controller receives a document from a customer. The document may be an image generated by scanning a paper document. One or more sections of the document that are indicated by predetermined indicia are determined. Different sections may include different types of information. For example, one section of a document might include information that identifies a person, while another section of the document might include confidential information about that person. The sections are transmitted to one or more workers who perform corresponding data entry. Separating the sections can thus preserve the confidential nature of the document. A file is subsequently received from each worker. The file corresponds to data entry performed on the section of the document. The controller determines the customer associated with the file and transmits the file to the customer. Payment is then rendered, directly or indirectly, from the customer to the workers. BRIEF DESCRIPTION OF THE DRAWINGS
FIG. 1 is a schematic illustration of an apparatus for processing a document in accordance with the present invention.
FIG. 2 is a schematic illustration of a controller of the apparatus of FIG. 1.
FIG. 3 is a schematic illustration of a customer database of the controller of FIG. 2.
FIG 4 is a schematic illustration of a worker database of the controller of FIG. 2. FIG. 5 is a schematic illustration of a document database of the controller of FIG. 2.
FIG. 6 is a schematic illustration of a section database of the controller of FIG. 2.
FIG. 7 is a flow chart illustrating a method for processing a document in accordance with the present invention.
FIG. 8 is a data flow diagram illustrating the transfer of information within the apparatus of FIG. 1 during a portion of the method of FIG. 7.
FIGS. 9 A, 9B, 9C and 9D are images that include indicia for identifying sections. FIG. 10 is an exemplary image including indicia in the form of boundary lines that define the perimeters of two sections.
FIGS. 11A and 1 IB are schematic illustrations of sections of a document. FIG. 12 is a flow chart illustrating an embodiment of a portion of the method of FIG. 7.
FIG. 13 is a data flow diagram illustrating the transfer of information within the apparatus of FIG. 1 during another portion of the method of FIG. 7. FIG. 14 is a flow chart illustrating an embodiment of another portion ofthe method of FIG. 7.
FIG. 15 is a schematic illustration of a form for entering data. FIG. 16 is a schematic illustration of a form database. FIG. 17 is a flow chart illustrating a method for permitting a section to be processed by a plurality of workers.
FIG. 18 is a schematic illustration of a record of another embodiment ofthe section database ofthe controller of FIG. 2.
FIG. 19 is a flow chart illustrating a method for processing a plurality of documents in accordance with processing criteria for the plurality.
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS
The present invention allows a business or other entity to automatically locate and communicate with one or more workers who in turn create computer-based information (e.g. text files or data base entries) for the business from a paper document. Thus, the business need not consume resources in locating and hiring appropriate workers. The business also remains unburdened by the administrative costs associated with hiring and retaining such workers in an employment capacity.
In addition, the present invention further allows information derivable from the paper document to remain confidential even if the document is being processed by a worker who may be untrustworthy. Thus, even sensitive documents such as medical records may be processed without compromising their confidential information.
The present invention likewise benefits people that can function as "workers" (described below) even without possessing specialized skills. A market is created for such workers, who may work at times they choose and as much as they choose. In particular, very small amounts of free time (e.g. minutes) may be spent earning pay in exchange for services rendered remotely. Thus, people can earn money during times such as lunch breaks or other periods of inactivity. Workers may furthermore maintain their anonymity, should they desire.
Referring to FIG. 1 , an apparatus 100 comprises a controller 110 that is in communication with a customer device 120 and with worker devices 130 and 135. The controller 110 may communicate with the customer device 120 and the worker devices 130 and 135 via an appropriate network such as the Internet. Each ofthe controller 110, the customer device 120, and the worker devices 130 and 135 may comprise computers, such as those based on the Intel® Pentium® microprocessor, that are adapted to communicate via the Internet (e.g. via a modem). Any number of customer devices and worker devices may be in communication with the controller 110. The controller 110 facilitates communication between a customer (e.g. a business) using the customer device 120 and one or more workers using worker devices such as the worker devices 130 and 135. The communication may be performed while maintaining anonymity of each party, and in fact the present invention can eliminate the need for a party to identify the other party with which it communicates. Accordingly, the parties may form a relationship that allows confidential information to be handled efficiently and without compromise.
Referring to FIG. 2, the controller 110 comprises a processor 202, such as the Intel® Pentium® microprocessor. The processor 202 is in communication with a data storage device 204, such as an appropriate combination of magnetic, optical and/or semiconductor memory. For example, the data storage device 204 may comprise one or more of a ROM, RAM and hard disk drive. The processor 202 and the data storage device 204 may each be (i) located entirely within a single computer or other computing device; (ii) connected to each other by a remote communication medium, such as a serial port cable, telephone line or radio frequency transceiver; or (iii) a combination thereof. In one embodiment, the controller 110 may comprise one or more computers that are connected to a remote server computer for maintaining databases.
The data storage device 204 stores a program 210 for controlling the processor 202. The processor 202 performs instructions ofthe program 210, and thereby operates in accordance with the present invention, and particularly in accordance with the methods described in detail herein. The program 210 furthermore includes program elements that may be necessary, such as an operating system and "device drivers" for allowing the processor 202 to interface with computer peripheral devices. Appropriate device drivers and other necessary program elements are known to those skilled in the art, and need not be described in detail herein.
The storage device 204 also stores (i) a customer database 220, (ii) a worker database 230, (iii) a document database 240, and (iv) a section database 250. The databases 220, 230, 240 and 250 are described in detail below and depicted with exemplary entries in the accompanying figures. As will be understood by those skilled in the art, the schematic illustrations and accompanying descriptions of the databases presented herein are exemplary arrangements for stored representations of information. A number of other arrangements may be employed besides those suggested by the tables shown. Similarly, the illustrated entries ofthe databases represent exemplary information, and those skilled in the art will understand that the number and content ofthe entries can be different from those illustrated herein. Referring to FIG. 3, a table 300 represents an embodiment ofthe customer database 220 (FIG. 2). The table 300 includes entries 302 and 304, each defining a customer that interacts with the controller 110 (FIG. 1) to have documents processed. It will be understood by those skilled in the art that the table 300 may include any number of entries. The table 300 also defines fields for each ofthe entries 302 and 304. The fields specify (i) a customer identifier 320 that uniquely identifies the customer, (ii) an electronic mail ("email") address 322 for communication with the customer, (iii) a customer name 324, and (iv) a financial account identifier 326 that indicates a financial account the customer may use to pay for services.
Referring to FIG. 4, a table 400 represents an embodiment ofthe worker database 230 (FIG. 2). The table 400 includes entries 402, 404 and 406, each defining a worker that interacts with the controller 110 (FIG. 1) to process documents. It will be understood by those skilled in the art that the table 400 may include any number of entries. The table 400 also defines fields for each ofthe entries 402, 404 and 406. The fields specify (i) a worker identifier 420 that uniquely identifies the worker, (ii) an email address 422 for communication with the worker, (iii) a worker name 424, (iv) a financial account identifier 426 that indicates a financial account that the worker may use to receive payment, (v) job skills 428 ofthe worker, (vi) a quality rating 430 ofthe worker, (vii) a speed rating 432 ofthe worker, and (viii) a fee charged 434 by the worker for its services. Those skilled in the art will understand that a number of other fields may specify further information regarding a worker, such as other types of ratings for the worker.
Referring to FIG. 5, a table 500 represents an embodiment ofthe document database 240 (FIG. 2). The table 500 includes entries 502 and 504, each defining a document that a customer needs to have processed by a worker. It will be understood by those skilled in the art that the table 500 may include any number of entries. The table 500 also defines fields for each ofthe entries 502 and 504. The fields specify (i) a document identifier 520 that uniquely identifies the document, (ii) the customer 522 for which the document is to be processed, (iii) a job code 524 that identifies the type of processing to be performed on the document, (iv) a price 526 to pay for processing the document, (v) how to receive payment 528 for processing performed, (vi) a required worker rating 530, (vii) a time 532 by which the document must be processed, (viii) other required characteristics 534 ofthe worker (if any), (ix) whether sections ofthe document must be processed by different workers 536, (x) section identifiers 538 that uniquely identify sections ofthe document that are to be processed by one or more workers, and (xi) whether and when the document processing has been completed 540.
Referring to FIG. 6, a table 600 represents an embodiment ofthe section database 250 (FIG. 2). The table 600 includes entries 602, 604, 606, 608 and 610, each defining a section that is to be processed by a worker. It will be understood by those skilled in the art that the table 600 may include any number of entries. The table 600 also defines fields for each ofthe entries 602, 604, 606, 608 and 610. The fields specify (i) a section identifier 620 that uniquely identifies the section, (ii) a document identifier 622 ofthe document from which the section originated, (iii) a worker identifier 624 ofthe worker to which the section was sent, (iv) when the section was sent to the worker 626, (v) when the section was completed by the worker 628, and (vi) a price 630 the worker charges for processing the section.
Referring to FIG. 7, a method 700 for processing a document includes receiving a document from a customer (step 702). The step 702 may comprise receiving an electronic image, such as a JPEG ("Joint Photographic Experts Group") file or GIF ("Graphics Interchange Format"), of a paper document. Such an image may be generated by a device known as a "scanner" (e.g. a flatbed scanner or drum scanner) and sent via electronic mail to the controller 110 (FIG. 1). Other methods for generating the image and sending the image to the controller 110 will be understood by those skilled in the art.
Alternatively, the step 702 may comprise receiving a paper document and then generating an electronic document (e.g. one or more images) from the paper document. In such an embodiment, the controller 110 may include a scanner that generates an electronic document from the received paper document. The document is divided into one or more sections (step 704). In one embodiment, each section comprises an image that is a portion of an image ofthe document. Thus, each section represents a portion ofthe document. As described in detail below, it can be advantageous to send only a section of a document to a worker. For example, one section of a document might include information that identifies a person, while another section ofthe document might include confidential information about that person. Providing these two sections to two workers would thus keep the person anonymous to the worker processing the section including the confidential information. The sections ofthe document are transmitted to one or more workers
(step 706). A section may be transmitted to a worker in a number of ways, such as via email or via a web site. The workers receive the sections and perform corresponding data entry. The term "data entry" refers to the process of writing data into computer memory. Such computer memory may be volatile memory (e.g. RAM) or nonvolatile memory (e.g. disk subsystems or flash memory). As used herein, data entry includes (i) creating text from an image of a paper document by reading printing (e.g. printed characters) on the image and manually typing the text corresponding to the printing, (ii) proofreading and amending text, (iii) translating text (e.g. from one language to another), and (iv) creating text from an audio file by listening to the audio and manually typing the text corresponding to the audio. Those skilled in the art will understand that there are other types of data entry that the workers may perform after receiving a section of a document.
In one embodiment, it may be necessary to assure that a document is processed by at least two workers. However, in another embodiment it may be acceptable to send sections to one worker. For example, it may be sufficient to send the same sections to a worker if each section cannot be matched with the others. Such an embodiment is especially appropriate where a large number of documents are processed by a single worker. The worker could not match any one section with another from the same document. In addition, it can be desirable to delay the transmission ofthe sections to the worker, since this reduces the ability ofthe worker to match the sections.
Once a worker has performed the specified data entry, the worker transmits back to the controller 110 (FIG. 1) a file that corresponds to the data entry. For example, the worker may transmit a text file that represents words that were printed in the section the worker received. The controller 110 receives files from each worker (step 708) and transmits the files to the customer (step 710). The controller 110 (FIG. 1) receives an amount of payment from the customer (step 712), and transmits appropriate amounts of payment to each worker (step 714). Alternatively, the controller 110 may initiate payment from the customer directly to each worker.
Referring to FIG. 8, a data flow diagram 800 illustrates the transfer of information among the devices ofthe apparatus 100 (FIG. 1). The illustrated transfer occurs during a first stage (i.e. steps 702, 704 and 706) ofthe method 700. The customer device 120 transmits an electronic document (e.g. an image of a paper document) to the controller 110, which in turn divides the electronic document into sections. The controller 110 then transmits the sections to the worker devices 130 and 135.
An image of a document may be divided into sections in a number of ways. Sections ofthe document may be defined by one or more predetermined indicia. The customer may include indicia in an image transmitted to the controller 110. For example, the document may be partially preprinted with the indicia, or the customer may add indicia to an electronic image ofthe document that it generates from a paper document. Alternatively, the indicia may be added after the customer provides the document. For example, the image may be marked with the indicia by the controller 110. In one embodiment, documents may be marked with indicia in accordance with a predetermined process, and each customer may have one (or more) associated predetermined processes. For example, the controller 110 may add predetermined indicia to all documents received from a first predetermined customer. In such an embodiment, all documents received from the first predetermined customer would be divided into sections in accordance with the same process. This can be especially advantageous for businesses having a large number of documents ofthe same type (e.g. medical claim forms).
FIGS. 9 A, 9B, 9C and 9D illustrate different ways to identify sections with indicia. In FIG. 9A, an image 900 includes indicia 902 that are in the form of a demarcation line. The indicia 902 thereby define two sections 904 and 906. In such an embodiment, it may be predetermined, for example, that the top section 904 contains information that identifies a person and the bottom section 906 contains confidential information about that person. In FIG. 9B, an image 920 includes indicia 922, 924 and 926, each in the form of a boundary line that defines the perimeters of one of three sections 928, 930 and 932, respectively. Alternatively, more than one indicia can indicate a plurality of disjointed portions ofthe same section. For example, the indicia 922 and 926 may indicate two disjointed portions ofthe same section, while the indicia 924 can indicate another, different section. In such an embodiment, the indicia 922 and 926 may both be ofthe same type (to denote portions ofthe same section) while the indicia 924 are of a different type. For example, the indicia 922 and 926 may both be of an identical color, width or other style, which is different from that ofthe indicia 924. In FIG. 9C, an image 940 includes indicia 942 and 944. The indicia 942 are in the form of a demarcation line that defines two sections 946 and 948. In such an embodiment, the indicia 944 may differentiate between the sections by identifying the section in which the indicia 944 is located. For example, the indicia 944 may indicate that the section 948 (which contains the indicia) contains information that identifies a person. In one embodiment, it would then be assumed that the remaining section 946 contains confidential information about that person.
In FIG. 9D, an image 960 includes indicia 962, 964 and 966. The indicia 962 are in the form of a demarcation line that defines two sections 968 and 970. The section 968 includes indicia 964 in the form of a bar code, and the section 970 includes indicia 966 that are also in the form of a bar code. In such an embodiment, the indicia 964 and 966 may identify the sections 968 and 970 respectively. For example, the indicia 964 may indicate that the corresponding section (the section 968) contains information that identifies a person, while indicia 966 may indicate that the corresponding section (the section 970) contains confidential information about that person.
Referring to FIG. 10, an image 1000 of a document includes indicia 1002 and 1004, each in the form of a boundary line that defines the perimeters of two sections 1012 and 1014, respectively. The section 1012 contains information that identifies a person (i.e. a name, social security number and address). The section 1014 contains confidential information about that person (i.e. medical information).
Sections ofthe document may alternatively be identified as predetermined locations within a document. For example, the top half of a document may be a first section and the bottom half may be a second section. In one embodiment, documents may be divided into sections in accordance with a predetermined set of locations, and each customer may have one (or more) associated predetermined sets of locations. For example, all documents received from a first customer may be divided into three identically sized sections: a bottom section, a middle section and a top section.
Sections may be mutually exclusive so that no section overlaps another section. Conversely, sections may overlap so that a portion of a document is included in more than one section. Furthermore, a document may include one or more sections, but there may be portions of a document that are not included in any section. Such portions are thus not provided to workers. For example, a portion of a document that includes a standard letterhead logo may need no processing. In addition, as described above a section may include two disjointed portions of a document.
Different sections of a document may include different types of information. For example, a first section may include confidential information about a person, while a second section includes information that identifies that person. Sections may also contain the same type of information, yet each section would include a small portion of such information. This is advantageous where there exists a large amount of one type of information, and it would be easier to break up the information into more manageable sizes. The image 1000 also includes an identifier 1020 that is in the form of a set of characters. Although in FIG. 10 the identifier 1020 is in the form of a set of characters, in other embodiments the identifier may be in another form, such as a bar code. The identifier 1020 identifies processing criteria (the customer's requirements and/or preferences for processing the document). The identifier may indicate, for example, an amount of payment to be paid for processing the document, billing information, a job code that specifies the type of processing required, a required rating ofthe worker(s), an indication of whether sections ofthe document can go to a single worker, a time when completion by the worker is required, and required characteristics ofthe worker.
The identifier 1020 on the image 1000 may indicate the document identifier described above with reference to FIG. 5. In such an embodiment, the document identifier may be used to access further information from the document database 240 (FIG. 2) from which processing criteria may be derived. In another embodiment, the identifier may represent the processing criteria. For example, if the identifier is a set of characters, the first two characters may indicate billing information, the second three characters may indicate the type of processing required and the final eight characters may indicate a time when completion by the worker is required. Similarly, if the identifier is a bar code that represents a sequence of digits, the various digits may represent various types of processing criteria, as would be apparent to one skilled in the art.
FIGS. 11 A and 1 IB each illustrate a section of a document that may be sent to a worker. The section 1100 (FIG. 11 A) corresponds to the section 1012 of FIG. 10, and the section 1150 (FIG. 1 IB) corresponds to the section 1014 of FIG. 10. The section 1100 includes a bar code 1102 that indicates worker criteria for the section 1100. Similarly, the section 1150 includes a bar code 1152 that indicates worker criteria for the section 1150. As described below, the worker criteria of a section specify requirements and/or preferences for processing the document that are provided to the worker. For example, the worker criteria may indicate a time of completion for the worker. In one embodiment, the worker criteria are based, at least in part, on the processing criteria received from the customer. In one embodiment, the worker criteria may simply indicate the section's corresponding document or customer. FIG. 12 illustrates a method 1200 for processing a document in accordance with an embodiment ofthe present invention. The method 1200 generally describes an embodiment ofthe steps 702, 704 and 706 ofthe method 700 (FIG. 7).
A paper document is received from a customer (step 1202) and a corresponding electronic document (e.g. an image ofthe paper document) is generated (step 1204). In an alternate embodiment, an electronic document is received (e.g. via email). Processing criteria associated with the image are also received (step 1206). As described above, the processing criteria indicate the customer's requirements and/or preferences for processing the document. The processing criteria may comprise a job code that identifies the type of data entry to be performed on the document. Similarly, the processing criteria may comprise a requirement that a section is processed by two or more workers, so that one worker can "proofread" the work of another, or so that inaccurate processing performed by any one or more workers may be more readily detected.
The processing criteria may be machine-readable (e.g. in the form of a bar code) to automate reading and interpreting the processing criteria. Alternatively, the processing criteria may be text or other indicia that is not interpreted by the controller 110, but that is read by human personnel who direct the operations ofthe controller 110 accordingly. In another embodiment, the processing criteria may be text or other indicia that is merely made available to be read by workers. The processing criteria may be received separately from the document, or may be incorporated into the image ofthe document (e.g. as a bar code or other indicia on the document). The processing criteria may be received substantially simultaneously with the paper document. Typically, separate processing criteria would accompany each paper document. In an alternate embodiment, processing criteria may be received from a customer, and the processing criteria are then applied to all documents received from that customer. In such an embodiment, the customer may, for example, desire that only one type of data entry ever be performed.
The controller 110 then determines a first section ofthe image that is indicated by first predetermined indicia (step 1208), and determines a second section ofthe image that is indicated by second predetermined indicia (step 1210). In one embodiment, the first predetermined indicia may be the second predetermined indicia. For example, referring again to FIG. 9A, the indicia 902 are in the form of a demarcation line that defines two sections 904 and 906. Thus, the same indicia 902 indicate both sections 904 and 906.
Based on the processing criteria, the controller 110 selects a first worker and a second worker (steps 1212 and 1214). In one embodiment, a worker is selected if the type of work that the worker performs corresponds to the job code included in the processing criteria. For example, if the job code identifies that a document must be translated from German to English, then a worker having the requisite job skills (i.e. German language translation) is selected. Various methods for selecting workers are described below.
After the first worker and the second worker are selected, the first section is transmitted to the first worker (step 1216) and the second section is transmitted to the second worker (step 1218). In addition, a first identifier that identifies the document is transmitted to the first worker (step 1220), and a second identifier that also identifies the document is transmitted to the second worker (step 1222). The first identifier and the second identifier may be identical but need not be. For example, the first identifier may be "Dl " and the second identifier may be "El ", yet each may refer to the same document (e.g. the last character may uniquely identify the document). In another embodiment, the first identifier and the second identifier may identify a customer instead of the document. Thus, the sections would still correspond to a customer, and each corresponding file received in response to a transmitted section would correspond to a customer.
The worker criteria may include the above-described identifiers. Furthermore, the worker criteria may be transmitted and received in a manner similar to that described herein with regard to the processing criteria. For example, the first identifier and the second identifier may be transmitted separately, or may be incorporated into the sections themselves. For example, the identifiers may comprise a bar code, and the controller 110 could label each section with an identifier in the form of a bar code. Sections and identifiers may be transmitted in several ways, such as via email or via a site on the World Wide Web.
Those skilled in the art will understand that the above-described step of transmitting a first section to a first worker and a second section to a second worker may comprise transmitting the first section to a worker in a first group of workers and the second section to a worker in a second group of workers. In such an embodiment, it may be desirable that the first group of workers are qualified to receive a first type of section (e.g. a section that includes information that identifies a person) and that the second group of workers are qualified to receive a second type of section (e.g. a section that includes confidential information about that person). Accordingly, it may be necessary to ensure the workers in that one group (or both groups) satisfy minimum requirements (e.g. no criminal background). Referring to FIG. 13, a data flow diagram 1300 illustrates the transfer of information among the devices ofthe apparatus 100 (FIG. 1). The illustrated transfer occurs during a second stage (i.e. steps 708 and 710) of the method 700, which is after that illustrated by FIG. 8. The worker devices 130 and 135 each transmit a file to the controller 110. The controller 110 then transmits the files to the customer device 120. The files may be transmitted to the customer device 120 separately, or the controller 110 may combine the files and transmit the combined file to the customer device 120.
Referring to FIG. 14, a method 1400 for processing a document includes receiving a first file from a first worker (step 1402). The first file corresponds to data entry performed on a first section that was previously transmitted to the first worker. As described above, the first section is derived from a document and may be a portion ofthe document. For example, the first file may be a text file of manually typed text. In another embodiment described in more detail below, the first file may be data that is created by the worker via a form and stored according to various fields ofthe form.
The controller 110 may also receive a first identifier that identifies a first document (from which the first section was derived). For example, the first identifier may be a document identifier as illustrated in field 520 (FIG. 5). The first identifier may also be a section identifier as illustrated in the field 620 (FIG. 6), which would, in turn, indicate a document, as is apparent from the field 622 (FIG. 6).
The first identifier may be received substantially simultaneously with the first file. For example, the first identifier may be received immediately before or immediately after the first file is received, thereby associating the first file with the first identifier. The first file may also include the first identifier, thereby allowing the document to be identified from the file. Other methods for identifying the first document from which the first section was derived will be understood by those skilled in the art. A second file is received from a second worker (step 1404). The second file corresponds to data entry performed on a second section that was previously transmitted to the second worker. A second identifier also identifies a second document from which the second section was derived. The second file and the step of receiving the second file are analogous to the description above ofthe step 1402.
The controller 110 determines whether the first document and the second document are equivalent (step 1406). If so, then the first section and the second section are derived from the same document, and accordingly the first file and the second file represent data entry performed in connection with the same document. Consequently, in one embodiment both the first file and the second file are destined for the same customer. The customer that is associated with both the first file and the second file is determined (step 1408). For example, referring again to FIG. 5, each document corresponds to a customer, as indicated by fields 520 and 522 ofthe document database 240. Thus, the customer may be determined from the document database 240. The first file and the second file are transmitted to the customer (step 1410). As described above, the files may be transmitted to the customer device 120 separately, or the controller 110 may combine the files and transmit the combined file to the customer device 120. The workers are rated based on their work (step 1412). For example, the first worker is rated based on the first file and possibly based on a previous rating, if any. Rating a worker is described in the parent application, U.S. Patent Application No. 09/112,131, entitled "METHOD AND APPARATUS FOR A CRYPTOGRAPHICALLY-ASSISTED COMMERCIAL NETWORK SYSTEM DESIGNED TO FACILITATE AND SUPPORT EXPERT-BASED COMMERCE", filed on July 8, 1998.
Although two sections and two corresponding files were described with reference to FIG. 14, a document may include one or more sections that are processed. Accordingly, in one embodiment, it may be desirable to determine the number of sections in a document, and all coπesponding files are transmitted only after a predetermined number of files (e.g. the number of sections) have been received. Alternatively, each file may be transmitted to the customer when it is received.
Referring to FIG. 15, a form 1500 illustrates a form that may be used to allow a worker to enter data based on the section 1100 (FIG. 11) and to store that data according to fields ofthe form 1500. The form 1500 includes a form field 1502 that permits entry of a name, a form field 1504 that permits entry of a social security number and a form field 1506. Forms may be sent along with a corresponding document from the customer to the controller 110 (FIG. 1) and in turn to the worker. Forms may also be provided to a worker via a web site. For example, a worker may be provided with a "URL" (Uniform Resource Locator) that identifies a web site or other computer-based communication channel. Upon accessing the URL, the form may be presented to the worker. Those skilled in the art will understand that various types of forms may be readily displayed. For example, HTML forms are well known to those skilled in the art, as are proprietary form formats such as PDF (portable document format) sold by Adobe and JetForm.
In one embodiment, the form may be accessible through a web site after the worker enters an access code at the web site. The access code may serve as authorization that the worker is allowed to access a form. The access code may additionally or alternatively uniquely identify one of a plurality of forms that are available via the web site. The access code may be provided to the worker in the worker criteria discussed above. The access code may be provided to the worker by other means as well, such as transmitted along with the section. Data that is entered via a form may be transmitted to the controller 110 which in turn transmits the data to the customer device 120. Alternatively, the data entered via the form may be transmitted directly to the customer device 120 and/or a database specified by the customer.
Referring to FIG. 16, a table 1600 represents a form database. The form database may be a part ofthe section database 250 (FIG. 2). The table 1600 includes entries 1602, 1604, 1606, 1608 and 1610, each defining a form to enter data in connection with processing a section. It will be understood by those skilled in the art that the table 1600 may include any number of entries. The table 1600 also defines fields for each ofthe entries 1602, 1604, 1606, 1608 and 1610. The fields specify (i) a section identifier 1620 that uniquely identifies the section, (ii) a form identifier 1622 that uniquely identifies the form, (iii) a URL 1624 that identifies a web site where the form may be accessed, and (iv) an access code 1626 that is required to access the web site and/or specify a particular form available at the web site. One URL may be specified for more than one form. For example, the entries 1606, 1608 and 1610 all have the same URL "www.z.com". However, at least three forms "F3", "F4" and "F5" are all accessible using that URL. In particular, different access codes distinguish among the different forms.
As described above, payment may be rendered either directly or indirectly from customer to worker. Payment usually comprises a crediting of an account, such as a bank account or credit card account. However, payment may also be rendered via check or other means. The customer may be billed after any predetermined event, such as when the worker transmits the appropriate files to the customer. In one embodiment, the customer may be billed periodically (e.g. every month) for all fees due to workers.
Payment due may be defined by the customer, such as when the customer sets forth an amount to be paid for a particular document. Payment may also be defined by the worker, such as when the worker charges a time-based or job- based fee. Alternatively, a plurality of workers may participate in a "Dutch" auction and compete for the lowest amount or rate of pay they will accept in exchange for a particular section or type of work. The lowest bidder would then earn the right to perform the specified work in exchange for the highest bid amount. Similarly, a plurality of workers may participate in a "reverse" auction in which bidding starts low and increases until all sections have been accepted by workers. Other types of auctions may take place among workers to specify pay rates and terms. In addition, just as workers may compete in an auction for sellers, sellers may compete in an auction for workers.
A section of a document may include significant amounts of information, and processing such a section may require a significant amount of time. Consequently, it can be advantageous to allow a plurality of workers to participate in processing a single section. Similarly, it can be advantageous to allow a worker to receive a payment upon processing a portion of a section. Accordingly, the present invention permits a section to be partially processed by a first worker, and a second worker can further process the section. For example, a first worker may create a first text file representing the first page of a document, and a second worker may create a second text file representing the remaining pages of the document.
FIG. 17 illustrates a method 1700 for permitting a section to be processed by a plurality of workers. A section is transmitted to a worker (step 1702), as discussed above. In response, a file is received from the worker (step 1704), as also discussed above. The controller 110 then determines whether the section has been completely processed (step 1706). In other words, the controller 110 determines whether there is a portion ofthe section on which processing was not performed. If there is a portion ofthe section on which processing was not performed, then at least that portion (possibly the entire section) is transmitted to another worker (step 1708). The file from the original worker may also be sent to the subsequent worker to allow that worker to more readily determine which portions of the section have been processed. Subsequently, a file is received from the subsequent worker (step 1710). The steps 1706, 1708 and 1710 are repeated until the section has been completely processed by one or more workers.
All received files may be combined to form a combined file (step 1712). The combined file is in turn transmitted to the customer (step 1714). Alternatively, the files may be transmitted to the customer upon receipt. In an embodiment where the combined file is created, the customer need not know that the section has been processed by a plurality of workers.
In order to determine whether the section has been completely processed, a "stop indicator" may be received with, or included in, the file. A stop indicator indicates where in the section the worker stopped processing the section, and therefor indicates the portion ofthe section that has not been processed. The stop indicator may be a textual description ofthe portion ofthe section on which processing was not performed. For example, the stop indicator may comprise the text "Until page 3, second paragraph". In another embodiment, a file received from a worker may include the portion ofthe section that has not been processed. Alternatively, a file received from a worker may include the section which the worker has marked with indicia that indicates a portion ofthe section on which processing was not performed. Such indicia may be considered a "stop indicator" that indicates were the worker stopped processing the section. For example, the stop indicator may comprise a boundary line that separates a portion that was processed from a portion that was not processed.
Referring to FIG. 18, a table 1800 represents a record of another embodiment ofthe section database 250 (FIG. 2). In the illustrated embodiment, each section may be processed by one or more workers, and accordingly the processing performed by each worker is recorded. Accordingly, each record coπesponds to a section as indicated by the section identifier 1802. The record also includes a document identifier 1804 to identify the document from which the section originated. The record further includes entries 1806, 1808 and 1810, each of which defines a portion ofthe section that was processed by a worker. The table 1800 also defines fields for each ofthe entries 1806, 1808 and 1810. The fields specify (i) a worker identifier 1820 ofthe worker to which the portion was sent, (ii) when the portion was sent to the worker 1822, (iii) when the portion was completed by the worker 1824, (iv) a price 1826 the worker charges for processing the portion, and (v) a percentage completion 1828 ofthe section after the specified worker completes processing his respective portion.
It can be advantageous to allow a customer to specify processing criteria for a group of documents, rather than processing criteria for each document. A customer may only be concerned with a "high level" goal, and thus want to specify high level processing criteria for the group of documents. For example, a customer may wish to specify that a group of one thousand documents is processed within a week for an average cost of under $10 per document. Accordingly, in one embodiment ofthe present invention a customer may specify processing criteria that apply to a plurality of documents. The controller 110 (FIG. 1) then apportions the documents among workers in a manner that satisfies the processing criteria. FIG. 19 illustrates a method 1900 for processing a plurality of documents in accordance with processing criteria for the plurality. The controller 110 receives a plurality of documents from the customer (step 1902), and also receives processing criteria applicable to the plurality of documents (step 1904). The documents and processing criteria may be received in a manner similar to that described above. The processing criteria may specify, for example, a number ofthe plurality of documents to process in a predetermined time period (e.g. at least five of these documents must be processed within a week), a required accuracy of processing (e.g. the documents must be at least 99% accurately transcribed), and a desired cost for processing the plurality of documents (no more than $1000 to process the plurality).
A plurality of workers are selected based on the processing criteria (step 1906). For example, the controller 110 may select ten workers that are qualified for the task and that also have the lowest rates. Similarly, the controller 110 may select twenty ofthe fastest workers if there is not much time. In one embodiment, work may be apportioned among a first set of workers that perform typing, and a second set of workers that proofread the files generated by the first set of workers. In such an embodiment, the controller 110 may determine whether it is more economical to hire more ofthe first set than o the second set to achieve the same overall level of accuracy.
The workers may also be selected based on the work loads ofthe workers. For example, if a particular worker has been assigned a large number of sections and has not processed many of those sections, the controller 110 may determine that the worker has a large work load and thus is unlikely to provide rapid service. Conversely, if a particular worker has few or no unprocessed sections, the controller 110 may determine that the worker has a small work load and thus is likely to provide rapid service. In addition, the past response rate (duration from receiving a section and completing processing for the section) of a worker may be considered in determining his future response rate.
The documents and/or sections of the documents are apportioned among the plurality of workers (step 1908) so each worker is assigned a section of a document and all sections are allocated. The sections are transmitted to the appropriate worker (step 1910). Transmitting sections to workers and subsequently receiving files from those workers has been described above.
Although the present invention has been described with respect to a preferred embodiment thereof, those skilled in the art will note that various substitutions may be made to those embodiments described herein without departing from the spirit and scope ofthe present invention.

Claims

What is claimed is:
1. A method for processing a document, comprising: determining a first section of a document; determining a second section ofthe document; transmitting the first section ofthe document to a first worker to perform coπesponding data entry; transmitting to the first worker a first identifier that identifies the document; transmitting the second section ofthe document to a second worker to perform coπesponding data entry; and transmitting to the second worker a second identifier that identifies the document.
2. The method of claim 1, further comprising: providing the first worker with access to a first database to enter data coπesponding to the first section.
3. The method of claim 1 , in which the document comprises an image.
4. The method of claim 3, further comprising: generating the image by scanning a paper document.
5. The method of claim 3, further comprising: marking the image with at least one of the first predetermined indicia and the second predetermined indicia.
6. The method of claim 3, in which the first section comprises a portion ofthe image.
7. The method of claim 1, in which the first predetermined indicia is the second predetermined indicia.
8. The method of claim 1, in which the first predetermined indicia comprises: an indication of a boundary between the first section ofthe document and the second section ofthe document.
9. The method of claim 8, in which the first predetermined indicia comprises: an indication of one ofthe first section ofthe document and the second section of the document.
10. The method of claim 1, in which the first predetermined indicia comprises a bar code.
11. The method of claim 1 , further comprising: labeling the first section with an identifier that indicates a customer.
12. The method of claim 11 , in which the identifier that indicates a customer comprises: a bar code.
13. The method of claim 1, further comprising: receiving processing criteria.
14. The method of claim 13, in which the processing criteria indicates at least one of: an amount of payment to be paid, billing information, a job code, a required rating ofthe first worker, an indication of whether sections ofthe document can go to a single worker, a time of completion by the first worker, and a required characteristic ofthe first worker.
15. The method of claim 13 , in which the document includes a bar code that indicates the processing criteria.
16. The method of claim 13, in which the document includes a set of characters that indicate the processing criteria.
17. The method of claim 1 , further comprising: transmitting worker criteria to the first worker.
18. The method of claim 17, in which the worker criteria indicates at least one of: a customer identifier, an amount of payment to be paid, a job code, billing information, and a time of completion by the first worker.
19. The method of claim 17, in which the first section is labeled with a bar code that indicates the worker criteria.
20. The method of claim 17, further comprising: creating the worker criteria from received processing criteria.
21. The method of claim 17, in which the first section includes a set of characters that indicate the worker criteria.
22. The method of claim 1, further comprising: ascertaining a size ofthe first section.
23. The method of claim 22, further comprising: if the size ofthe first section is greater than a predetermined threshold, creating a third section and a fourth section from the first section, the third section and the fourth section each having a size less than the size ofthe first section.
24. The method of claim 23, further comprising: transmitting the third section ofthe document to a third worker to perform coπesponding data entry; and transmitting the fourth section ofthe document to a fourth worker to perform coπesponding data entry.
25. The method of claim 1, further comprising: providing the first worker with access to a form that permits entry of data for the first section.
26. The method of claim 25, in which the form comprises an HTML form.
27. The method of claim 1, further comprising: receiving an offer price to be paid in exchange for performance of coπesponding data entry.
28. The method of claim 27, further comprising: receiving a payment identifier that specifies a financial account to pay the offer price.
29. The method of claim 1, in which the step of determining a first section of a document comprises: determining a first section of a document, the first section being indicated by first predetermined indicia.
30. An apparatus for processing a document, comprising: a processor; and a storage device in communication with the processor, the storage device storing a program for directing the processor to: determine a first section of a document, and determine a second section ofthe document; and transmit the first section ofthe document to a first worker to perform coπesponding data entry; transmit to the first worker a first identifier that identifies the document; transmit the second section ofthe document to a second worker to perform coπesponding data entry; and transmit to the second worker a second identifier that identifies the document.
31. A method for processing a document, comprising: receiving a job code associated with an electronic document; determining a first section of the electronic document that is indicated by first predetermined indicia; determining a second section ofthe electronic document that is indicated by second predetermined indicia; selecting, based on the job code, a first worker from a plurality of workers; transmitting the first section ofthe electronic document to the first worker to perform corresponding data entry; transmitting to the first worker a first identifier that identifies the electronic document; selecting, based on the job code, a second worker from the plurality of workers; transmitting the second section ofthe electronic document to the second worker to perform coπesponding data entry, the second worker being different from the first worker; and transmitting to the second worker a second identifier that identifies the electronic document.
32. The method of claim 31 , in which the job code specifies a type of work to be performed on the electronic document.
33. The method of claim 32, in which the step of selecting a first worker comprises: selecting a first worker if a type of work that the first worker performs coπesponds to the job code.
34. The method of claim 31 , in which the step of selecting a first worker comprises: determining a bid price each of plurality of workers is willing to accept; and selecting a worker having a lowest bid price.
35. The method of claim 31, further comprising: generating the electronic document from a paper document.
36. A method for processing a document, comprising: receiving a first file from a first worker, the first file coπesponding to data entry performed on a first previously-transmitted section of a document; receiving a second file from a second worker, the second file coπesponding to data entry performed on a second previously-transmitted section ofthe document; determining a customer associated with both the first file and the second file; and transmitting the first file and the second file to the customer.
37. The method of claim 36, further comprising: rating the first worker based on the first file.
38. The method of claim 37, in which the step of rating comprises: rating the first worker based on the first file and a previous rating of the first worker.
39. The method of claim 36, in which the step of transmitting the first file and the second file to the customer comprises: creating a combined file that includes the first file and the second file; and transmitting the combined file to the customer.
40. The method of claim 36, in which the step of transmitting the first file and the second file to the customer is performed after a predetermined number of files have been received.
41. The method of claim 36, further comprising: paying the first worker.
42. An apparatus for processing a document, comprising: a processor; and a storage device in communication with the processor, the storage device storing a program for directing the processor to: receive a first file from a first worker, the first file coπesponding to data entry performed on a first previously-transmitted section of a document; receive a second file from a second worker, the second file coπesponding to data entry performed on a second previously-transmitted section of the document; determine a customer associated with both the first file and the second file; and transmit the first file and the second file to the customer.
43. A method for processing a document, comprising: receiving a first file from a first worker, the first file coπesponding to data entry performed on a first previously-transmitted section of a first document, the first file indicating a first identifier that identifies the first document; receiving a second file from a second worker, the second file coπesponding to data entry performed on a second previously-transmitted section of a second document, the second file indicating a second identifier that identifies the second document; determining whether the first document and the second document are equivalent; determining a customer associated with both the first file and the second file; and transmitting the first file and the second file to the customer.
44. A method for processing a document, comprising: determining a section of a document, the section being indicated by predetermined indicia; transmitting the section ofthe document to a worker to perform coπesponding data entry; receiving a file from the worker, the file coπesponding to data entry performed on the section ofthe document; determining a customer associated with the file; and transmitting the file to the customer.
45. The method of claim 44, in which the step of transmitting the section ofthe document to a worker comprises: transmitting the section of the document to a first worker and to a second worker; and in which the step of receiving a file from the worker comprises: receiving a first file from the first worker; and receiving a second file from the second worker.
46. The method of claim 45, further comprising: comparing the first file and the second file.
47. The method of claim 44, further comprising: transmitting the file to an additional worker; and receiving from the additional worker an indication of a quality ofthe file.
48. The method of claim 44, further comprising: receiving the document from the customer.
49. The method of claim 48, in which the document comprises an image.
50. An apparatus for processing a document, comprising: a processor; and a storage device in communication with the processor, the storage device storing a program for directing the processor to: determine a section of a document, the section being indicated by predetermined indicia; transmit the section ofthe document to a worker to perform coπesponding data entry; receive a file from the worker, the file corresponding to data entry performed on the section ofthe document; determine a customer associated with the file; and transmit the file to the customer.
51. A method for processing a document, comprising: determining a first section of a document that is indicated by first predetermined indicia; determining a second section ofthe document that is indicated by second predetermined indicia; transmitting the first section ofthe document to a first worker to perform coπesponding data entry; transmitting to the first worker a first identifier that identifies the document; transmitting the second section ofthe document to a second worker to perform coπesponding data entry; transmitting to the second worker a second identifier that identifies the document; receiving a first file from the first worker, the first file coπesponding to data entry performed on the first section ofthe document; and receiving a second file from the second worker, the second file coπesponding to data entry performed on the second section ofthe document.
52. A method for processing a document, comprising: generating an electronic document from a paper document; receiving a job code associated with the electronic document; determining a first section ofthe electronic document that is indicated by first predetermined indicia; determining a second section ofthe electronic document that is indicated by second predetermined indicia; selecting, based on the job code, a first worker from a plurality of workers; transmitting the first section ofthe electronic document to the first worker to perform coπesponding data entry; transmitting to the first worker a first identifier that identifies the electronic document; selecting, based on the job code, a second worker from the plurality of workers; transmitting the second section ofthe electronic document to the second worker to perform coπesponding data entry, the second worker being different from the first worker; transmitting to the second worker a second identifier that identifies the electronic document; receiving a first file from the first worker, the first file indicating a third identifier; receiving a second file from the second worker, the second file indicating a fourth identifier; comparing the third identifier and the fourth identifier to determine whether both the third identifier and the fourth identifier identify the electronic document; determining a customer associated with both the first file and the second file; and transmitting the first file and the second file to the customer.
53. A method for processing a document, comprising: receiving a plurality of documents; receiving processing criteria for processing the plurality of documents; selecting a plurality of workers in accordance with the processing criteria; and apportioning the plurality of documents among the plurality of workers in accordance with the processing criteria.
54. The method of claim 53, in which the criteria for processing includes at least one of: a number of the plurality of documents to process in a predetermined time period, a required accuracy of processing, and a desired cost.
55. The method of claim 53, in which the step of selecting a plurality of worker comprises: selecting a plurality of workers in accordance with the processing criteria and in accordance with at least one of work loads and response rates ofthe plurality of workers.
56. The method of claim 53, further comprising: transmitting the sections to the plurality of workers to perform coπesponding data entry.
57. An apparatus for processing a document, comprising: a processor; and a storage device in communication with the processor, the storage device storing a program for directing the processor to: receive a plurality of documents; receive processing criteria for processing the plurality of documents; select a plurality of workers in accordance with the processing criteria; and apportion the plurality of documents among the plurality of workers in accordance with the processing criteria.
58. A method for processing a document, comprising: receiving a plurality of documents; receiving processing criteria for processing the plurality of documents; selecting a plurality of workers in accordance with the processing criteria; determining for each document a section that is indicated by predetermined indicia; and apportioning the plurality of documents among the plurality of workers in accordance with the processing criteria; and transmitting the sections to the plurality of workers to perform coπesponding data entry.
59. A method for processing a document, comprising: determining a section ofthe document that is indicated by first predetermined indicia; transmitting the section ofthe document to a first worker to perform coπesponding data entry; receiving a file from the first worker, the file coπesponding to data entry performed on the section ofthe document, determining a first portion ofthe section on which processing was not performed; and transmitting the first portion ofthe section to a second worker to perform coπesponding data entry.
60. The method of claim 59, in which the file comprises the section ofthe document.
61. The method of claim 59, in which the step of determining a first portion ofthe section on which processing was not performed comprises: receiving a stop indicator that indicates a first portion ofthe section on which processing was not performed.
62. The method of claim 61, in which the file includes the stop indicator.
63. The method of claim 62, in which the file includes second predetermined indicia indicating the first portion ofthe section on which processing was not performed.
64. The method of claim 61, in which the stop indicator comprises a textual description of a first portion ofthe section on which processing was not performed.
65. An apparatus for processing a document, comprising: a processor; and a storage device in communication with the processor, the storage device storing a program for directing the processor to: determine a section ofthe document that is indicated by first predetermined indicia; transmit the section ofthe document to a first worker to perform coπesponding data entry; receive a file from the first worker, the file coπesponding to data entry performed on the section ofthe document, determine a first portion ofthe section on which processing was not performed; and transmit the first portion ofthe section to a second worker to perform coπesponding data entry.
PCT/US1999/015348 1998-07-08 1999-07-07 Method and apparatus for processing a document containing confidential information WO2000003321A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
AU52089/99A AU5208999A (en) 1998-07-08 1999-07-07 Method and apparatus for processing a document containing confidential information

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US09/112,131 US7523045B1 (en) 1996-07-24 1998-07-08 Method and apparatus for a cryptographically assisted commercial network system designed to facilitate and support expert-based commerce
US09/112,131 1998-07-08
US21825798A 1998-12-22 1998-12-22
US09/218,257 1998-12-22

Publications (1)

Publication Number Publication Date
WO2000003321A1 true WO2000003321A1 (en) 2000-01-20

Family

ID=26809616

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US1999/015348 WO2000003321A1 (en) 1998-07-08 1999-07-07 Method and apparatus for processing a document containing confidential information

Country Status (2)

Country Link
AU (1) AU5208999A (en)
WO (1) WO2000003321A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2404471A (en) * 2003-07-29 2005-02-02 Hewlett Packard Development Co Processing a document based on data contained in its image

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
AU696831B2 (en) * 1994-02-18 1998-09-17 Wilhelmus Robertus Tys Didjeridoo blue

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4141078A (en) * 1975-10-14 1979-02-20 Innovated Systems, Inc. Library circulation control system
US4509123A (en) * 1983-01-06 1985-04-02 Vereen William J Automated tracking process for manufacturing and inventory
US4863384A (en) * 1986-04-10 1989-09-05 Keilty, Goldsmith & Boone Personalized feedback system utilizing pre-recorded media and method of making same
US5225978A (en) * 1989-01-25 1993-07-06 Usisys Corp. Document processing system having integrated expert module
US5301350A (en) * 1989-10-10 1994-04-05 Unisys Corporation Real time storage/retrieval subsystem for document processing in banking operations
US5557515A (en) * 1989-08-11 1996-09-17 Hartford Fire Insurance Company, Inc. Computerized system and method for work management
US5678046A (en) * 1994-11-18 1997-10-14 The Chase Manhattan Bank, N.A. Method and apparatus for distributing files on a file storage device

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4141078A (en) * 1975-10-14 1979-02-20 Innovated Systems, Inc. Library circulation control system
US4509123A (en) * 1983-01-06 1985-04-02 Vereen William J Automated tracking process for manufacturing and inventory
US4863384A (en) * 1986-04-10 1989-09-05 Keilty, Goldsmith & Boone Personalized feedback system utilizing pre-recorded media and method of making same
US5225978A (en) * 1989-01-25 1993-07-06 Usisys Corp. Document processing system having integrated expert module
US5557515A (en) * 1989-08-11 1996-09-17 Hartford Fire Insurance Company, Inc. Computerized system and method for work management
US5301350A (en) * 1989-10-10 1994-04-05 Unisys Corporation Real time storage/retrieval subsystem for document processing in banking operations
US5678046A (en) * 1994-11-18 1997-10-14 The Chase Manhattan Bank, N.A. Method and apparatus for distributing files on a file storage device

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2404471A (en) * 2003-07-29 2005-02-02 Hewlett Packard Development Co Processing a document based on data contained in its image

Also Published As

Publication number Publication date
AU5208999A (en) 2000-02-01

Similar Documents

Publication Publication Date Title
US11663654B2 (en) System and method for processing transaction records for users
US8335702B2 (en) System and method for standardized and automated appeals process
US6934691B1 (en) System and method for managing mail/bills through a central location
US7076462B1 (en) System and method for electronic loan application and for correcting credit report errors
EP1471450A1 (en) A computer system for providing credit information data
US20040143547A1 (en) Automated accounts payable using image typing and type specific processing
JP2006511896A (en) Receipt and related data capture, storage and processing systems and methods
CA2317209A1 (en) Methods and apparatus for collateral risk monitoring
US8326718B2 (en) Systems and methods for distributing private placement documents
US20040243501A1 (en) System and method for automated data processing
JP2012048724A (en) Techniques for pricing task
US6850908B1 (en) Methods and apparatus for monitoring collateral for lending
JP6771121B1 (en) Invoice management device, invoice management method and program
EP1300785A1 (en) Advertisement printing system
CN1204522C (en) File, file processing system and file generating system
US20040210483A1 (en) Method and apparatus for digitally integrating sales, confirmation and billing of published advertising
US20140310158A1 (en) System and Method of Electronically Perfecting A Premium Finance Agreement
US20020156857A1 (en) Method and system for posting requests for bids for professional services
US20030140185A1 (en) Method and system for client-side print job meta-data collection and open billing
US20080255970A1 (en) Method, system,apparatus or device for providing reconciled bookkeeping or accounting electronically
WO2000003321A1 (en) Method and apparatus for processing a document containing confidential information
JP2010157137A (en) Information processing apparatus, information processing method and program
WO2002041109A2 (en) Global vendor financing system and method
CN113792570A (en) Claims data processing method and device, electronic equipment and readable medium
AU2008201398B2 (en) A method, system, apparatus or device for providing reconciled bookkeeping or acounting electronically

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AE AL AM AT AU AZ BA BB BG BR BY CA CH CN CU CZ DE DK EE ES FI GB GD GE GH GM HR HU ID IL IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT UA UG UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): GH GM KE LS MW SD SL SZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

122 Ep: pct application non-entry in european phase