Specifications
The Patent Office Journal 17/12/2010
4602
(12) PATENT APPLICATION PUBLICATION (21) Application No.4008/CHENP/2010 A
(19) INDIA
(22) Date of filing of Application :29/06/2010 (43) Publication Date : 17/12/2010
(54) Title of the invention : MANAGING AN ARCHIVE FOR APPROXIMATE STRING MATCHING
(51) International classification :G06F7/00
(31) Priority Document No :12/015,085
(32) Priority Date :16/01/2008
(33) Name of priority country :U.S.A.
(86) International Application No
Filing Date
:PCT/US2008/088530
:30/12/2008
(87) International Publication No :WO 2009/091494 A1
(61) Patent of Addition to Application
Number
Filing Date
:NA
:NA
(62) Divisional to Application Number
Filing Date
:NA
:NA
(71)Name of Applicant :
1)AB INITIO TECHNOLOGY LLC
Address of Applicant :201 SPRING STREET, LEXINGTON,
MA 02421 U.S.A.
(72)Name of Inventor :
1)ARLEN ANDERSON
(57) Abstract :
In one aspect, in general, a method is described for managing an archive for determining approximate matches associated with strings
occurring in records. The method includes: processing records to determine a set of string representations that correspond to strings
occurring in the records; generating, for each of at least some of the string representations in the set, a plurality of close
representations that are each generated from at least some of the same characters in the string; and storing entries in the archive that
each represent a potential approximate match between at least two strings based on their respective dose representations.
No. of Pages : 42 No. of Claims : 30










