Kroll-Software
FuzzyDupes FAQ


During the duplicate search I get the error "Not enough memory" (Out Of Memory)
FuzzyDupes needs a lot of memory for the duplicate search, so the maximum amount of data that can be searched is limited by the free memory available. A 32-bit system can address only about 2.5 GB of memory, even if more physical RAM is installed. FuzzyDupes 5 was successfully tested with up to 1.5 million data records. This figure varies with the free memory, the number of columns and the redundancy in the data. Close other applications that use a lot of memory, and create a new query in your database that returns only the relevant columns. If this is still not enough, you can pre-select your data and break it into smaller chunks by a specific criterion (e.g. the first digit of the ZIP code). In addition to the 32-bit version, a more powerful 64-bit Parallel Edition is now available.
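Breaking the data into chunks by the first digit of the ZIP code could be prepared outside FuzzyDupes, for example with a short script. The following is only a minimal sketch in Python: the function name split_by_zip_digit, the field name "zip" and the sample records are assumptions made for illustration and are not part of FuzzyDupes.

```python
from collections import defaultdict


def split_by_zip_digit(records, zip_field="zip"):
    """Group records into chunks keyed by the first character of the ZIP code.

    Hypothetical helper for illustration; records without a ZIP code
    are collected under the key "missing".
    """
    chunks = defaultdict(list)
    for record in records:
        zip_code = str(record.get(zip_field) or "").strip()
        key = zip_code[0] if zip_code else "missing"
        chunks[key].append(record)
    return dict(chunks)


# Made-up sample data; in practice the records would come from your database.
records = [
    {"name": "Anna Meyer", "zip": "10115"},
    {"name": "Ana Meyer", "zip": "10117"},
    {"name": "Bernd Schulz", "zip": "80331"},
    {"name": "Unknown", "zip": ""},
]

chunks = split_by_zip_digit(records)
for key in sorted(chunks):
    print(key, len(chunks[key]))
```

Each chunk can then be searched on its own. Note that duplicates whose ZIP codes start with different digits end up in different chunks and will not be compared with each other.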
What is the best practice for a negative match against a second list (blacklist, Robinson list)?
FuzzyDupes offers several approaches. Create a new project for your database. Run a duplicate search to find suitable parameters and thresholds for your data. Then select the menu Duplicate search->Fuzzy Import with the output option Duplicates only. The result is shown in a grouped view, so you can review the matches manually. Remove false positives using the context menu (right mouse button). In the next step, the result of the Duplicates only option can be used for a positive or negative match against the first data table.
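To illustrate what a negative match means, here is a minimal Python sketch that keeps only those records that do not resemble any entry of a second list. It does not show how FuzzyDupes works internally: the similarity measure is Python's standard difflib.SequenceMatcher, used here as a stand-in, and the function names, the threshold of 0.9 and the sample data are assumptions chosen for illustration.

```python
from difflib import SequenceMatcher


def normalize(text):
    """Lowercase the text and collapse runs of whitespace."""
    return " ".join(text.lower().split())


def similarity(a, b):
    """Similarity ratio between 0.0 and 1.0 (stand-in measure, not FuzzyDupes)."""
    return SequenceMatcher(None, normalize(a), normalize(b)).ratio()


def negative_match(records, blacklist, threshold=0.9):
    """Return the records that do not resemble any blacklist entry."""
    kept = []
    for record in records:
        blocked = any(similarity(record, entry) >= threshold for entry in blacklist)
        if not blocked:
            kept.append(record)
    return kept


# Made-up sample data for illustration only.
customers = ["Anna Meyer, Berlin", "Bernd Schulz, Munich", "Carla Weber, Hamburg"]
robinson_list = ["Anna  Meier, Berlin", "Dieter Braun, Cologne"]

remaining = negative_match(customers, robinson_list)
print(remaining)
```

A positive match is the inverse: it keeps the records that do resemble an entry of the second list. As with the thresholds in FuzzyDupes, the value used here would have to be tuned on your own data, and borderline matches are best reviewed manually.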
My question is not answered here.
We're here to help, by email, via our Support Form or by telephone.