InfoCodex can process millions of documents. Because the software is
highly scalable, the document volume is limited only by the capabilities
of the available hardware.
Can InfoCodex be optimized to enhance performance?
InfoCodex uses optimized procedures for the import and analysis
of text documents, for example parallel download from the internet
and an internal database whose very fast access times have been
confirmed in various benchmarks.
If desired, the user can also make use of many additional
tuning options. Here is a list of the most frequently used procedures:
- Import of documents on distributed processors
- Distributed processing
- Restriction to specific file types
- Blacklists of unwanted documents
- Fixed categorizations
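
To illustrate how three of the options above (file type restriction, blacklists, and parallel import) can reduce the work per import run, here is a minimal Python sketch. It is not InfoCodex code and does not use any InfoCodex interface: the names ALLOWED_EXTENSIONS, BLACKLIST, is_wanted, import_document and import_all, as well as the example paths, are assumptions made up for this illustration.

```python
from concurrent.futures import ThreadPoolExecutor
from pathlib import PurePosixPath

# Hypothetical settings, chosen only for this illustration.
ALLOWED_EXTENSIONS = {".txt", ".pdf", ".html"}
BLACKLIST = {"drafts/old_notes.txt"}


def is_wanted(path: str) -> bool:
    """Keep a document only if its file type is allowed and it is not blacklisted."""
    suffix = PurePosixPath(path).suffix.lower()
    return suffix in ALLOWED_EXTENSIONS and path not in BLACKLIST


def import_document(path: str) -> str:
    """Placeholder for the real import step (download, parsing, analysis)."""
    return f"imported {path}"


def import_all(paths, workers=4):
    """Filter the candidates first, then import the remaining ones in parallel."""
    wanted = [p for p in paths if is_wanted(p)]
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # pool.map returns results in the same order as the input list.
        return list(pool.map(import_document, wanted))


if __name__ == "__main__":
    candidates = ["a.txt", "b.exe", "drafts/old_notes.txt", "c.PDF"]
    for line in import_all(candidates):
        print(line)
```

Filtering before the parallel step means unwanted documents never reach the expensive import stage, which is presumably why such options are offered as tuning measures.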