IDC Server, using Bayesian Infererence and Shannon's Information Theory, builds groups of statistics from text contained in unstructured documents. The IDC Server identifies and stores key concepts found in each document that is indexed into it. After indexing a document, IDC Server forms a conceptual understanding of the document's subject matter and then automatically classifies documents into similar groups.