Texasgordo
TGT Addict
I don't think that she can help them.
I don't think that she can help them.
Hadoop and Lucene. Search all you want. But for 650k emails, you could just use Lucene. Heck just point a Solr instance at it and be done by lunch.
Hadoop, Lucene, Solr, Splink. -all perfectly cromulent words.
I use Elasticsearch for searching logs, which is Lucene based. Work insists on paying for stuff so we have a huge Splink license too. Our High Performance guys are doing a Hadoop/ELK project to do some statisticical and behavioral analysis but sometime simple is best and grep is tour friend. It also helps to have one of the fastest computers in the world at your disposal.
Hadoop, Lucene, Solr, Splink. -all perfectly cromulent words.
They embiggen the smallest man.
Well, obviously nobody taught her about punctuation...