Exemplos de PostingsEnum.freq em Java

Linguagem de programação: Java

Classe / Tipo: PostingsEnum

Método / Função: freq

Exemplos em hotexamples.com: 3

PostingsEnum.freq em Java - 3 exemplos encontrados. Esses são os exemplos do mundo real mais bem avaliados de PostingsEnum.freq em Java extraídos de projetos de código aberto. Você pode avaliar os exemplos para nos ajudar a melhorar a qualidade deles.

Métodos Frequentes

Exibir Ocultar

nextDoc(7)

docID(3)

freq(3)

endOffset(1)

nextPosition(1)

startOffset(1)

Métodos Frequentes

nextDoc (7)

docID (3)

freq (3)

endOffset (1)

nextPosition (1)

startOffset (1)

Relacionados

Classifier

FolderExplorer

SimpleHTTPServer

ObjectOutputStream

ChocoLogging

Map

ResponseUtils

SessionHelper

java.sql.Connection

SimpleName

Related in langs

Feed (PHP)

feedReader (PHP)

File (C#)

RefCountObj (C#)

glutAttachMenu (C++)

ol_assert_ret (C++)

Get (Go)

ConfigModelFromYAMLBytes (Go)

nearest_palindrome (Python)

array_int16_cast (Python)

Exemplo n.º 1

0

Exibir arquivo

Arquivo: XMoreLikeThis.java Projeto: anywayjiong/elasticsearch

/** * Adds terms and frequencies found in vector into the Map termFreqMap * * @param termFreqMap a Map of terms and their frequencies * @param vector List of terms and their frequencies for a doc/field * @param fieldName Optional field name of the terms for skip terms */ private void addTermFrequencies( Map<String, Int> termFreqMap, Terms vector, @Nullable String fieldName) throws IOException { final TermsEnum termsEnum = vector.iterator(); final CharsRefBuilder spare = new CharsRefBuilder(); BytesRef text; while ((text = termsEnum.next()) != null) { spare.copyUTF8Bytes(text); final String term = spare.toString(); if (isNoiseWord(term)) { continue; } if (isSkipTerm(fieldName, term)) { continue; } final PostingsEnum docs = termsEnum.postings(null, null); int freq = 0; while (docs != null && docs.nextDoc() != DocIdSetIterator.NO_MORE_DOCS) { freq += docs.freq(); } // increment frequency Int cnt = termFreqMap.get(term); if (cnt == null) { cnt = new Int(); termFreqMap.put(term, cnt); cnt.x = freq; } else { cnt.x += freq; } } }

Exemplo n.º 2

0

Exibir arquivo

Arquivo: TermInfoCollector.java Projeto: justin2061/jate

public TermInfo collect(String term) throws IOException { TermInfo info = new TermInfo(); BytesRef luceneTerm = new BytesRef(term.getBytes()); // this gives documents in which the term is found, but no offset information can be retrieved PostingsEnum postings = MultiFields.getTermDocsEnum(indexReader, ngramInfoFieldname, luceneTerm); // now go through each document int docId = postings.nextDoc(); while (docId != PostingsEnum.NO_MORE_DOCS) { // get the term vector for that document. TermsEnum it = indexReader.getTermVector(docId, ngramInfoFieldname).iterator(); // find the term of interest it.seekExact(luceneTerm); // get its posting info. this will contain offset info PostingsEnum postingsInDoc = it.postings(null, PostingsEnum.OFFSETS); postingsInDoc.nextDoc(); Document doc = indexReader.document(docId); String id = doc.get(idFieldname); JATEDocument jd = new JATEDocument(id); Set<int[]> offsets = new HashSet<>(); int totalFreq = postingsInDoc.freq(); for (int i = 0; i < totalFreq; i++) { postingsInDoc.nextPosition(); offsets.add(new int[] {postingsInDoc.startOffset(), postingsInDoc.endOffset()}); } info.getOffsets().put(jd, offsets); docId = postings.nextDoc(); } return info; }

Exemplo n.º 3

0

Exibir arquivo

Arquivo: TestOmitPositions.java Projeto: fullstorydev/lucene-solr

public void testBasic() throws Exception { Directory dir = newDirectory(); RandomIndexWriter w = new RandomIndexWriter(random(), dir); Document doc = new Document(); FieldType ft = new FieldType(TextField.TYPE_NOT_STORED); ft.setIndexOptions(IndexOptions.DOCS_AND_FREQS); Field f = newField("foo", "this is a test test", ft); doc.add(f); for (int i = 0; i < 100; i++) { w.addDocument(doc); } IndexReader reader = w.getReader(); w.close(); assertNotNull(MultiFields.getTermPositionsEnum(reader, "foo", new BytesRef("test"))); PostingsEnum de = TestUtil.docs(random(), reader, "foo", new BytesRef("test"), null, PostingsEnum.FREQS); while (de.nextDoc() != DocIdSetIterator.NO_MORE_DOCS) { assertEquals(2, de.freq()); } reader.close(); dir.close(); }