Java BasicEntityExtractor 예제들

프로그래밍 언어: Java

네임스페이스/패키지 이름: edu.stanford.nlp.ie.machinereading.structure

클래스/타입: BasicEntityExtractor

hotexamples.com에서의 예제들: 2

Java BasicEntityExtractor - 2개의 예제가 발견되었습니다. 이것들은 오픈소스 프로젝트에서 추출된 Java의 edu.stanford.nlp.ie.machinereading.structure.BasicEntityExtractor에 대한 실세계 최고 등급의 예제들입니다. 예제들을 평가하여 예제의 품질 향상에 도움을 줄 수 있습니다.

자주 사용되는 메소드들

보기 숨기기

annotationsToSkip(1)

classifier(1)

saveCoNLL(1)

useBIO(1)

useSubTypes(1)

예제 #1

파일 보기

파일: BasicEntityExtractor.java 프로젝트: automenta/corenlp

  /**
   * Loads the model from disk.
   *
   * @param path The location of model that was saved to disk
   * @throws ClassCastException if model is the wrong format
   * @throws IOException if the model file doesn't exist or is otherwise unavailable/incomplete
   * @throws ClassNotFoundException this would probably indicate a serious classpath problem
   */
  public static BasicEntityExtractor load(
      String path,
      Class<? extends BasicEntityExtractor> entityClassifier,
      boolean preferDefaultGazetteer)
      throws ClassCastException, IOException, ClassNotFoundException {

    // load the additional arguments
    // try to load the extra file from the CLASSPATH first
    InputStream is =
        BasicEntityExtractor.class.getClassLoader().getResourceAsStream(path + ".extra");
    // if not found in the CLASSPATH, load from the file system
    if (is == null) is = new FileInputStream(path + ".extra");
    ObjectInputStream in = new ObjectInputStream(is);
    String gazetteerLocation = ErasureUtils.<String>uncheckedCast(in.readObject());
    if (preferDefaultGazetteer) gazetteerLocation = DefaultPaths.DEFAULT_NFL_GAZETTEER;
    Set<String> annotationsToSkip = ErasureUtils.<Set<String>>uncheckedCast(in.readObject());
    Boolean useSubTypes = ErasureUtils.<Boolean>uncheckedCast(in.readObject());
    Boolean useBIO = ErasureUtils.<Boolean>uncheckedCast(in.readObject());
    in.close();
    is.close();

    BasicEntityExtractor extractor =
        (BasicEntityExtractor)
            MachineReading.makeEntityExtractor(entityClassifier, gazetteerLocation);

    // load the CRF classifier (this works from any resource, e.g., classpath or file system)
    extractor.classifier = CRFClassifier.getClassifier(path);

    // copy the extra arguments
    extractor.annotationsToSkip = annotationsToSkip;
    extractor.useSubTypes = useSubTypes;
    extractor.useBIO = useBIO;

    return extractor;
  }

예제 #2

파일 보기

파일: BasicEntityExtractor.java 프로젝트: automenta/corenlp

  /**
   * Annotate an ExtractionDataSet with entities. This will modify the ExtractionDataSet in place.
   *
   * @param doc The dataset to label
   */
  @Override
  public void annotate(Annotation doc) {
    if (SAVE_CONLL_2003) {
      // dump a file in CoNLL-2003 format
      try {
        PrintStream os = new PrintStream(new FileOutputStream("test.conll"));
        List<List<CoreLabel>> labels =
            AnnotationUtils.entityMentionsToCoreLabels(doc, annotationsToSkip, useSubTypes, useBIO);
        BasicEntityExtractor.saveCoNLL(os, labels, true);
        // saveCoNLLFiles("/tmp/ace/test", doc, useSubTypes, useBIO);
        os.close();
      } catch (IOException e) {
        e.printStackTrace();
        System.exit(1);
      }
    }

    List<CoreMap> sents = doc.get(CoreAnnotations.SentencesAnnotation.class);
    int sentCount = 1;
    for (CoreMap sentence : sents) {
      if (useNERTags) {
        this.makeAnnotationFromAllNERTags(sentence);
      } else extractEntities(sentence, sentCount);
      sentCount++;
    }

    /*
    if(SAVE_CONLL_2003){
      try {
        saveCoNLLFiles("test_output/", doc, useSubTypes, useBIO);
        System.err.println("useBIO = " + useBIO);
      } catch (IOException e) {
        e.printStackTrace();
        System.exit(1);
      }
    }
    */
  }