我在我的项目中使用open nlp.我想使用字典词库,但我无法正确输入posModel。我想使用字典词法器,但我无法正确输入posModel,我有一个summaryActivity类,在点击按钮时调用这个函数。
private String summaryTool(String documentText) throws Exception {
InputStream sentInput = getAssets().open("en_sent.bin");
InputStream tokenInput = getAssets().open("en_token.bin");
InputStream lemmaInput = getAssets().open("en_lemmatizer.dict");
FileInputStream posInput = getApplicationContext().openFileInput("en_pos_maxent.bin");
preProcessor = new PreProcessor(sentInput, tokenInput, lemmaInput, posInput);
grapher = new Grapher();
sentInput.close();
tokenInput.close();
lemmaInput.close();
posInput.close();
而我的预处理器类构造函数中,posModel被初始化了
public PreProcessor(InputStream sentenceModel, InputStream tokenizerModel, InputStream lemmaModel, InputStream pos) throws IOException {
SentenceModel sentModel = new SentenceModel(sentenceModel);
sentenceDetector = new SentenceDetectorME(sentModel);
TokenizerModel tokenModel = new TokenizerModel(tokenizerModel);
tokenizer = new TokenizerME(tokenModel);
//line with exception
POSModel posModel = new POSModel(pos);
posTagger = new POSTaggerME(posModel);
lemmatizer = new DictionaryLemmatizer(lemmaModel);
}
这是我的src文件夹的样子。我把en_pos_maxent.bin文件放在了多个地方,并试图加载它。
我曾试过使用 getAssets().open("en_pos_maxent.bin");
但我收到一个错误的输入流格式异常
W/System.err: opennlp.tools.util.InvalidFormatException: The profile data stream has an invalid format!
at opennlp.tools.dictionary.serializer.DictionaryEntryPersistor.create(DictionaryEntryPersistor.java:224)
at opennlp.tools.postag.POSDictionary.create(POSDictionary.java:228)
at opennlp.tools.postag.POSTaggerFactory$POSDictionarySerializer.create(POSTaggerFactory.java:296)
at opennlp.tools.postag.POSTaggerFactory$POSDictionarySerializer.create(POSTaggerFactory.java:293)
at opennlp.tools.util.model.BaseModel.finishLoadingArtifacts(BaseModel.java:312)
at opennlp.tools.util.model.BaseModel.loadModel(BaseModel.java:242)
at opennlp.tools.util.model.BaseModel.<init>(BaseModel.java:176)
at opennlp.tools.postag.POSModel.<init>(POSModel.java:97)
at com.mtah.tools.PreProcessor.<init>(PreProcessor.java:46)
at com.mtah.summerizer.SummaryActivity.summaryTool(SummaryActivity.java:57)
at com.mtah.summerizer.SummaryActivity.onCreate(SummaryActivity.java:36)
at android.app.Activity.performCreate(Activity.java:7820)
at android.app.Activity.performCreate(Activity.java:7809)
at android.app.Instrumentation.callActivityOnCreate(Instrumentation.java:1318)
at android.app.ActivityThread.performLaunchActivity(ActivityThread.java:3362)
at android.app.ActivityThread.handleLaunchActivity(ActivityThread.java:3526)
at android.app.servertransaction.LaunchActivityItem.execute(LaunchActivityItem.java:83)
at android.app.servertransaction.TransactionExecutor.executeCallbacks(TransactionExecutor.java:135)
at android.app.servertransaction.TransactionExecutor.execute(TransactionExecutor.java:95)
at android.app.ActivityThread$H.handleMessage(ActivityThread.java:2122)
at android.os.Handler.dispatchMessage(Handler.java:107)
at android.os.Looper.loop(Looper.java:214)
at android.app.ActivityThread.main(ActivityThread.java:7695)
W/System.err: at java.lang.reflect.Method.invoke(Native Method)
at com.android.internal.os.RuntimeInit$MethodAndArgsCaller.run(RuntimeInit.java:516)
at com.android.internal.os.ZygoteInit.main(ZygoteInit.java:950)
Caused by: org.xml.sax.SAXException: Can't create default XMLReader; is system property org.xml.sax.driver set?
at org.xml.sax.helpers.XMLReaderFactory.createXMLReader(XMLReaderFactory.java:160)
at opennlp.tools.dictionary.serializer.DictionaryEntryPersistor.create(DictionaryEntryPersistor.java:219)
... 25 more
我也尝试了灵魂液,从 此职位 但我得到了一个例外 java.lang.NullPointerException: Attempt to invoke virtual method 'java.lang.String java.util.Properties.getProperty(java.lang.String)' on a null object reference
我的问题是:我如何用以下方法打开 "en_pos_maxent.bin "文件? FileInputStream
貌似opennlps POSModel
只接受 InputStream
类型 FileInputStream
.对不起,如果我遗漏了任何信息,我不经常发帖,请让我知道,我会把它包括在内.任何帮助将被感激。
System.setProperty("org.xml.sax.driver", "org.xmlpull.v1.sax2.Driver");
try {
AssetFileDescriptor fileDescriptor =
getApplicationContext.getAssets().openFd("en_pos_maxent.bin");
FileInputStream inputStream = fileDescriptor.createInputStream();
POSModel posModel = new POSModel(inputStream);
posTaggerME = new POSTaggerME(posModel);
} catch (Exception e) {
//Handle exception
}