关于org.apache.lucene.queryParser.ParseException: Encountered 解决方法
現(xiàn)象:
org.apache.lucene.queryParser.ParseException: Encountered "<EOF>" at line 1, column 0. Was expecting one of:<NOT> ..."+" ..."-" ..."(" ...<QUOTED> ...<TERM> ...<PREFIXTERM> ...<WILDTERM> ..."[" ..."{" ...<NUMBER> ...at org.apache.lucene.queryParser.QueryParser.generateParseException(QueryParser.java:1226) at org.apache.lucene.queryParser.QueryParser.jj_consume_token(QueryParser.java:1109) at org.apache.lucene.queryParser.QueryParser.Clause(QueryParser.java:759) at org.apache.lucene.queryParser.QueryParser.Query(QueryParser.java:684) at ch2.lucenedemo.process.Test.RunVsIndex(Test.java:142) at ch2.lucenedemo.process.Test.main(Test.java:169)方法一:
?如果出現(xiàn)了下列錯誤,那是因?yàn)橛缅e了函數(shù)。把queryParser.Query改稱queryParser.parse就通過了
方法二:
1、提問:
I am working on a classification problem to classify product reviews as positive, negative or neutral as per the training data using Lucene API.
I am using an ArrayList of Review objects - "reviewList" that stores the attributes for each review while crawling the web pages.
The review attributes which include "polarity" & "review content" are then indexed using the indexer. Thereafter, based on the indexes objects, I need to classify the remaining review objects. But while doing so, there is a review object for which the Query parser is encountering an EOF character in the "review content", and hence terminating.
The line causing error has been commented accordingly -
IndexReader reader = IndexReader.open(FSDirectory.open(new File("index")));IndexSearcher searcher = new IndexSearcher(reader);Analyzer analyzer = new StandardAnalyzer(Version.LUCENE_31);QueryParser parser = new QueryParser(Version.LUCENE_31, "Review", analyzer);int length = Crawler.reviewList.size();for (int i = 200; i < length; i++) {String true_class;double r_stars = Crawler.reviewList.get(i).getStars();if (r_stars < 2.0) {true_class = "-1";} else if (r_stars > 3.0) {true_class = "1";} else {true_class = "0";}String[] reviewTokens = Crawler.reviewList.get(i).getReview().split(" ");String parsedReview = "";int j;for (j = 0; j < reviewTokens.length; j++) {if (reviewTokens[j] != null) {if (!((reviewTokens[j].contains("-")) || (reviewTokens[j].contains("!")))) {parsedReview += reviewTokens[j] + " ";}} else {break;}}Query query = parser.parse(parsedReview); // CAUSING ERROR!!TopScoreDocCollector results = TopScoreDocCollector.create(5, true);searcher.search(query, results);ScoreDoc[] hits = results.topDocs().scoreDocs;I've parsed the text manually to remove the characters that are causing the error, apart from checking if the next string is null...but the error persists.
This is the error stack trace -
Exception in thread "main" org.apache.lucene.queryParser.ParseException: Cannot parse 'I made the choice ... be all "thumbs ': Lexical error at line 1, column 938. Encountered: <EOF> after : "\"thumbs " at org.apache.lucene.queryParser.QueryParser.parse(QueryParser.java:216) at Sentiment_Analysis.Classification.classify(Classification.java:58) at Sentiment_Analysis.Main.main(Main.java:17) Caused by: org.apache.lucene.queryParser.TokenMgrError: Lexical error at line 1, column 938. Encountered: <EOF> after : "\"thumbs " at org.apache.lucene.queryParser.QueryParserTokenManager.getNextToken(QueryParserTokenManager.java:1229) at org.apache.lucene.queryParser.QueryParser.jj_scan_token(QueryParser.java:1709) at org.apache.lucene.queryParser.QueryParser.jj_3R_2(QueryParser.java:1598) at org.apache.lucene.queryParser.QueryParser.jj_3_1(QueryParser.java:1605) at org.apache.lucene.queryParser.QueryParser.jj_2_1(QueryParser.java:1585) at org.apache.lucene.queryParser.QueryParser.Clause(QueryParser.java:1280) at org.apache.lucene.queryParser.QueryParser.Query(QueryParser.java:1266) at org.apache.lucene.queryParser.QueryParser.Clause(QueryParser.java:1313) at org.apache.lucene.queryParser.QueryParser.Query(QueryParser.java:1266) at org.apache.lucene.queryParser.QueryParser.TopLevelQuery(QueryParser.java:1226) at org.apache.lucene.queryParser.QueryParser.parse(QueryParser.java:206) ... 2 more Java Result: 1Please help me solve this problem...have been banging my head with this for hours now!
2、問答
You should escape the double quote and other special characters via
Query query = parser.parse(QueryParser.escape(parsedReview));As the?QueryParser.escape?Javadoc suggested,
Returns a String where those characters that QueryParser expects to be escaped are escaped by a preceding '\'.小結(jié):使用 QueryParser的靜態(tài)方法QueryParser.escape(string s),進(jìn)行自動轉(zhuǎn)義特殊字符后再進(jìn)行關(guān)鍵字的查詢
?
原文出處:
現(xiàn)象及方法一:
不設(shè)限, org.apache.lucene.queryParser.ParseException: Encountered "<EOF>" at line 1, column 0.?https://blog.csdn.net/tengdazhang770960436/article/details/17881671
方法二:
https://stackoverflow.com/questions/10259907/lucene-exception-query-parser-encountered-eof-after-some-word
轉(zhuǎn)載于:https://www.cnblogs.com/ryelqy/p/10104041.html
總結(jié)
以上是生活随笔為你收集整理的关于org.apache.lucene.queryParser.ParseException: Encountered 解决方法的全部內(nèi)容,希望文章能夠幫你解決所遇到的問題。
- 上一篇: Centos 7 安装 ifconfig
- 下一篇: JAVA设计模式-策略模式