ANTLR4 grammar error

I have the following grammar:

grammar Token; 

prog: (expr NL?)+ EOF; 

expr: '[' type ']'; 

type : typeid ':' value; 

typeid : 'TXT' | 'ENC' | 'USR'; 

value: Text | INT; 

INT : '0' | [1-9] [0-9]*; 

//WS : [ \t]+; 
WS : [ \t\n\r]+ -> skip ; 
NL: '\r'? '\n'; 
Text : ~[\]\[\n\r"]+ ;

and the text I need to parse looks like this:

[TXT:look at me!] 
[USR:19700] 
[TXT:, can I go there?] 
[ENC:124124] 
[TXT:this is needed for you to go...]

I need to split this text, but I get errors when I run grun.bat Token prog -gui -trace -diagnostics

enter prog, LT(1)=[ 
enter expr, LT(1)=[ 
consume [@0,0:0='[',<3>,1:0] rule expr 
enter type, LT(1)=TXT:look at me! 
enter typeid, LT(1)=TXT:look at me! 
line 1:1 mismatched input 'TXT:look at me!' expecting {'TXT', 'ENC', 'USR'} 
... much more ...


What is wrong with my grammar? Please help me!


2014-10-06 thiagoh

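Text matches far too much. For example, it matches 'TXT'. Try making it more specific. – 2014-10-06 16:51:40

@TheANTLRGuy, but I need to match any text between 'TXT' and ']'. How can I make 'Text' more specific? – thiagoh 2014-10-06 16:54:39

Try ~']' maybe? – 2014-10-06 18:02:55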

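You have to understand that tokens are not created based on what the parser is trying to match. The lexer tries to match as many characters as possible (independently of the parser!), so your Text token needs a different definition.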
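To make the longest-match behaviour concrete, here is a rough Python sketch that imitates how the lexer picks tokens for the grammar in the question. It is not ANTLR itself: the names ORIGINAL_RULES and tokenize are made up for illustration, and the regular expressions are my hand translation of the lexer rules (literals from the parser rules first, then INT, WS, NL, Text).

```python
import re

# Hand-translated lexer rules of the question's grammar, in priority order.
# Literals used in parser rules are treated as coming before the named rules.
ORIGINAL_RULES = [
    ("'['", r"\["),
    ("']'", r"\]"),
    ("':'", r":"),
    ("'TXT'", r"TXT"),
    ("'ENC'", r"ENC"),
    ("'USR'", r"USR"),
    ("INT", r"0|[1-9][0-9]*"),
    ("WS", r"[ \t\n\r]+"),
    ("NL", r"\r?\n"),
    ("Text", r'[^\]\[\n\r"]+'),
]


def tokenize(text, rules, skipped=("WS",)):
    """Longest match wins; on a tie the rule listed first wins."""
    compiled = [(name, re.compile(pattern)) for name, pattern in rules]
    pos, tokens = 0, []
    while pos < len(text):
        best_name, best_len = None, 0
        for name, regex in compiled:
            match = regex.match(text, pos)
            # Strict '>' keeps the earlier rule when lengths are equal.
            if match and match.end() - pos > best_len:
                best_name, best_len = name, match.end() - pos
        if best_name is None:
            raise ValueError(f"no rule matches at offset {pos}")
        if best_name not in skipped:
            tokens.append((best_name, text[pos:pos + best_len]))
        pos += best_len
    return tokens


print(tokenize("[TXT:look at me!]", ORIGINAL_RULES))
```

In this sketch 'TXT' only covers three characters while Text covers everything up to the closing bracket, so Text wins and the parser never sees a 'TXT' token. That lines up with the "mismatched input 'TXT:look at me!'" message in the trace.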

You could make Text a parser rule instead, and have it match single-character tokens, like this:

grammar Token; 

prog : expr+ EOF; 
expr : '[' type ']'; 
type : typeid ':' value; 
typeid : 'TXT' | 'ENC' | 'USR'; 
value : text | INT; 
text : CHAR+; 

INT : '0' | [1-9] [0-9]*; 
WS : [ \t\n\r]+ -> skip ; 
CHAR : ~[\[\]\r\n];
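As a rough sanity check of the grammar above, the following Python sketch imitates its lexer under the same longest-match, first-rule-wins assumption. Again this is not ANTLR: REVISED_RULES and tokenize are hypothetical names, and the regular expressions are a hand translation of the rules.

```python
import re

# Hand-translated lexer rules of the revised grammar, in priority order.
REVISED_RULES = [
    ("'['", r"\["),
    ("']'", r"\]"),
    ("':'", r":"),
    ("'TXT'", r"TXT"),
    ("'ENC'", r"ENC"),
    ("'USR'", r"USR"),
    ("INT", r"0|[1-9][0-9]*"),
    ("WS", r"[ \t\n\r]+"),
    ("CHAR", r"[^\[\]\r\n]"),
]


def tokenize(text, rules, skipped=("WS",)):
    """Longest match wins; on a tie the rule listed first wins."""
    compiled = [(name, re.compile(pattern)) for name, pattern in rules]
    pos, tokens = 0, []
    while pos < len(text):
        best_name, best_len = None, 0
        for name, regex in compiled:
            match = regex.match(text, pos)
            # Strict '>' keeps the earlier rule when lengths are equal.
            if match and match.end() - pos > best_len:
                best_name, best_len = name, match.end() - pos
        if best_name is None:
            raise ValueError(f"no rule matches at offset {pos}")
        if best_name not in skipped:
            tokens.append((best_name, text[pos:pos + best_len]))
        pos += best_len
    return tokens


print(tokenize("[TXT:hi!]", REVISED_RULES))
print(tokenize("[USR:19700]", REVISED_RULES))
```

Because CHAR is only ever one character long, it can no longer swallow 'TXT', ':' or a number, so the parser gets the tokens it expects. Note that in this sketch spaces are still consumed by WS before CHAR sees them, so they do not show up as CHAR tokens; whether that matters depends on how you read the text back out.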


2014-10-06 18:41:35
