Difference in grammar parsed by JavaCC and ANTLR - antlr

JavaCC and Antlr wikipedia page says that both the parser generator works for grammar written in EBNF format. Does it mean that any grammar that can be parsed by JavaCC can also be parsed by ANTLR without modifying the grammar structure? If yes, why do we have a grammar repository for ANTLR (https://github.com/antlr/grammars-v4) ? My current understanding is that some differences do exist between the grammars parsed by JavaCC and ANTLR. Can someone please point out the differences.

Related

Standard EBNF grammar of JsonPath

I wan't to implement my own JsonPath library but I can not find standard definition of its grammar. I found https://github.com/kevinbirch/kanabo/blob/master/jsonpath.ebnf but it look like much more complicated. Where can I find the complete version of JsonPath grammar?
The closes thing I can find is the PostgreSQL Bison grammar, and the SQL/JSON proposal. It's not EBNF but it may serve your purpose.
PostgreSQL Bison grammar:
https://github.com/postgres/postgres/blob/0af302af40669d9bc6de7a202e4723fee39d9376/src/backend/utils/adt/jsonpath_gram.y
SQL/JSON proposal (with BNF grammar):
https://www.wiscorp.com/pub/DM32.2-2014-00025r1-sql-json-part-2.pdf

Grammar or specification of Bison/Lemon files

I need to write parser for Bison and Lemon grammar files. I'm looking for specification or grammar of these files format. Any links will be helpful.
It is easy to get a grammar of Bison as it is bootstrapped: just get its parser from the repository.

Generate only a Lexer from Antlr

I am attempting to use Antlr to tokenize and classify the tokens of an input stream. Does anyone know of a way to generate only a Lexer from Antlr using a grammar with only Lexer rules?
You can specify the type of grammar you want using the grammar title line.
grammar MyGrammar;
for combined grammars.
lexer grammar MyLexer;
for a lexer grammar (etc.). Of course in a pure lexer grammar you may only use lexer rules.
You can basically generate a parser and extend the listener class then inside every exitMethod() push the tokens into a stack.
You can't generate only a lexer. If you are not familiar with ANTLR 4 grammar or the steps required to generate a parser, I advise you to spend 10 mins in reading this book "The definitive of ANTLR 4".

Generate EBNF from ANTLR

Anybody know of a tool, that generates EBNF from ANTLR?
ANTLR is already close to EBNF, but for documentation purpose I would like to have a clean EBNF description (without the Code in between).
With antlrworks and this its already nice to get the syntax diagrams:
java -cp antlrworks-1.1.4.jar org.antlr.works.Console -f yql.g -o output/ -sd eps
but it would like to have a bare textual description, preferable text, tex, html, xml, or similar.
I have an online tool that converts foreign grammars to W3C grammar notation. It has an ANTLR3 grammar parser, so maybe this gets close to what you were looking for. W3C notation is also useful for generating syntax diagrams.

Pretty print ANTLR grammar

Some tools output an Antlr grammar in a human-unreadable form, at least with ugly placing of parens and indentation. I'd like to transform the grammar into a more readable (standard?) form. The only reference I found is ANTLR pretty printer which is quite old, and looking at its source, it seems to be removing parts of a grammar rather than pretty print it.
How can I format/pretty print a grammar file?
I know of no tool that does this. The one you mentioned, prettyPrinter, is written in - and seems to handle only - ANTLR v2.x grammars, making it unsuitable for v3 grammars.
If you're going to write your own, I'd recommend using the grammar of ANTLR v3 itself to parse a .g grammar file and emit it in a readable form. Terence Parr has posted the grammar here: http://www.antlr.org/grammar/ANTLR
I just installed an Antlr plugin for Eclipse. It can do a lot more than syntax highlight and code formatting...