Files
llgo/chore/llcppg/design.md

320 lines
7.2 KiB
Markdown
Raw Normal View History

2024-07-25 17:23:34 +08:00
llcppg Design
2024-07-23 09:41:03 +08:00
=====
2024-07-23 09:41:03 +08:00
## Usage
2024-07-23 09:41:03 +08:00
```sh
llcppg [config-file]
```
2024-07-23 09:41:03 +08:00
If `config-file` is not specified, a `llcppg.cfg` file is used in current directory. The configuration file format is as follows:
```json
{
2024-07-25 17:23:34 +08:00
"name": "inih",
2024-07-23 09:41:03 +08:00
"cflags": "$(pkg-config --cflags inireader)",
"include": [
"INIReader.h",
"AnotherHeaderFile.h"
],
2024-07-25 17:23:34 +08:00
"libs": "$(pkg-config --libs inireader)",
"trimPrefixes": ["Ini", "INI"]
2024-07-23 09:41:03 +08:00
}
```
## Steps
1. llcppsymg: Generate symbol table for a C/C++ library
2. Manually modify the desired Go symbols in symbol table
3. llcppsigfetch: Fetch information of C/C++ symbols
4. gogensig: Generate a go package by information of symbols
### llcppsymg
```sh
llcppsymg config-file
2024-07-25 17:23:34 +08:00
llcppsymg - # read config from stdin
2024-07-23 09:41:03 +08:00
```
2024-07-25 17:23:34 +08:00
It generates a symbol table file named `llcppg.symb.json`. Its file format is as follows:
2024-07-23 09:41:03 +08:00
```json
[
{
"mangle": "_ZN9INIReaderC1EPKcm",
"c++": "INIReader::INIReader(char const*, unsigned long)",
"go": "(*Reader).Init__0"
2024-07-25 18:46:40 +08:00
}
2024-07-23 09:41:03 +08:00
]
```
2024-07-25 17:23:34 +08:00
### llcppsigfetch
```sh
llcppsigfetch config-file
llcppsigfetch - # read config from stdin
```
It fetches information of C/C++ symbols and print to stdout. Its format is as follows:
2024-07-25 18:46:40 +08:00
```
2024-07-25 17:23:34 +08:00
TODO: see llgo/xtool/clang/ast
```
### gogensig
```sh
gogensig ast-file
gogensig - # read AST from stdin
```
2024-07-23 09:41:03 +08:00
## Overall
### Process
2024-07-26 16:14:41 +08:00
1. The Parsing Module reads `llcppg.cfg` to obtain dynamic libraries, header files, and the package name. After parsing, it writes the generated `llcppg.symb.json` path into `llcppg.cfg`.
2. The Function Declaration Generation Module reads `llcppg.cfg` to get the package name, header files, and the previously generated `llcppg.symb.json`. After parsing, it generates the function prototype `llcppg.function.json`.
3. Reads the previously generated `llcppg.information.json`, stores it as a structure, and uses gogen to generate code based on the structure.
## Parsing Module
### Input
2024-07-26 16:14:41 +08:00
Obtains the paths to header files and dynamic library files by reading the JSON file `llcppg.cfg`.
```json
{
2024-07-26 16:14:41 +08:00
"name": "inih",
"cflags": "$(pkg-config --cflags INIReader)",
"include": [
"INIReader.h",
"AnotherHeaderFile.h"
],
2024-07-26 16:14:41 +08:00
"libs": "$(pkg-config --libs INIReader)",
"trimPrefixes": ["Ini", "INI"]
}
```
```bash
2024-07-26 16:14:41 +08:00
llcppsymg config-file
```
### Implementation Steps
1. Parse dylib and store:
```go
2024-07-26 16:14:41 +08:00
// types.go
type CPPSymbol struct {
2024-07-26 16:14:41 +08:00
Symbol string `json:"symbol"`
Type string `json:"type"`
Name string `json:"name"`
}
// parser_dylib.go
2024-07-26 16:14:41 +08:00
func parseDylibSymbols(lib string) ([]common.CPPSymbol, error)
```
2. Parse header files and store:
```go
// common.go
type ASTInformation struct {
2024-07-26 16:14:41 +08:00
Namespace string `json:"namespace"`
Class string `json:"class"`
Name string `json:"name"`
BaseClasses []string `json:"baseClasses"`
ReturnType string `json:"returnType"`
Location string `json:"location"`
Parameters []Parameter `json:"parameters"`
Symbol string `json:"symbol"`
}
type Parameter struct {
2024-07-26 16:14:41 +08:00
Name string `json:"name"`
Type string `json:"type"`
}
// parser_ast.go
2024-07-26 16:14:41 +08:00
func parseHeaderFile(config types.Config) ([]common.ASTInformation, error)
```
3. Cross-reference data from the first two steps to get the final output
```go
// common.go
2024-07-26 16:14:41 +08:00
type SymbolInfo struct {
Mangle string `json:"mangle"` // C++ Symbol
CPP string `json:"c++"` // C++ function name
Go string `json:"go"` // Go function name
}
// common_symbols.go
2024-07-26 16:14:41 +08:00
func getCommonSymbols(dylibSymbols []common.CPPSymbol, astInfoList []common.ASTInformation) []common.SymbolInfo {
```
2024-07-26 16:14:41 +08:00
4. Generate `llcppg.symb.json` file and store the JSON file path into `llcppg.cfg`
```go
2024-07-26 16:14:41 +08:00
func generateJSON([]CommonSymbolInfo)
```
2024-07-26 16:14:41 +08:00
5. Example `llcppg.symb.json` file
```json
{
"FunctionName": "A::B::C",
"Symbol": "_ZN9INIReaderC1ERKNSt3__112basic_stringIcNS0_11char_traitsIcEENS0_9allocatorIcEEEE",
"Location": "a.h",
"UserFunctionName": "CFromA"
}
```
## Function Declaration Generation Module
### Input
2024-07-26 16:14:41 +08:00
No input required, directly reads the `llcppg.cfg` file
### Implementation Steps
1. Execute the executable
```bash
2024-07-26 16:14:41 +08:00
llcppsigfetch config-file
```
2. Parse header files
```go
// common.go
type ASTInformation struct {
2024-07-26 16:14:41 +08:00
Namespace string `json:"namespace"`
Class string `json:"class"`
Name string `json:"name"`
BaseClasses []string `json:"baseClasses"`
ReturnType string `json:"returnType"`
Location string `json:"location"`
Parameters []Parameter `json:"parameters"`
Symbol string `json:"symbol"`
}
type Parameter struct {
2024-07-26 16:14:41 +08:00
Name string `json:"name"`
Type string `json:"type"`
}
// parser_ast.go
func ParseHeaderFile(filePath string) ([]common.ASTInformation, error)
```
3. Generate the final JSON mapping file `llcppg.information.json`
```go
func GenerateJSONFile(info []common.ASTInformation)
```
```json
{
2024-07-26 16:14:41 +08:00
"functionName": "A::B::C",
"symbol": "_ZN9INIReaderC1ERKNSt3__112basic_stringIcNS0_11char_traitsIcEENS0_9allocatorIcEEEE",
"location": "a.h",
"returnType": "int",
"userFunctionName": "CFromA",
"parameters": [
{
2024-07-26 16:14:41 +08:00
"arg1": "int"
},
{
2024-07-26 16:14:41 +08:00
"arg2": "*char"
}
2024-07-26 16:14:41 +08:00
]
}
```
## Code Generation Module
### Input
No input required, directly reads `llcppg.information.json` file
### Implementation Steps
1. Execute the executable
```bash
2024-07-26 16:14:41 +08:00
gogensig ast-file
```
2. Parse JSON file
```go
// common.go
type HeaderFileInfo struct {
2024-07-26 16:14:41 +08:00
FunctionName string `json:"functionName"`
Symbol string `json:"symbol"`
Location string `json:"location"`
UserFunctionName string `json:"userFunctionName"`
Parameters map[string]string `json:"parameters"`
}
// parse_json.go
func ParseJSON(jsonFilePath string) ([]common.HeaderFileInfo, error)
```
3. Generate code using the parsed structure with gogen
```go
// generator.go
func GenerateCode(info []common.HeaderFileInfo) {
pkg := gogen.NewPackage("", PackageName, nil)
cm := comment(fmt.Sprintf("llgo:link %s %s", funcName1, symbol1))
pkg.NewFunc(recv, funcName, params, results, variadic).SetComments(pkg, cm).BodyStart(pkg).End()
}
```
### Output
1. Directory structure
```bash
package_name/
├── _demo
├── demo1.go
└── llgo_link.go
└── a.go
└── b.go
└── c.go
```
Note that `demo1.go` file needs to be written by the user
2. `llgo_link.go` is responsible for linking configuration
```go
package inih
const (
LLGoFiles = "$(pkg-config --cflags INIReader): _wrap/reader.cpp"
LLGoPackage = "link: $(pkg-config --libs inih INIReader); -linih -lINIReader"
)
```
3. Example content for `a.go`
```go
package inih
import (
_ "unsafe"
"github.com/goplus/llgo/c"
)
//go:linkname Parse C.ini_parse
func Parse(filename *c.Char, handler func(user c.Pointer, section *c.Char, name *c.Char, value *c.Char) c.Int, user c.Pointer) c.Int
//go:linkname ParseFile C.ini_parse_file
func ParseFile(file c.FilePtr, handler func(user c.Pointer, section *c.Char, name *c.Char, value *c.Char) c.Int, user c.Pointer) c.Int
//go:linkname ParseString C.ini_parse_string
func ParseString(str *c.Char, handler func(user c.Pointer, section *c.Char, name *c.Char, value *c.Char) c.Int, user c.Pointer) c.Int
```