Developer's Manual

Key Components

Este has 3 main components:

Este Pintool
- Analyses the target binary and outputs log files into directory specified by este-config.toml
- Source files for this component is located in directory ./Este.
Post Processing
- Performs processing of log files and outputs into ./web/gen.
- Source files for this component is located in directory ./postprocessing
Visualization
- Starts a python server and opens a browser tab to display the visualization.
- Source files for this component is located in directory ./web

Component 1: Este Pintool

Commandline Usage

Este Pintool is of the form of two binaries Este-32.dll and Este-64.dll (32 bit and 64 bit), located in .\Este\build folders. Both binaries are required for Este to analyse 32 bit and 64 bit binaries.

Este Pintool requires PINTOOL's launcher binary pin.exe. The command to launch Este Pintool takes the following semantics:

pin.exe <PIN commandline switches> <este commandline switches> -- <target command>

PIN's commandline switches are documented here.

Este commandline switches are the following:

-config-file <config file name> 
    Config file for Este. Refer to este-config.toml for more details.

Example Este Pintool usage:

pin.exe -follow_execv -config-file "este-config.toml" -- notepad.exe text.txt

Output Format

Este Pintool outputs into the directory specified within the config file specified by the -config-file commandline switch.

Este Pintool outputs 5 files.

pid<pid>.log
pid<pid>.este.json
pid<pid>.rtn.csv
pid<pid>.bb.csv
pid<pid>.trace.csv

pid<pid>.log contains generic information about Este Pintool's and the target process's runtime, including any Este warnings or errors. In the event of a crash, if the exception was handled by Este Pintool, the error message will be present here.

pid<pid>.este.json contains generic information about the target process:

PID
Architecture
Images loaded

pid<pid>.rtn.csv contains all routines encountered (including those not executed) that are outside of the whitelisted binaries specified in -config-file. The fields are:

Name	Description
`idx`	- Index of routine
`addr`	- Address that marks the start of the routine
`rnt_name`	- Name of routine - Name of routine relies on symbols available. Otherwise it will be named `sub_<addr>`
`image_idx`	- Index of image (as specified in `pid<pid>.este.json`) at which the routine belongs in. - `image_idx = -1` if it doesn't belong to an image.
`section_idx`	- Index if executable section within image (as specified in `pid<pid>.este.json`) at which the routine belongs in - `section_idx = -1` if it doesn't belong to a section

pid<pid>.bb.csv contains all Basic Blocks encountered within the whitelisted binaries specified in -config-file. Basic blocks are defined as a number of contiguous instructions that drop through. Jumps, conditional jumps, calls, returns, etc are at the end of every basic block. The fields are:

Name	Description
`idx`	- Index of basic block (in order of first execution)
`addr`	- Address of start of basic block
`size`	- Size in bytes of basic block
`bytes`	- Hex encoded bytes of basic block
`image_idx`	- Index of image (as specified in `pid<pid>.este.json`) at which the block is in - `image_idx = -1` if it doesn't belong to an image
`section_idx`	- Index of executable section within image (as specified in `pid<pid>.este.json`) at which the block belongs in - `section_idx = -1` if it doesn't belong to a section

pid<pid>.trace.csv contains all basic blocks executed that are within the whitelisted binaries in -config-file. E.g. in a loop, the blocks within the loop are logged multiple times within the trace file. The fields are:

Name	Description
`bb_idx`	- Index of basic block executed (as specified in `pid<pid>.bb.csv`) - `bb_idx = -1` if the basic block is outside of whitelisted binaries. This entry indicates a jump outside of whitelisted binaries.
`os_tid`	- OS Thread ID that executed said block
`pin_tid`	- PIN Thread ID that executed said block (order of creation)
`rtn_called_idx`	- Index of routine called (as specified in `pid<pid>.rtn.csv`) - `rtn_called_idx = -1` if no routine was called in basic block.

Debugging Este Pintool

Add the PIN commandline switch -pause_tool <seconds> to pause the tool upon PIN initialization, and attach a debugger (e.g. WinDbg), and resume execution.

Component 2: Post Processing

Description

The Post Processing component involves the use of the files in the directory ./postprocessing, format_helper.py & main.py

Output format

Post Processing component outputs files into the ./web/gen directory

It outputs 1 + 'number of threads spawned' files

pid_tid_map.json
pid<pid>_tid<otid1>_ptid<ptid1>.json
pid<pid>_tid<otid2>_ptid<ptid2>.json
...

pid_tid_map.json contains the mapping of pid to the otids and ptids that are spawned, in this format:

<pid>: [
    {
        "pin_tid": <ptid>,
        "os_tid": <otid>
    },
    ...
]

pid<pid>_tid<otid>_ptid<ptid>.json contains the node and link object in a list, in this format:

{
    nodes: [
        <node object 1>,
        <node object 2>,
        ...
    ],
    links: [
        <link object 1>,
        <link object 2>,
        ...
    ]
}

A node object contains the details of the node that will be displayed using the d3js visualisation framework. It has the same fields as the Basic Blocks in pid<pid>.bb.csv with some additional ones:

Name	Description
`id`	- id of the node, will be refered to in the `source` and `target` fields of the link object
`count`	- number of times the Basic Block has been executed

A link object contains the details of the links that will join the nodes in the d3js framework. The fields are:

Name	Description
`source`	- id of the node, that the link will stem from
`target`	- id of the node, that the link will end at
`count`	- number of times the Basic Block has been executed

Component 3: Visualisation

Description

The Visualisation component consists of server.py, that starts up the web server, , default port 7777, serving the index.html file.

It uses the d3js visualisation framework and the 3d-force-graph

Usage of the framework is documented here: https://github.com/vasturiano/3d-force-graph

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Developers-Manual.md

Developers-Manual.md

Developer's Manual

Key Components

Component 1: Este Pintool

Commandline Usage

Output Format

Debugging Este Pintool

Component 2: Post Processing

Description

Output format

Component 3: Visualisation

Description

Files

Developers-Manual.md

Latest commit

History

Developers-Manual.md

File metadata and controls

Developer's Manual

Key Components

Component 1: Este Pintool

Commandline Usage

Output Format

Debugging Este Pintool

Component 2: Post Processing

Description

Output format

Component 3: Visualisation

Description