summaryrefslogtreecommitdiffstats
path: root/debian/uncrustify-trinity/uncrustify-trinity-0.78.1/documentation/theory.txt
diff options
context:
space:
mode:
Diffstat (limited to 'debian/uncrustify-trinity/uncrustify-trinity-0.78.1/documentation/theory.txt')
-rw-r--r--debian/uncrustify-trinity/uncrustify-trinity-0.78.1/documentation/theory.txt129
1 files changed, 0 insertions, 129 deletions
diff --git a/debian/uncrustify-trinity/uncrustify-trinity-0.78.1/documentation/theory.txt b/debian/uncrustify-trinity/uncrustify-trinity-0.78.1/documentation/theory.txt
deleted file mode 100644
index 37ba9da1..00000000
--- a/debian/uncrustify-trinity/uncrustify-trinity-0.78.1/documentation/theory.txt
+++ /dev/null
@@ -1,129 +0,0 @@
-
--------------------------------------------------------------------------------
-This document is too incomplete to be of much use.
-Patches are welcome!
-
-
-Theory of operation
--------------------
-
-Uncrustify goes through several steps to reformat code.
-The first step, parsing, is the most complex and important.
-
-
-Step 1 - Tokenize
------------------
-C code must be understood to some degree to be able to be properly indented.
-The parsing step reads in a text buffer and breaks it into chunks and puts
-those chunks in a list.
-
-When a chunk is parsed, the original column and line are recorded.
-
-These are the chunks that are parsed:
- - punctuators
- - numbers
- - words (keywords, variables, etc)
- - comments
- - strings
- - whitespace
- - preprocessors
-
-See token_enum.h for a complete list.
-See punctuators.cpp and keywords.cpp for examples of how they are used.
-
-In the code, chunk types are prefixed with 'CT_'.
-The CT_WORD token is changed into a more specific token using the lookup table
-in keywords.cpp
-
-
-Step 2 - Tokenize Cleanup
--------------------------
-
-The second step is to change the token type for certain constructs that need
-to be adjusted early on.
-For example, the '<' token can be either a CT_COMPARE or CT_ANGLE_OPEN.
-Both are handled very differently.
-If a CT_WORD follows CT_ENUM/CT_STRUCT/CT_UNION, then it is marked as a CT_TYPE.
-Basically, anything that doesn't depend on the nesting level can be done at this
-stage.
-
-
-Step 3 - Brace Cleanup
--------------------------
-
-This is possibly the most difficult step.
-do/if/else/for/switch/while bodies are examined and virtual braces are added.
-Brace parent types are set.
-Statement start and expression starts are labeled.
-And #ifdef constructs are handled.
-
-This step determines the levels (m_braceLevel, level and m_ppLevel).
-
-REVISIT:
- The code in brace_cleanup.cpp needs to be reworked to take advantage of being
- able to scan forward and backward. The original code was going to be merged
- into tokenize.cpp, but that was WAY too complex.
-
-
-Step 4 - Fix Symbols (combine.cpp)
-----------------------------------
-
-This step is no longer properly named.
-In the original design, neighboring chunks were to be combined into longer
-chunks. This proved to be a silly idea. But the name of the file stuck.
-
-This is where most of the interesting identification stuff goes on.
-Colons type are detected, variables are marked, functions are labeled, etc.
-Also, all the punctuators are classified. Ie, CT_MINUS become CT_NEG or CT_ARITH.
-
- - Types are marked.
- - Functions are marked.
- - Parenthesis and braces are marked where appropriate.
- - finds and marks casts
- - finds and marks variable definitions (for aligning)
- - finds and marks assignments that may be aligned
- - changes CT_INCDEC_AFTER to CT_INCDEC_BEFORE
- - changes CT_STAR to either CT_PTR_TYPE, CT_DEREF or CT_ARITH
- - changes CT_MINUS to either CT_NEG or CT_ARITH
- - changes CT_PLUS and CT_ADDR to CT_ARITH, if needed
- - other stuff?
-
-
-Casts
------
-Casts are detected as follows:
- - paren pair not part of if/for/etc nor part of a function
- - contains only CT_QUALIFIER, CT_TYPE, '*', and no more than one CT_WORD
- - is not followed by CT_ARITH
-
-Tough cases:
-(foo) * bar;
-
-If uncertain about a cast like this: (foo_t), some simple rules are applied.
-If the word ends in '_t', it is a cast, unless followed by '+'.
-If the word is all caps (FOO), it is a cast.
-If you use custom types (very likely) that aren't detected properly (unlikely),
-the add them to the config file like so: (example Using C-Sharp types)
-type UInt32 UInt16 UInt8 Byte
-type Int32 Int16 Int8
-
-
-Step 6+ Everything else
--------------------------
-
-From this point on, many filters are run on the chunk list to change the
-token columns.
-
-indent.cpp sets the left-most column.
-align.cpp set the column for individual chunks.
-space.cpp sets the spacing between chunks.
-Others insert newlines, change token position, etc.
-
-
-Last Step - Output
--------------------------
-
-At the final step the list is printed to the output.
-Everything except comments are printed as-is.
-Comments are reformatted in the output stage.
-