We use ContextManager (“with … as” statement) in Python because Python’s fundamental language design (garbage collecting objects) broke RAII

Posted on February 26, 2025 by admin

[TLDR] Python doesn’t have RAII. C++ and MATLAB allows RAII. You can have a proper RAII only if destructor timing is 100% controllable by the programmer.

Python uses Context Manager (with ... as idiom) to address the old issue of opening up a resource handler (say a file or network socket) and automatically close (free) it regardless of whether the program quit abruptly or it gracefully terminates after it’s done with the resource.

Unlike destructors in C++ and MATLAB, which registers what to do (such as closing the resource) when the program quits or right before the resource (object) is gone, Python’s Context Manager is basically rehasing the old try-block idea by creating a rigid framework around it.

It’s not that Python doesn’t know the RAII mechanism (which is much cleaner), but Python’s fundamental language design choices drove itself to a corner so it’s stuck micro-optimizing the try-except/catch-finally approach of managing opened resourecs:

Everything is seen as object in Python. Even integers have a ton of methods.
MATLAB and C++ treats POD, Plain Old Data, such as integers separately from classes
Python’s garbage collector controls the timing of when the destructor of any object is called (del merely decrement the reference count).
MATLAB’s do not garbage-collect objects so the destructor timing is guaranteed.
C++ has no garbage collection so the destructor timing is guaranteed and managed by the programmer.

Python cannot easily exclude garbage collecting classes (which breaks RAII) because fundamentally everything are classes (dictionaries potentially with callables) in Python.

This is one of the reasons why I have a lot of respects for MATLAB for giving a lot of consideration for corner cases (like what ’empty’ means) in their language design decisions. Python has many excellent ideas but not enough thoughts was given to how these ideas interact to produce unwanted/surprising side effects.

Pythons documentation says out loud right what it does: with ... as ... is effectively a rigidly defined try-except-finally block:

Context Manager heavily depends on resource opener function (EXPR) to return a constructed class instance that implements __exit__ and __enter__, so if you have a C external library imported to Python, like python-ft4222, likely you have to write in your context manager in full when you write your wrapper.

Typically the destructor should check if the resource is already closed first, then close it if it wasn’t already closed. Take io.IOBase as an example:

However, this is only a convenience when you are at the interpreter and can live with the destructor called with a slight delay.

To make sure your code work reliably without timing bugs, you’ll need to explicitly close it somewhere other than at a destructor or rely on object lifecycle timing. The destructor can acts as a double guard to close it again if it hasn’t, but it should not be relied on.

The with ... as construct is extremely ugly, but it’s one of the downsides of Python that cannot be worked around easily. It also makes it difficult for users to retry acquiring a resource because one way or another retrying involves injecting the retry logic in __enter__. It’s not that much typographic savings using with ... as over try-except-finally block if you don’t plan to recycle th contextmanager and the cleanup code is a one-liner.

Pandas DataFrame in Python (1): Disadvantage of using attributes (dot notation) to access columns. Use `[]` (getitem) operator instead

Posted on February 19, 2025 by admin

There are two ways to access columns in DataFrame. The preferred way is by square brackets (indexing into it like a dictionary), while it’s tempting to use the neater dot notation (treating columns like an attribute), my recommendation is don’t!

Python has dictionaries that handles arbitary labels well while it doesn’t have dynamic field names like MATLAB do. This puts DataFrame at a disadvantage developing dot notation syntax while the dictionary syntax opens up a lot of possibilities that are worth giving up dot notation for. The nature of the language design makes the dot notation very half-baked in Python and it’s better to avoid it altogether

Reason 1: Cannot create new columns with dot notation

UserWarning: Pandas doesn't allow columns to be created via a new attribute name - see https://pandas.pydata.org/pandas-docs/stable/indexing.html#attribute-access

Reason 2: Only column names that doesn’t happen to be valid Python attribute names (say, no spaces) AND a DataFrame that does not have any method with the same name as the column can be accessed through dot notation.

Take an example of dataframe constructed from device info dictionaries created by the package pyft4222. I added a column called 'test me' to a table converted from the dictionary of device info. The tabe T looks like this:

I tried dir() on the table and noticed:

The column name "test me" did not appear anywhere, not even mangled. It has a space in between so it’s not a valid attribute or variable name, so this column is effectively hidden from the dot notation
flags is an internal attribute of DataFrame and it was not overriden by the data column flags when called by the dot notation. This means the flags column was also shadowed in (aka hidden to) the dot notation as there were no mangled name for it either

Even more weird is that getattr() works for columns with non-qualified attribute name like test me (despite the dot notation cannot access it because of the lack of dynamic field names syntax yet test me doesn’t show up in dir()). getattr(T, 'flags') still gets the DataFrame’s internal attribute flags instead of the column called flags as expected.

Dictionary of equivalent/analogous concepts in programming languages

Posted on February 19, 2025 by admin

Common	C	C++	MATLAB	Python
Variable arguments	`<stdarg.h>` `T f(...)` Packed in `va_arg`	Very BAD! Cannot overload when signatures are uncertain.	`varargin` `varargout` Both packed as cells. MATLAB does not have named arguments	`args` (simple, stored as tuples) `*kwargs` (specify input by keyword, stored as a dictionary)
Referencing	N/A	`operator[]`	`(_)` is for references `subsindex subsassgn` `[_]` is for concat `{_}` is for (un)pack	`__getitem__() __setitem__()`
Default values	N/A	Supported	Not supported. Manage with `inputParser()` or newer `arguments`	Non-intuitive static data behavior. Stick to `None` or immutables.
Name-Value Argument Matching			Old way: `.., 'PropName', Value` and parse `varargin` Since R2021a: `Name=Value` `options` in `arguments`	`Name=Value` `**kwargs`
Major Dimension	Row	Row	Column	Row (Native/Numpy) Column for Pandas
Constness	`const`	`const`	Only in classes	N/A (Consenting adults)
Variable Aliasing	Pointers	References	NO! Rely on Copy-on-write (No in-place functions*) Handle classes under limited circumstances	References
`=` assignment	Copy one element	Values: Copy References: Bind	New Copy Copy-on-write	NO VALUES Bind references only (could be to unnamed objects)
Chained access operators	N/A	Difficult to operator overload it right	Difficult to get it right. MATLAB had some chaining bugs with `dataset()` as well.	Chains correctly natively
Assignment expressions (assignment evaluates to assigned lvalue)	`=`	`=`	N/A	Named Expression `:=`
Version Management			`verLessThan()` `isMATLABReleaseOlderThan`	`virtenv` (Virtual Environment)
Exponentiation	`<math.h>` `pow()`	`<cmath>` `pow()`	`^`	`**`
Stream (Conveyor belt mechanism. Saves memory)	I/O (std, file, sockets)	`iterator` in STL containers	MATLAB doesn’t do references. Just increment indices.	iterators (uni-directional only) `iter(): __iter__()` `next(): __next__()`
Looping	for(init, cont_cond, next)	C-style for(auto running: iterable)	for k = array to iterate	list-comp for (index, thing) in enumerate(lists)

Since MATLAB doesn’t do references, iterators (by extension generators) and functions that do in-place operations do not make sense (unless you bend it very hard with anti-patterns such as handles and dbstack).

Data Types

Common	C	C++	MATLAB	Python
Sets	N/A	`std::set`	Only set operations, not set data type	`{ , , ...}`
Dictionaries		`std::unordered_map`	– Dynamic fieldnames (qualified varnames as keys) – `containers.Map()` or `dictionary()` since R2022b	Dictionaries `{key:value}` (Native)
Heterogeneous containers			cells `{}`	lists (mutable) tuples (immutable)
Structured Heterogeneous containers			`table()` `dataset()` [Old] Mix in classes	Pandas Dataframe
Array, Matrices & Tensors			Native `[ , ; , ]`	Numpy/PyTorch
Records	struct	class (members)	dynamic field (structs) properties (class) `getfield()/setfield()`	No structs (use dicts) attribute (class) `getattr()/setattr()`
Type deduction	N/A	`auto`	Native	Native
Type extraction	N/A	`decltype()` for compile time (static) `typeid()` for RTTI (runtime)	`class()`	`type()`
Categorical Arrays		`categorical()` Previously `ordinal()/nominal()`	`pd.cut(x, bins, labels)`

Native sets operations in Python are not stable and there’s no option to use stable algorithm like MATLAB does. Consider installing orderly-set package.

Array Operations

Common	MATLAB	Python
Repeat	`repmat()`	`[] * N` `np.repeat()`
Logical Indexing	Native	List comprehension Boolean Indexing (Numpy)
Equally spaced numbers	Internally `colon()`: `start:step:end` `linspace`/`logspace`	`range(begin, past_end, step)` produces an iterator `list(range())` or `tuple(range())` iterates to realize the vector
Equally spaced indexing	MATLAB has no generators, so produced vector only	`[start:past_end:step]` is internally `slice()` which produces a slice object, not range/lists/tuple. Faster but not iterable
Shallow copy	Deep copy-on-write	Slice: `x = y[:]` `copy.copy()`
Deep copy	Deep copy-on-write	`copy.deepcopy()`

Editor Syntax

Common	C	C++	MATLAB	Python
Commenting	`/* ... */` `//` (only for newer C)	`//` (single line) `/* ... */` (block)	`%` (single line) (Block): `%{ ... %}`	`#` (single line) `"""` or `'''` is docstring which might be undersirably picked up
Reliable multi-line commenting (IDE)			Ctrl+(Shift)+`R`(Windows), `/` (Mac or Linux)	[Spyder]: Ctrl+`1`(toggle), `4`(comment), `5`(uncomment)
Code cell (IDE)			`%%`	[Spyder]: `# %%`
Line Continuation	`\`	`\`	`...`	`\`
Console Precision			`format`	`%precision` (IPython)
Clear variables			`clear` / `clearvars`	`%reset -sf` (IPython)

Macros only make sense in C/C++. This makes code less transparent and is frowned upon in higher level programming languages. Even its use in C++ should be limited. Use inline functions whenever possible.

Python is messy about the workspace, so if you just delete

Object Oriented Programming Constructs

Common	C++	MATLAB	Python
Getters Setters	No native syntax. Name mangle (prefix or suffix) yourself to manage	Define methods: `get.x` `set.x`	Getter: `@property def x(self): ...` Setter: `@x.setter def x(self, value): ...`
Deleters	Members can’t be changed on the fly	Members can’t be changed on the fly	Deleter (removing attributes dynamically by `del`)
Overloading (Dispatch function by signature)	Overloading	Overload only by first argument	`@overload` (Static type) `@singledispath @multipledispatch`
Initializing class variables	Initializer Lists Constructor	Constructor	Constructor
Constructor	`ClassName()` Does not return (`*this` is implicit)	`obj=ClassName(...)` MUST output the constructed object	`__init__(self, ...)` Object to be constructed is 1st argument
Destructor	`~ClassName()`	`delete()`	`__del__()`
Special methods	Special member functions	(no name) method that control specific behaviors	Magic/Dunder methods
Operator overloading	`operator`	operator methods to define	Dunder methods
Resource Self-cleanup	RIAA	`onCleanup()`: make a dummy object with cleanup operation as destructor to be removed when it goes out of scope	`with` Context Managers
Naming for the object itself	Class: (class’s own name by SRO `::`) Instance: `*this`	Class: (class’s own name) Instance: `obj` (or any output name defined in constructor)	Class: `cls` Instance: `self` (Recommended PEP8 names)

Python allows adding members (attributes) on the fly with setattr(), which includes methods. MATLAB’s dynamicprops allows adding properties (data members) on the fly with addprop

onCleanup() does not work reliably on Python because MATLAB’s object destructor time is deterministic (MATLAB specifically do not garbage collect user objects to avoid this mess. It only garbage collects PODs) while Python leaves it up to garbage collector.

*this is implicitly passed in C++ and not spelled out in the method declaration. The self object must be the first argument in the instance method’s signature/prototype for both MATLAB and Python.

Functional Programming Constructs

Common	C++	MATLAB	Python


Function as variable	Functors (Function Objects) `operator()`	Function Handle	Callables (Function Objects) `__call__()`
Lambda Syntax	Lambda `[capture](inputs) {expr} -> optional trailing return type`	Anonymous Function `@(inputs) expr`	Lambda `lambda inputs: expr`
Closure (Early binding): an instance of function objects	Capture `[]` only as necessary. Early binding `[=]` is capture all.	Early binding ONLY for anonymous functions (lambda). Late binding for function handles to loose or nested functions.	Late binding* by default, even for Lambdas. Can capture `Po` through default values `lambda x,P=Po: x+P` (We’re relying users to not enter the captured/optional input argument)

Concepts of Early/Late Binding also apply to non-lambda functions. It’s about when to access (usually read) the ‘global’ or broader scope (such as during nested functions) variables that gets recruited as a non-input variable that’s local to the function itself.

An instance of a function object is not a closure if there’s any parameter that’s late bound. All lambdas (anonymous functions) in MATLAB are early bound (at creation).

The more proper way (without creating an extra optional argument that’s not supposed to be used, aka defaults overridden) to convert late binding to early binding (by capturing variables) is called partial application, where you freeze the parameters (to be captured) by making them inputs to an outer layer function and return a function object (could be lambda) that uses these parameters.

The same trick (partial application) applies to bind (capture) variables in simple/nested function handles in MATLAB which do behave the same way (early binding) like anonymous functions (lambda).

Currying is partial application one parameter at a time, which is tedious way to stay faithful to pure functional programming.

List comprehension is a shorthand syntax for transform/map() and copy_if/remove_if/filter() in one shot, but not accumulate/reduce(). MATLAB and C/C++ does not have listcomp, but listcomp is not specific to Python. Even Powershell has it.

Listcomp syntax, if wrapped in round brackets like (x**x for x in range(5)), gives a generator. Wrapping in square bracket is the shortcut of casting the generator into a list, so [x**x for x in range(5)] is the same as list(x**x for x in range(5)).

Coroutines / Asynchronous Programming

MATLAB natively does not support coroutines.

Common	C++20	Python
Generators	Input Iterators	Functions that `yield value_to_spit_out_on_next` (Implicitly return a generator/functor with `iter` and `next`)
Coroutines		Functions that `value_accepted_from_outside = yield` Send value to the continuation by `g.send(user_input)` `async`/`await` (native coroutines)

Matrix Arrays

The way Numpy requires users to specify matrices with a bracket for every row drives me nuts. Not only there’s a lot of typing, the superfulous brackets reinforce C’s idea of row-major which is horrendous to people with a proper math background who see matrices as column-major $\mathbf{A}_{r,c}$ . Pytorch is the same.

Once you are trained in APL/MATLAB’s matrix world-view, you’ll discover going back to the world where matrices aren’t first class citizens is clumsy AF.

With Python, you lose the clutter free readability where your MATLAB code is one step away from the matrix equations in your scientific computing work, despite a lot of the features that addresses frequent use patterns are implemented earlier in Python than MATLAB.

Don’t believe those who haven’t lived and breathed MATLAB tell you Python is strictly superior. No it isn’t. They just didn’t know what they were missing as they haven’t made the intellectual leap in MATLAB yet. Python is very convenient as a swiss-army knife but scientific computing is an afterthought in Python’s language design.

The only way to use MATLAB-like semi-colon to change rows only works for np.matrix() type, which they plan to deprecate. For now one can cast matrix into array like np.array(np.matrix(matrix_string)).

Even numpy’s ndarray (or matrix to be deprecated) are CONCEPTUALLY equivalent to a matrix of cells in MATLAB. There isn’t native numerical matrices like in MATLAB that doesn’t have the overhead of unpacking arbitrary data types. You don’t want to do numerical matrices in MATLAB with cell matrices as it’s insanely slow.

You get away without the unpacking penalty in Numpy if all the contents of the ndarray happens to have the same dtype (such as numerical), aka known to be uniform. In other words, MATLAB’s matrices are uniform if it’s formed by [] and heterogeneous if formed by {}, while for Python [] is context-dependent, kept track of by dtype.

Concept	MATLAB	Numpy
Construction	`[8,9;6,4]`	`np.array([[8,9],[6,4]])`
Size by dimension	`size()`	`A.shape`
Concatenate within existing dimensions	`[A;B]` or `vertcat()` `[A,B]` or `horzcat()` `cat(dim, A, B, ...)`	`np.vstack()` `np.hstack()` `np.concatenate(list, dim)`
Concatenate expanding to 3D (expand in last dimension)	`cat(3, A, B, ...)`	`np.dstack()` ‘d’ for depth (3rd dimension)
Concatenate expanding dimensions	`cat(newdim, A, B, ...)` then `permute()`	`np.stack([A, ..], expand_at_axis)` `np.array([A, ..])` expands at first dimension as outermost bracket refers to first dimension
Tiling	`repmat()`	`np.tile()`
Fill with same value	`repmat()`	`np.full()`
Fill with ones/zeros	`ones(), zeros()`	`np.ones(), np.zeros()`
Fill minicking another array’s size	`repmat(x, size(B)) ones(x, size(B))` `zeros(x, size(B))`	`np.full_like(B, x)` `np.ones_like(B)` `np.zeros_like(B)`
Preallocate	Any of the above (Must be initialized)	`np.empty()` `np.empty_like()` UNINITIALIZED

repelem() is just repmat() with the repetition by axes vector expanded out as variable input arguments one per dimension. Using ones vector to broadcast a singleton instead of repmat() is horrendously inefficient and non-intuitive.

Heterogeneous Data Structures

Heterogeneous Data Structures are typically column major as it is a concept that derives from Structs of Arrays (SoA) and people typically expect columns to have the same data type from spreadsheets.

While Pandas offers a lot of useful features that I’ve easily implemented with wrappers in MATLAB, the indexing syntax of Pandas/Python is awkward and confusing. It’s due to the nature that matrix is a first-class citizen in MATLAB while it’s an afterthought in Python.

Python does not have the { } cell pack/unpack operator in MATLAB, so in Pandas, you select the Series object (think of it as a supercharged list with conveniences such as handling missing values and keeping track of row/column labels) then call its .values attribute.

However, Pandas is a lot more advanced than MATLAB in terms of using multiple columns as keys and have more tools to exploit multi-key row names (row names not mandatory in MATLAB but mandatory in Pandas). In the old days I had to write my own MATLAB function with unique(.., 'rows') exploit its index output to build unique keys under the hood.

Concept	MATLAB	Python (Pandas Dataframe)

Rows	Observations (`dataset()`) Row (`table()`)	Rows index
Columns	Variables	Columns
Select rows/columns	`T(rows, cols)`	`T.loc[r, col_name]` `T.iloc[r,c]` Caveats: – single index (not wrapped in list) have content extracted – `iloc` on LHS cannot expand table but `loc` can, but it can only inject 1 row – can get index number of names by `T.get_loc()` to use with `T.iloc[]`
Remove rows/columns	`T(rows, cols) = []`	`T.drop(index=rows, columns=cols)` Optionally: `inplace=True` `del T[rows, cols]` does NOT work
Extract one column	`T{:, c}`	`T[c].values`
Extract one entry	`T{r, c}`	`T.at[r,col_name]` `T.iat[r,c]` Faster than `loc/iloc`
Show first few rows	`T(1:5, :)`	`T.head()`
Drop duplicate rows	`unique(T, 'stable')`	`T.drop_duplicates()`


Ordinal	`categorical()` `ordinal()`	`Categorical()` `Index()`
Getting column names/labels	`T.Properties.VariableNames` (returns `cellstr()` only)	`T.columns` (returns `Index()` or `RangeIndex()`)
Getting row names/labels	`T.Properties.RowNames`	`T.index`
Transpose table	`rows2vars()`	`T.transpose()`

Move columns by name	`movevars()` since R2023a
Rename columns	`renamevars()` since R2020a	`T.rename(columns={source:target})`
Rename rows	Modify `T.Properties.RowNames`	`T.rename(index={source:target})`
Use column as row indices	`T.Properties.RowNames` = `T.cellstr_variablename` If multiple columns are needed, need to combine them into one column using some user rules	`T.set_index(column_to_use)` Dataframe allows multiple columns as row index keys
Reorder or partial selection	`T[rows, cols]`	`T.reindex(columns=..., index=...)` New labels will autofill by `NaN`
Select columns	`T[:, cols]`	`T[list_of_cols]`
Pick column by data type	T[:, `varfun(...)]`	`T.select_dtypes(include=[list of type names])`
Pick column by string match	T[:, `varfun(...)]`	`T.filter(like=str_to_match)`
Blindly concatenate columns of 2 tables	`[T1, T2]` If you defined optional rownames, they must match. You can delete it with `T.Properties.RowNames = {}`	Pandas assign row indices (labels) by default. Mismatched row labels do not combine in the same row. Consider `reset_index()` or overwrite the row indices of one table with another, like `pd.concat([T1, T2.set_index(T1.index)]`
Blindly concatenate rows of 2 tables	`[T1; T2]`	`pd.concat([T1, T2], ignore_index=True)`

Format export	`writetable()`	`.to_*()`

MATLAB tables does not support ranging through column names (such as 'apple':'grapes') yet Pandas DataFrame support it. I don’t think it’s fine to use it in the interpreter to poke around, but this is just asking for confusing logic bugs when the columns are moved around and the programmer has a false sense of security knowing exactly what’s where because they are using only names.

Dataframe is a little smarter than MATLAB’s table() in terms of managing column names and indices as it’s tracked with Index() type which is the same idea as MATLAB’s ordinal() ordered categorical type, where uniques names are mapped to unique indices and it’s the indices under the hood. This is how 'apple':'grapes' can work in Python but not MATLAB.

MATLAB T.Properties.VariableNames is a little clumsy. I usually implement a consistent interface called varnames() that’d output the same cellstr() headings whether it’s struct, dataset or table objects.

MATLAB’s table() by default do not make up row names. Pandas make up row names by default sequentially.

MATLAB table() do requires qualified string characters as variable names. Dataframe doesn’t care what labels you use as long as Index() takes it. It can get confusing because you can have a number 1 and ‘1’ as column headers at the same time and they look the same when displayed in the console.

”

Spyder traps for MATLAB users (1): By default, Spyder’s F5/Run executes the script from clean workspace.

Posted on February 10, 2025 by admin

This is another example of open source projects not going through a comprehensive use case study before changing the default behavior, which ended up pulling a prank on some users.

This time it’s Spyder’s good-intentions trying to proactively prevent user mistakes (such as not keeping track of the workspace) throwing the people who meticulously understand their workspace off.

I was working on a FT4222 device which should not be opened again if it’s already opened, aka the ft4222 class object exists. So naturally like in MATLAB, at the top of the script I check if the device object already exist and only create/open it when it’s not already there, like this:

if 'dev' in locals():
    pass
else:
    print('Branch')
    dev = ft4222.openByDescription('FT4222 A')

To my surprise it doesn’t work. 'dev' in locals() always return False every time I press F5, despite when I check again after the script runs, the variable is indeed in there and 'dev' in locals() returns True. WTF?!

Turns out I was not alone! Somebody had the exact same idiom as I did. Spyder 4 changed the default behavior, and we are supposed to manually check this dialog box entry so the scripts do not run off a clean slate when we press F5!

It’s an extremely terrible idea to have the IDE muck with the state by default. In MATLAB, if we want the script to start with clean state, we either put clear at the top of the script or clearvars -except to keep the variable.

It’s even harder to catch the new default insidious behavior of Spyder given it runs the script from a clean slate from F5/Run then dump the values to the workspace. It’s now a merge between pre-existing variables in the local() workspace and the results of the script from from a blank state!

The people who decided change to this default behaveior certainly didn’t think through this and rushed to do the obvious to please the careless programmers. If a programmer made a mistake by re-running the script without clearing the workspace and was impacted by the dirty variables, they can always reset everything and get out of this (and learn they should clean up the dirty state through the experience), however, somebody who know what they are doing will not be able to easily find out what they did wrong until they search for a behavior that looked more like a bug from Spyder/Python! It’s just horrible design choice! MATLAB doesn’t casually to throw users off like this. Damn!

Also I looked into code cells #%% (MATLAB has the equivalent %%), but there’s another annoyance in Spyder: block commenting through """ or ``` pairs is interpreted as output string from runcelll()! In other words, runcelll() outputs docstrings! So every time you execute the cell, the code you commented out will be concatenated into one long raw string with escape characters and pollute your console screen! Damn!

Spyder annoyances (3): The shortcut key Ctrl+D to reset console doesn’t work unless there’s nothing half typed in the console.

Exploiting Short-Circuit Evaluation for conditional execution

Posted on November 1, 2023 by admin

TLDR:

T && X is equivalent to "if( T) then run X"
T || X is equivalent to "if(!T) then run X"

I don’t exploit this too much in C/C++ because it’s hard to read (and therefore hard to keep track of it to make sure it’s bug free) and most often I’m interested in the output value so I have to watch out for the side effects. However this is common in Bash scripts

Domination Property

In languages that expressions evaluates to a value, sometimes if-statements can be replaced by short-circuit evaluation because short-circuit evaluation exploits the domination property of AND and OR logic operations:

\newcommand{\Hquad}{\hspace{0.5em}} \begin{alignat*}{2} 0 \Hquad & \mathrm{AND} & \Hquad X = 0 \\ 1 \Hquad & \mathrm{OR} & \Hquad X = 1 \end{alignat*}

When you FIRST run into the dominant value for the binary operation (0 for AND) and (1 for OR), evaluate no further (i.e. skip the rest) because rest won’t change the overall result away from the dominant value.

So in this use case (emulating if-then statements), what the latter expression X evaluates to or what the combined logic value is irrelevant. We are merely tricking the short-circuit mechanism to trip (short) to NOT evaluate based on what the earlier expression turned out. Action is the ‘norm’. Conditional inaction is the essence of this idiom.

The dual of domination property is idempotent, which is easier to reason because if pre-condition (say T) forces overall expression to boil down to the expression we want to conditionally execute (say X), we are stuck evaluating X if condition T is met.

\newcommand{\Hquad}{\hspace{0.5em}} \begin{alignat*}{2} 1 \Hquad & \mathrm{AND} & \Hquad X = X \\ 0 \Hquad & \mathrm{OR} & \Hquad X = X \end{alignat*}

These 2 possibilities (domination and idempotency) partitions to space (choices) of possibilities (i.e. cover all possible combinations), in other words there are no other scenarios than described. So the precondition T decides whether you run X or not, which is the equivalent of an if-then statement.

Operator (function) view of logic domination [Functional programming perspective]

By grouping the first value and the binary logic operator with a pair of parenthesis, in dominance view

(0 AND) is also called the ‘clear’ operator
(1 OR) is also called the ‘set’ operator

but this view is not too interesting for our case because we are not interested in what the conditional expression and the overall expression evaluates to, which is signified by ‘clear’ and ‘set’.

On the other hand, (1 AND) and (0 OR) are pass-through (idempotent) operators which passes the evaluation to the latter expression X.

\newcommand{\Hquad}{\hspace{0.5em}} \begin{alignat*}{2} (1 \Hquad & \mathrm{AND}) \Hquad & \circ & \Hquad X = X \\ (0 \Hquad & \mathrm{OR}) \Hquad & \circ & \Hquad X = X \end{alignat*}

This reads

(1 AND): pass-through (evaluate latter expression) if (earlier expression is) TRUE
(0 OR): pass-through (evaluate latter expression) if (earlier expression is) FALSE

Let’s call T the condition to test (the ‘if’-condition). The expression to run remain X.

\newcommand{\Hquad}{\hspace{0.5em}} \begin{alignat}{2} (T \Hquad \mathrm{AND}) X & = \overline{f_T}(X) \\ (T \Hquad \mathrm{OR}) X & = f_T(X) \end{alignat}

where

$\overline{f_T}$ reads “Run if T is false”
$f_T$ reads “Run if T is true”.

M	T	W	T	F	S	S
		1	2	3	4	5
6	7	8	9	10	11	12
13	14	15	16	17	18	19
20	21	22	23	24	25	26
27	28	29	30	31

Rambling Nerd with a Plan

Hoi Wong's blog

Category Archives: Computer Science