Data Types
On the last page, you vaguely heard about the int
data type. But what is a data type, what are they, and how do you use them?
Purpose Of Data Types
In Asylum, you can think of everything in an “is a” way. For example, 5 “is a” number, the vehicle you drive “is a” car, and main
“is a” function. A data type lets the computer know what kind of data you are working with. Obviously, your computer needs to do different things to add numbers together compared to concatting text. And of course, if you write a function that adds numbers together, you wouldn’t want anyone to give it text, a function, or even a car.
Primitive Data Types
A primitive data type is a type that can not be recreated using other types available in the language. These types are built into the language and can be used anywhere.
Void
The void
type can be thought of to mean nothing or be void of anything. Functions return void
from default as the two functions below are the exact same:
fn printItem(str item)
{
println(item)
}
fn printItem(str item) -> void
{
println(item)
}
It is illegal to declare a parameter or variable of type `void`! You can't have a variable that is nothing!
Bool
Any item that is either true or false can be represented as bool
. A bool
is true
if it is true
and false
if it is false
. By default, a bool
is false
. Converting a bool
to an integer will return a 1
if true
and 0
if false.
Integer Types
Perhaps the most common group of types in programming are integer types. Integers are used to represent whole numbers. Signed integers can be negative, while unsigned integers can not be negative. Below are the available integer types:
Type |
Minimum Value |
Maximum Value |
Description |
Overflow/Underflow |
---|---|---|---|---|
s |
-(2^ |
2^ |
A standard signed integer where |
Undefined |
u |
0 |
2^ |
A standard unsigned integer where |
Undefined |
so |
-(2^ |
2^ |
Like |
Wraps |
uo |
0 |
2^ |
Like |
Wraps |
int |
- |
- |
A signed integer that is the bitwidth of the system’s registers. It is at least 16 bits. It is impossible to have an array index bigger than its range. Pointer math is also done with this. |
Undefined |
uint |
- |
- |
An unsigned integer that is the bitwidth of the system’s registers. It is at least 16 bits. It is impossible to have a type that has a size bigger than this. |
Undefined |
Any integer has a default value of 0
.
It is also important to note that division by 0 will abort the program and dump a stack trace in debug mode and cause undefined behavior in release mode. In debug mode, an overflow or underflow will abort the program and dump a stack trace if it is undefined.
Floating Point Types
Another common primitive are numbers that have decimal points in them. Below are the available floating-point types:
Type |
Description |
---|---|
f |
Floating-point number with a bitwidth of |
float |
Floating-point number the size of floating-point registers on the system. It is at least 16-bits. |
Any floating point number has a default value of 0.0
.
Note that any overflow, underflow, division by 0, or invalid operation will abort the program and dump a stack trace in debug mode and cause undefined behavior in release mode.
Other Numbers
Asylum also supports other types of numbers that are neither integer or floating point types.
Type |
Description |
---|---|
fix |
A fixed point number with |
varlen |
A number with infinite range. It can represent any number imaginable, though it comes with the cost of being very slow. |
Both number types have a default value of 0.0
.
Note that any overflow, underflow, division by 0, or invalid operation will abort the program and dump a stack trace in debug mode and cause undefined behavior in release mode.
Text Based
Strings of text are composed of characters. Below are the types allowed:
Type |
Description |
---|---|
char |
A text character. Characters are in the UTF8 format. You can think of this as a single item in text like ‘A’. |
wchar |
A wide text character. Characters are in the UTF16 format. |
str |
A “string” of |
wstr |
A “wide string” of |
By default, characters are set to h
. The default value of a string is the empty string.
Function Type
The function type is in the form func<T(...)>
It is a type that is a function that has a return type of T
and parameter types for the function in the parenthesis. The add function from earlier is type func<int(int, int)>
. It is mostly used for storing functions into variables.
Functions do not have a default value and must be initialized immediately.
Non-Types
These types are not really types and are context sensitive.
Type |
Description |
---|---|
var |
This is not a type, it turns into another type that is automatically determined at compile time. It has its uses, but it is generally not recommended, this is a matter of opinion though. |
This |
The current type being implemented in an implementation block. |
Special Types
Like most other languages, Asylum has special types that are types that involve other types. This statement is confusing, but it’ll make sense after looking at the special types below (where T
is any non-void data type). It is not expected for everything listed here to make sense, as they are discussed in more detail later:
Tuple Types
These are used for grouping types so you can return multiple types at the same time, or work with variables that contain multiple types.
Type |
Name |
Description |
---|---|---|
|
Tuple |
This allows you to store a bunch of other data types in one variable. Ex: |
( |
Tuple |
This is the exact same as tuple, except with parenthesis around it. This is useful for removing ambiguity. |
A tuple has the default value where all its elements are at its default value if possible.
Array Types
Arrays are used for storing blocks of data that do not grow or shrink in size.
Type |
Name |
Description |
---|---|---|
|
Dynamic array. This is an array of |
|
|
Constant array. This is an array of |
|
|
A dimensional array where |
Arrays will have their elements be their default values if possible. Note that is not possible to store a dynamic array in a struct
since they do not have a definite size.
Other Types
There are some other types that have special uses as well.
Type |
Name |
Description |
---|---|---|
|
Reference |
A not-null reference to a data type. For now, think of it as a shortcut to already existing data of the same type that always references data. (L-value reference for those familiar with C++). |
|
Move |
Data being transferred that can not be written to (R-value reference for those familiar with C++). |
|
Pointer |
A pointer that can only be used in unsafe contexts. For now, think of it as a shortcut to existing data without any safety mechanisms. This also allows you to do math operations with memory positions (it’s for more low-level uses so you most likely don’t need it). |
References and moved data do not have a default value and must be initialized immediately. Pointers are null
by default.
EASL Types
EASL (Embedded Asylum Standard Library) is a thin layer over Asylum that helps implement the language itself. It is responsible for providing the types below. Note that all EASL types start with upper-case letters as only the primitive types start with lowercase.
C Compatibility
C uses different types from Asylum, so the below are for C compatibility. EASL typedefs, or creates new types that really just shortcut back to the original type:
Type |
C Type |
True Type |
Range |
---|---|---|---|
C_schar_t |
signed char |
s8 |
-128 to 127 |
C_uchar_t |
unsigned char |
u8 |
0 to 255 |
C_char_t |
char |
- |
- |
C_short_t |
short |
s16 |
-32,768 to 32,767 |
C_ushort_t |
unsigned short |
u16 |
0 to 65,535 |
C_int_t |
int |
s32 |
-2,147,483,648 to 2,147,483,647 |
C_uint_t |
unsigned int |
u32 |
0 to 4,294,967,295 |
C_long_t |
long |
s32 |
-2,147,483,648 to 2,147,483,647 |
C_ulong_t |
unsigned long |
u32 |
0 to 4,294,967,295 |
C_longlong_t |
long long |
s64 |
-9,223,372,036,854,775,808 to 9,223,372,036,854,775,807 |
C_ulonglong_t |
unsigned long long |
u64 |
0 to 18,446,744,073,709,551,615 |
C_float_t |
float |
f32 |
3.4E +/- 38 (7 digits) |
C_double_t |
double |
f64 |
1.7E +/- 308 (15 digits) |
C_ssize_t |
ssize_t |
int |
- |
C_size_t |
size_t |
uint |
- |
C_intptr_t |
intptr_t |
int |
- |
C_uintptr_t |
uintptr_t |
uint |
- |
Despite the fact that this table is final, it is highly recommended to continue using types such as u16
when the type expected must be a 16-bit unsigned number. This helps better communicate the fact that the operation depends on there being 16-bits.
Smart References
Smart references are used to access data stored somewhere else without having to copy it. With the exception of nullable references, references will always have valid data.
Type |
Name |
Description |
---|---|---|
|
Nullable Reference |
Smart reference to a data that may be null. |
|
Owning Reference |
Smart reference that owns data. When it goes out of scope, the value it owns dies with it. Owning references may only moved to other owning references, never copied. It is not possible for this reference to be null. |
|
Counted Reference |
Smart reference that counts references to data. Copying it to another |
All references must be initialized while declared as they have no default value with the exception of the nullable reference, which is null
by default.
Vectors
Vectors are shortcuts to tuples of multiple of the same type. For example, a Vector<float, 4>
will produce a tuple of 4
float
elements. EASL defines typedefs for commonly used vector types:
Type |
True Type |
---|---|
Vec2 |
Vector<float, 2> |
Vec3 |
Vector<float, 3> |
Vec4 |
Vector<float, 4> |
UVec2 |
Vector<uint, 2> |
UVec3 |
Vector<uint, 3> |
UVec4 |
Vector<uint, 4> |
SVec2 |
Vector<int, 2> |
SVec3 |
Vector<int, 3> |
SVec4 |
Vector<int, 4> |
Matrices
EASL also allows you to declare a matrix with Matrix<T, U, V>
where T
is the element type, U
is the number of rows, and V
is the number of columns. This means a Matrix<float, 3, 4>
is a matrix of float
elements with 3
rows and 4
columns. Note that T
must have addition, subtraction, multiplication, and division defined with itself. Like vectors, EASL defines some typedefs for commonly used matrix types:
Type |
True Type |
---|---|
Matrix2x2 |
Matrix<float, 2, 2> |
Matrix2x3 |
Matrix<float, 2, 3> |
Matrix3x3 |
Matrix<float, 3, 3> |
Matrix3x4 |
Matrix<float, 3, 4> |
Matrix4x4 |
Matrix<float, 4, 4> |
UMatrix2x2 |
Matrix<uint, 2, 2> |
UMatrix2x3 |
Matrix<uint, 2, 3> |
UMatrix3x3 |
Matrix<uint, 3, 3> |
UMatrix3x4 |
Matrix<uint, 3, 4> |
UMatrix4x4 |
Matrix<uint, 4, 4> |
SMatrix2x2 |
Matrix<int, 2, 2> |
SMatrix2x3 |
Matrix<int, 2, 3> |
SMatrix3x3 |
Matrix<int, 3, 3> |
SMatrix3x4 |
Matrix<int, 3, 4> |
SMatrix4x4 |
Matrix<int, 4, 4> |
Unlike Vector<T, U>
, Matrix<T, U, V>
is a struct rather than a tuple. Its composure looks like:
struct Matrix<type T: Add<T> & Sub<T> & Mul<T> & Div<T>, uint U where U > 1, uint V where V > 1> {
Vector<T, U>[V] cols;
}
Other Types
Some other types that EASL defines:
Type |
Name |
Description |
---|---|---|
|
Optional |
An optional value. Either contains |
Res< |
When a success occurs, the value of type |
Optional is null
by default. Results by default are set to an error of the default type of U
if possible. It’s also important to note that T?
and Res<T, void>
are identical types.
Storage Qualifiers
You’re allowed to tag on storage qualifiers to add rules on how data is allowed to be accessed.
Type |
Name |
Description |
---|---|---|
readonly/ro |
A type |
|
writeonly/wo |
A type |
|
static |
A type |
|
volatile |
If you need to make sure |
|
atomic |
Declaring a type as atomic will make sure only one thread is allowed to use it at a time. |
Again, do not be overwhelmed if these do not make sense. They will all be explored in more detail later.
Typedefs
Asylum creates those above “shortcuts” but doing what is called a typedef statement. These are to be placed outside of functions, structures, and implementations.
typedef medIntArr = s24[];
This example creates a new type called, medIntArr
, which is really just a dynamic array of 24-bit signed integers in disguise. You can use medIntArr
and s24[]
interchangibly, they are the exact same thing.
Structs
Of course, making other names for existing types is cool and all, but it can only go so far. Structs allow you to group together a bunch of data types like so:
struct Color {
byte red;
byte green;
byte blue;
}
struct Car {
str licensePlate;
Color rgbColor;
int year;
str model;
}
Now you can pass around more meaningful data!
Infinite Structs
As you can see above, you can put structs inside of other structs. But one thing that is not allowed is putting a struct inside itself. Why is this a problem? Well let’s think about it:
Warning
struct IllegalStruct {
str name;
int number;
IllegalStruct illegalStruct;
}
Nothing too evil looking, right? But the compiler needs to determine the size of a struct at compile time. So let’s try and think of how that happens. We know that Asylum takes care of strings, and trying to figure out its size is a complex topic. But strings do have a fixed length, even if it is a black box to us. Ok, int
, that’s easy, 32 bits or 4 bytes. Ok, and it contains an IllegalStruct
, so we just give it the length of the IllegalStruct
. Ok, now we need to go inside that IllegalStruct
and figure out the length of its IllegalStruct
… This is the problem. We’ll keep trying to figure out the length in an infinite loop. So how do we fix it? If you remember from earlier, we have references which allow us to have a “shortcut” to data:
struct LegalStruct {
str name;
int number;
LegalStruct^? legalStruct;
}
This is legal, since we don’t need to know the length of LegalStruct
to calculate the length of LegalStruct
. A reference/shortcut always has a fixed length, no matter what type of data it shortcuts/points to. We need to also make the owning reference optional so that we can end the chain of structs when there is no more (or else it’d have to go on forever). Using a reference instead does add some things we will have to do when using it, but this will be discussed later. Also note the following illegal code:
Warning
struct IllegalA {
str name;
int number;
IllegalB illegalB;
}
struct IllegalB {
str name;
int number;
IllegalA illegalA;
}
Similarly, like a struct containing itself, this will also produce an infinite loop for the compiler that won’t be able to determine the length of the structure. This can again be fixed by making either the illegalA
or illegalB
members a reference. Note that just having one reference solves the infinite struct error.
Challenges
Explain the difference between unsigned and signed integers. What is a common signed type?
Explain the difference between a floating-point and a fixed-point number. What is a common floating-point type?
What type of number is
c_long_t
? Is it unsigned, signed, floating-point, or fixed-point?Remember the
add
function from earlier? Remake it so thata
andb
can have a decimal point.Create a
struct
that represents a circle. Each circle has a radius, a line width, color, and can have another circle inside of it. Create any other necessarystruct
s.