References

In Asylum, you typically do not want to copy data around, especially if it is large. At the same time, you may not want to change how data is moved. References exist for this purpose, to allow you to use memory located somewhere else.

Standard References

Earlier, we have discussed different ways of passing function arguments. Here is a reminder of passing by reference:

fn modifyVal(ref int val) {
    val = 5;
}

fn main() {
    int val = 3;
    modifyVal(ref val);
    println(val);
}

Output:

By passing by reference, we are allowed to modify data of the variable we pass to the function. However, there is an alternate way of representing references which we will use for the rest of the page:

fn modifyVal(int& val) {
    val = 5;
}

fn main() {
    int val = 3;
    modifyVal(val&);
    println(val);
}

Output:

Reference Types

That int& from earier is not just passing by reference, but it is a type you can store to as well! All a reference does is allow you to have multiple variables share the same value. For example:

fn main() {
    int val1 = 3;
    int& val2 = val1&;
    println(val1 + ", " + val2);
    val2 = 5;
    println(val1 + ", " + val2);
    val1 = 7;
    println(val1 + ", " + val2);
}

Output:

3, 3
5, 5
7, 7

As you can see above, the values of val1 and val2 are always the same, and using one is the equivalent of using the other. There is also a short-hand notation for creating a reference:

int& val2 -> val1;

The arrow (points to operator) can be thought to read as val2 “references” val1. This syntax is preferred and will be used for the rest of this page.

More References

You can add as many references as you want!

fn main() {
    int val1 = 3;
    int& val2 -> val1;
    int& val3 -> val1;
    int& val4 -> val3;
    val4 = 5;
    println(val1 + ", " + val2 + ", " + val3 + ", " + val4);
}

Output:

5, 5, 5, 5

Notice that int& val4 -> val3 does not have a reference reference the reference val3 but rather the original val1 value. This is because using a reference is the same as using the original value!

Accessing References

Sometimes you will want to modify or retrieve a reference instead of the value that is being referenced. In order to do this, you can use the as assignable reference operator @:

fn main() {
    int val1 = 3;
    int val2 = 5;
    int& val3 -> val1;
    println(val3);
    @val3 -> val2;
    int&& val4 -> @val3;
    println(@val3);
    println(val4);
    println(@val4);
}

Output:

3
-> 5
-> 5
-> -> 5

We first initalize val1 and val2 with their values 3 and 5 respectively.
Variable val3 references val1. We then inspect its value.
The reference val3 is changed to now be a reference to val2. Notice how we need to put @ before the name of the reference in order to modify it.
A reference to a reference named val4 is created. In order to reference val3 we need to treat val3 as a reference rather than what it is referencing. This is done by putting @ before the name of the variable.
Treating val3 as a reference is printed and we see it “references” 3. Printing the value of val4 shows the same as val3 as a reference. Printing val4 as a reference shows that it is indeed a reference to a reference to 3.

Dangers Of References

It is important to note that standard references do not hold any data. If the data they are referring to is not there anymore, this can cause problems. For example:

fn main() {
    int x = 3;
    int& y -> x;
    {
        int z = 5;
        @y -> z;
    } // Variable z goes out of scope here, y is now an invalid reference!
    println(y);
}

Output:

[Segmentation Fault]

Since y is referring to z and z dies before y does, we get a situation in which y tries to reference z, but does not know that z is gone. Thus when we try to use y (therefore we’re trying to use z), our program crashes. The compiler will warn you when this may happen, but definitely keep this in mind.

Function Arguments

You can not create a reference from function arguments that are of type in or move. However, you can copy parameters that are references:

fn myFunc1(int& val) {
    myFunc2(val&); // This makes a new reference.
    myFunc2(@val): // This uses the existing reference (preferred).
}

fn myFunc2(int& val) {
    println(val);
}

Warning

The in parameters are read-only, and so making a reference to this data breaks the promise that the data is read-only:

fn myFunc(int val) {
    int& myRef -> val; // This is not allowed!
}

Warning

The move parameters do not necessarily come from a variable (they can be from constant values), so there may not be any variable to reference. Thus, this is not allowed.

fn myFunc(int% val) {
    int& myRef -> val; // This is not allowed!
}

Nullable References

Standard references are great for when you know the data they reference is alive, however this is not always the case. There are some situations where you want to reference data and be able to check if the data still exists or not before using it. This is what the nullable reference is for. Nullable references also allow a reference to reference nothing or “null” as their name implies.

fn main() {
    int@ x;
    println(@x);
    {
        int y = 3;
        @x -> y;
        println(@x);
    }
    println(@x);
}

Output:

null
-> 3
null

We can declare a nullable reference by using @ instead of &. Notice that we do not have to set the value of it like with standard references. Declaring int@ x; is the same as declaring int@ x = null; When printing the reference, or that it references a value like usual.

Note

Nullable references are not very performant due to the code required to null a nullable pointer when the data being referenced is no longer accessible. Usage with owned references, counted references, and allocators are fairly inexpensive. However, using nullable references with stack variables generates additional code requiring branching which is not ideal for performance.

Warning

It is important to check the value of a nullable reference before using it if you don’t know if it is null or not! The following code produces a segmentation fault like earlier:

fn myFunc(int@ x) {
    println(x);
}

Output:

[Segmentation Fault]

The correct way is this:

fn myFunc(int@ x) {
    if (@x) println(x);
}

The value of @x can be converted to a bool implicitly. If x is a nullable reference, this value is true if the reference is not null, or false if it is null. Note that @x where x is a standard reference will always implicitly be converted to true!

If you want to guarantee that the variable being passed is not null, then either pass the value by in or by ref! Nullable references should only be used as arguments to move nullable references around or when references can be null.

Warning

Nullable references are not a magic bullet. They will work properly if created from owning or counted references (discussed later) or by referencing data on the stack (as shown on the first example with nullable references). If a nullable reference is created from a standard reference, it will not know when to become null since the reference guarantees the referenced data will always exist!

fn main() {
    int x = 3;
    int& y -> x;
    int@ z -> y; // No problems so far. Even if `y` references something else later, `z` will still reference `x` since that's what `y` is doing at this time.
    {
        int w = 5;
        @y -> w; // Nullable reference `z` still references `x`.
        println(z);
    }
    z -> y; // Using `z` will produce segmentation fault now!
    println(@z);
}

Output:

3
[Segmentation Fault]

Even though we would expect z to be null, there is no way of knowing that y is an invalid reference. Thus, the compiler will think the memory that is referenced is valid. While you are allowed to create nullable references from standard references, it is highly recommended you do not.

Smart References

There are certain scenarios in which you want finer control of when data lives. Some of which include not being able to instantiate a struct in its constructor or the struct does not have a definitive compile-time size (as in referred to by an abstract base). The solution to this are smart references.

Owning References

Owning references are fairly simple in design. Only one reference can be crowned the owner of data. Owning references have type T^. You can think of the ^ as a crown that the reference is wearing.

struct MyStruct {
   pub int val;
}

impl MyStruct {
    pub MyStruct(int val) : val(_) {}
}

fn main() {
    MyStruct^ item = MyStruct^(3);
    println(item.val);
}

Output:

The above code shows how an owning reference is created. Owning references construct an instance of the type they hold on the heap. When they go out of scope, the data they hold goes out of scope as well. They behave similarly to Box<T> in Rust and std::unique_ptr<T> in C++.

Nullable References

Nullable references are allowed to copy from an owning reference, though nullable references are not allowed to own any data and thus just reference the same data. When the owning reference goes out of scope, the nullable reference is set to null.

fn main() {
    MyStruct@ r1;
    {
        MyStruct^ item = MyStruct^(3);
        @r1 -> item;
    }
    println(@r1):
}

Output:

null

Passing Arguments

It is possible to pass an owning reference as a nullable reference and a standard reference:

fn myFunc(MyStruct& a, MyStruct@ b) -> int? {
    if (!@b) return null;
    else return a.val + b.val;
}

fn main() {
    MyStruct^ item = MyStruct^(4);
    println(myFunc(@item, @item));
}

Output:

Transfering Ownership

Remember that is not possible to copy the value of an owning reference. However, data can be moved instead. This is useful for transferring ownership of data:

struct MyData {
    pub int^ val;
}

impl MyData {
    pub MyData(int^% val) { // Can also do `: val(_%)` here instead.
        this.val := val;
    }
}

fn main() {
    MyData data1 = MyData(int^(3));
    MyData data2 = MyData(move @data1.val); // Alternative is MyData(@data1.val%);
    println(@data1);
    println(@data2);
}

Output:

null
-> 3

In the above code, the ownership of 3 is transferred from data1.val to data2.val. When transferring the owner, it is important to make sure the reference gets transferred rather than the int it is referring to. Remember to place @ before references when you want to manipulate the references.

Counted References

Ownership is distributed among the T# references. Nullable references that point to it get nulled when all counted references are out of scope.

fn myFunc(MyStruct& a, MyStruct@ b) -> int? {
    if (!@b) return null;
    else return a.val + b.val;
}

fn main() {
    MyStruct# item1 = MyStruct^(4);
    {
        MyStruct# item2 -> item1;
        println(myFunc(@item1, @item2));
        println(@item1.count);
    }
    println(@item1.count);
}

Output:

8
2
1

The above code shows how a counted reference is created. Counted references construct an instance of the type they hold on the heap. Copying from a counted reference to another counted reference increments the reference count. When a counted reference goes out of scope, the reference count decreases. When the reference counter reaches 0, the memory referenced is deleted. Counted references behave similarly to Rc<T> in Rust and std::shared_ptr<T> in C++.

Nullable References

Nullable references for counted references work similarly how they do for owned references. when the number of counted references reaches 0, the nullable references that reference the data are now null.

About Cycles

Counted references do not have their memory freed until all counted references that use the memory have died. Therefore, if there is a situation where a counted reference is not able to die (a cycle), then memory can not be freed:

struct Data {
    pub int val;
    pub Data# linked;
};

impl Data {
    pub Data(int val) : val(_) {}
}

fn func() {
    Data# data1 = Data#(3);
    Data# data2 = Data#(5);
    @data1.linked -> data2;
    @data2.linked -> data1;
}

When the function func is called, the memory for data1 and data2 will never be freed. While the first counted references (the ones made first) are deleted from going out of scope, data1 and data2 reference each other and so it never occurs for one to be deleted since the count is above 0. The way to avoid this is by changing the design pattern used. Counted references should only be used for items that “own” the data. Nullable references should be used for items that access the data. A version of the above that takes this into account would look like this:

struct Data {
    pub int val;
    pub Data@ linked;
};

impl Data {
    pub Data(int val) : val(_) {}
}

fn func() {
    Data# data1 = Data#(3);
    Data# data2 = Data#(5);
    @data1.linked -> data2;
    @data2.linked -> data1;
}

Since linked is a nullable reference and so does not count towards the counted reference count, the data dies when func is over.

When To Use Which Reference

With all these different types of references, it can be hard to decide the right one to use in each scenario. The below steps should help you understand which is the right reference to use.

If a function only needs to read data, then it should take an in parameter and not deal with references.
If the data being referenced is guaranteed to exist and the function operating on it does not manage the ownership of the data, then it should be a standard reference.
If there is only one single owner of the data and the function in question transfers ownership, then an owning reference should be used.
If there are multiple owners of the data and you are transferring ownership of one of the owners, then a counted reference should be used.
If the data is from unknown origin and can expire at an unexpected or unknown time, only then should nullable references be used.

These are the general cases for using the above different reference types, though they are not hard rules and depend on your particular use case. Nullable references can make it hard to track where data originates from so use them with caution. They are most useful for when stored as members to structs when the referenced data in question has an unknown expiration. However, we will see some special uses in the section with allocators.

Challenges

TODO!!!

Challenge Solutions

TODO!!!