Stateful and procedural functions

Stateful functions

A stateful function is a function whose internal state may change when it is called. For example, the wall-clock function now() and random number generators such as rand(Integer high)->Integer are stateful.

Stateful functions can be called in queries. The query optimizer is aware that they may return different values for the same arguments each time they are called. Therefore, the optimizer is careful not to change the order in which stateful functions are called in a query.
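Why call order matters can be illustrated outside OSQL. The following Python sketch is only an analogy, not OSQL and not how the optimizer is implemented; the names next_id and double are hypothetical. It contrasts a function with hidden state against one whose result depends only on its argument.

```python
import itertools

# Hidden state shared by all calls to next_id (hypothetical example name).
_counter = itertools.count(1)

def next_id() -> int:
    """Stateful: every call advances the counter, so results differ per call."""
    return next(_counter)

def double(x: int) -> int:
    """Stateless: the result depends only on the argument."""
    return 2 * x

# The two calls below are textually identical but return different values,
# so swapping their evaluation order would swap a and b.
a = next_id()
b = next_id()
in_order = (a, b)

# Calls to a stateless function can be reordered or merged freely.
same = double(21) == double(21)
```

Reordering or merging the two next_id() calls would change the observable result, which is the kind of rewrite the text says the optimizer avoids for stateful functions.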

note

The stream function extract(s) is stateful since it runs stream s to extract the elements. The state of stream s may thereby change. For example, a stream reading sensor values will continuously change its state when new sensor readings arrive.

The function stateful(Charstring pat)->Bag of Charstring returns the signatures of the stateful resolvents for functions whose names match pat.

Example:

stateful('%rand%')

Procedural functions

A procedural function is a stateful function defined as a sequence of OSQL statements that may have side effects. The typical use of procedural functions is for consistent updates of the database. The return statement inside a procedural function returns a value.

Example: Let's create a table to store equipment properties.

drop table Equipment; -- Drop old table
create table Equipment(name Charstring,
                       weight Integer,
                       unique(name))

The following procedural Boolean function new_equipment(n,w) inserts a new item into the Equipment table and returns true if the insert succeeded:

create function new_equipment(Charstring n, Integer w) -> Boolean
  as {
       insert into Equipment values (n, w);
       return rowcount()=1;
     }

note

You can have several OSQL statements in the body of a procedural function by enclosing them in curly brackets {...}. This is called a block. The curly brackets can be omitted if the block contains only a single statement.

note

The stateful function rowcount() returns the number of affected rows in the preceding update of a table.

Test it:

new_equipment('Sensor',10);
select * from Equipment
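For readers more familiar with general-purpose languages, the same insert-then-check-row-count pattern can be sketched in Python with the standard sqlite3 module. This is an analogy under stated assumptions, not OSQL: an in-memory SQLite table stands in for Equipment, and the cursor's rowcount attribute plays the role of rowcount().

```python
import sqlite3

# Assumption: an in-memory SQLite database stands in for the Equipment table.
conn = sqlite3.connect(":memory:")
conn.execute(
    "create table Equipment(name text, weight integer, unique(name))"
)

def new_equipment(name: str, weight: int) -> bool:
    """Insert one row and report whether exactly one row was affected."""
    cur = conn.execute(
        "insert into Equipment values (?, ?)", (name, weight)
    )
    # cur.rowcount is the number of rows changed by the statement just run.
    return cur.rowcount == 1

ok = new_equipment("Sensor", 10)
rows = conn.execute("select * from Equipment").fetchall()
```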

The function new_equipment is stateful:

stateful('new_equipment')

If no return statement is executed in a procedural function, the result is empty. If a procedural function is used only for its side effects and does not return a value, the result type Boolean can be specified.

Iterating over queries

You can write procedural functions that execute a statement or a block for each tuple in the result of a query by using the for statement:

for <variables> in <query> <statement>

Example: The following procedural function populates the table Equipment with n items named E1, E2, ... and assigns each a simulated random weight between 1 and 100.

create function populate_equipment1(Integer n) -> Boolean
  as for i in range(n)
     {
       insert into Equipment values ('E' || i, rand(1,100));
       commit;
     }

populate_equipment1(10);
select * from Equipment

note

The type of the variable i is implicitly determined by the system from the type of the query and need not be declared.

note

The commit after each insert is important for saving memory during the population. Without it, all updates would be logged, and the log would use memory proportional to n.

Try calling populate_equipment1 again:

populate_equipment1(10)

note

The call to populate_equipment1 fails when it is called a second time, since the column name is declared unique and the names E1, E2, ... are already present in the table.
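The uniqueness failure can be reproduced in a small Python sketch with the standard sqlite3 module. This is an analogy, not OSQL: SQLite stands in for the Equipment table, random.randint(1, 100) stands in for rand(1,100), and numbering the items from 1 is an assumption based on the names E1, E2, ... mentioned above.

```python
import random
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "create table Equipment(name text, weight integer, unique(name))"
)

def populate_equipment1(n: int) -> None:
    """Insert n rows named E1..En, committing after every insert."""
    for i in range(1, n + 1):  # assumption: items are numbered from 1
        conn.execute(
            "insert into Equipment values (?, ?)",
            (f"E{i}", random.randint(1, 100)),
        )
        conn.commit()

populate_equipment1(10)

# The second call tries to insert E1 again and violates unique(name).
try:
    populate_equipment1(10)
    second_call_failed = False
except sqlite3.IntegrityError:
    second_call_failed = True

count = conn.execute("select count(*) from Equipment").fetchone()[0]
```

In this sketch the second call stops at its first insert, so the table still holds the ten rows from the first call.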

When experimenting with different table sizes, one would like to remove all rows in the table Equipment before populating it. This can be done by executing the SQL statement truncate table Equipment before inserting the rows:

create function populate_equipment2(Integer n) -> Integer
  as {
       truncate table Equipment;
       for i in range(n)
       {
         insert into Equipment values ('E' || i, rand(1,100));
         commit; -- To clear log
       };
       return n;
     }

Try it:

populate_equipment2(10);
select * from Equipment

note

The SQL statement truncate table Equipment removes all rows from the table without logging. If we had executed delete from Equipment instead, the row deletions would have been logged, which would have used a lot of memory.

Local declarations

You can declare local variables Vi with types Ti in a block with a declare statement placed first in the block:

declare T1 V1, T2 V2,...

Example: The following procedural function populates Equipment while also computing the average weight.

create function populate_equipment3(Integer n) -> Real
  as {
       declare Integer sum;
       set sum = 0;
       truncate table Equipment;
       for i in range(n)
       {
         declare Integer w;
         set w = rand(1,100);
         insert into Equipment values ('E' || i, w);
         commit; -- To clear log
         set sum = sum + w;
       };
       return sum/n;
     }

Run it:

populate_equipment3(10)

Check that the average weight was computed correctly:

select avg(weight) from Equipment
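The same cross-check, a running sum kept in a local variable compared against the aggregate computed by the database, can be sketched in Python with sqlite3. This is an analogy, not OSQL. Assumptions: SQLite stands in for the table, delete from replaces truncate table because SQLite has no truncate statement, and the items are numbered from 1.

```python
import random
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "create table Equipment(name text, weight integer, unique(name))"
)

def populate_equipment3(n: int) -> float:
    """Repopulate the table and return the average of the inserted weights."""
    conn.execute("delete from Equipment")  # stand-in for truncate table
    total = 0  # local accumulator, like the declared variable sum
    for i in range(1, n + 1):
        w = random.randint(1, 100)  # local to each iteration, like w
        conn.execute("insert into Equipment values (?, ?)", (f"E{i}", w))
        conn.commit()
        total += w
    return total / n  # true division, so the result is a float

avg_returned = populate_equipment3(10)
avg_in_table = conn.execute(
    "select avg(weight) from Equipment"
).fetchone()[0]
```

The two averages should agree up to floating-point rounding.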

Returning bags

For procedural functions returning bags, the statement yield e adds a new element e to the result each time it is executed.

Example: The following procedural function populates Equipment and returns a bag of tuples containing the new item names and their weights.

create function populate_equipment4(Integer n) -> Bag of (Charstring, Integer)
  as {
       truncate table Equipment;
       for i in range(n)
       {
         declare Charstring nm, Integer w;
         set nm = 'E' || i;
         set w = rand(1,100);
         insert into Equipment values (nm, w);
         commit; -- To clear log
         yield (nm, w); -- Add result tuple
       };
     };

Run it:

populate_equipment4(10);
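The yield statement resembles a generator in other languages. The Python sketch below is an analogy only, not OSQL, and OSQL's evaluation strategy may differ from Python's lazy generators. Assumptions: SQLite stands in for the table, delete from replaces truncate table, and the items are numbered from 1.

```python
import random
import sqlite3
from typing import Iterator, Tuple

conn = sqlite3.connect(":memory:")
conn.execute(
    "create table Equipment(name text, weight integer, unique(name))"
)

def populate_equipment4(n: int) -> Iterator[Tuple[str, int]]:
    """Repopulate the table, yielding each (name, weight) as it is inserted."""
    conn.execute("delete from Equipment")  # stand-in for truncate table
    for i in range(1, n + 1):
        nm = f"E{i}"
        w = random.randint(1, 100)
        conn.execute("insert into Equipment values (?, ?)", (nm, w))
        conn.commit()
        yield (nm, w)  # add one tuple to the result

# A Python generator runs only when consumed; list() drives it to the end.
result = list(populate_equipment4(10))
stored = conn.execute(
    "select name, weight from Equipment order by rowid"
).fetchall()
```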

Iterative updates

Use SQL's update statement in procedural functions to update all table rows that fulfill some condition.

Example:

create function increase_weights(Integer inc, Integer thres) -> Integer
  as {
       update Equipment
          set weight = weight + inc
        where weight > thres;
       return rowcount();
     }

Let's test it:

select sum(weight) from Equipment;
increase_weights(10,50);
select sum(weight) from Equipment
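The expected effect of the test can be worked through with fixed data. The Python sketch below uses sqlite3 as an analogy, not OSQL; the three sample rows are hypothetical values chosen so the outcome is easy to check by hand, and the cursor's rowcount attribute plays the role of rowcount().

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "create table Equipment(name text, weight integer, unique(name))"
)
# Hypothetical sample data: two of the three weights exceed 50.
conn.executemany(
    "insert into Equipment values (?, ?)",
    [("E1", 20), ("E2", 60), ("E3", 90)],
)
conn.commit()

def increase_weights(inc: int, thres: int) -> int:
    """Add inc to every weight above thres; return the number of rows changed."""
    cur = conn.execute(
        "update Equipment set weight = weight + ? where weight > ?",
        (inc, thres),
    )
    conn.commit()
    return cur.rowcount

def total_weight() -> int:
    return conn.execute("select sum(weight) from Equipment").fetchone()[0]

before = total_weight()
changed = increase_weights(10, 50)
after = total_weight()
```

With this data the sum grows by inc times the number of rows changed, which is the relationship the two sum(weight) queries above let you verify.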