Type System Concepts
Type System Concepts
Introduction
Most modern programming languages come with their own native object systems and additional fundamental algorithmic language constructs. Just as GLib serves as an implementation of such fundamental types and algorithms (linked lists, hash tables and so forth), the GLib Object System provides the required implementations of a flexible, extensible, and intentionally easy to map (into other languages) object-oriented framework for C. The substantial elements that are provided can be summarized as:
- A generic type system to register arbitrary single-inherited flat and deep derived types as well as interfaces for structured types. It takes care of creation, initialization and memory management of the assorted object and class structures, maintains parent/child relationships and deals with dynamic implementations of such types. That is, their type specific implementations are relocatable/unloadable during runtime.
- A collection of fundamental type implementations, such as integers, doubles, enums and structured types, to name a few.
- A sample fundamental type implementation to base object hierarchies upon - the GObject fundamental type.
- A signal system that allows very flexible user customization of virtual/overridable object methods and can serve as a powerful notification mechanism.
- An extensible parameter/value system, supporting all the provided fundamental types that can be used to generically handle object properties or otherwise parameterized types.
Background
GObject, and its lower-level type system, GType, are used by GTK and most GNOME libraries to provide:
- object-oriented C-based APIs and
- automatic transparent API bindings to other compiled or interpreted languages.
A lot of programmers are used to working with compiled-only or dynamically interpreted-only languages and do not understand the challenges associated with cross-language interoperability. This introduction tries to provide an insight into these challenges and briefly describes the solution chosen by GLib.
The following chapters go into greater detail into how GType and GObject work and how you can use them as a C programmer. It is useful to keep in mind that allowing access to C objects from other interpreted languages was one of the major design goals: this can often explain the sometimes rather convoluted APIs and features present in this library.
Data types and programming
One could say that a programming language is merely a way to create data types and manipulate them. Most languages provide a number of language-native types and a few primitives to create more complex types based on these primitive types.
In C, the language provides types such as char, long, pointer. During compilation of C code, the compiler maps these language types to the compiler’s target architecture machine types. If you are using a C interpreter (assuming one exists), the interpreter (the program which interprets the source code and executes it) maps the language types to the machine types of the target machine at runtime, during the program execution (or just before execution if it uses a Just In Time compiler engine).
Perl and Python are interpreted languages which do not really provide type definitions similar to those used by C. Perl and Python programmers manipulate variables and the type of the variables is decided only upon the first assignment or upon the first use which forces a type on the variable. The interpreter also often provides a lot of automatic conversions from one type to the other. For example, in Perl, a variable which holds an integer can be automatically converted to a string given the required context:
my $tmp = 10;
print "this is an integer converted to a string:" . $tmp . "\n";
Of course, it is also often possible to explicitly specify conversions when the default conversions provided by the language are not intuitive.
Exporting a C API
C APIs are defined by a set of functions and global variables which are usually exported from a binary. C functions have an arbitrary number of arguments and one return value. Each function is thus uniquely identified by the function name and the set of C types which describe the function arguments and return value. The global variables exported by the API are similarly identified by their name and their type.
A C API is thus merely defined by a set of names to which a set of types are associated. If you know the function calling convention and the mapping of the C types to the machine types used by the platform you are on, you can resolve the name of each function to find where the code associated to this function is located in memory, and then construct a valid argument list for the function. Finally, all you have to do is trigger a call to the target C function with the argument list.
For the sake of discussion, here is a sample C function and the associated 32 bit x86 assembly code generated by GCC on a Linux computer:
static void
function_foo (int foo)
{
}
int
main (int argc,
char *argv[])
{
function_foo (10);
return 0;
}
push $0xa
call 0x80482f4 <function_foo>
The assembly code shown above is pretty straightforward: the first
instruction pushes the hexadecimal value 0xa (decimal value 10) as a 32-bit
integer on the stack and calls function_foo
. As you can see, C function
calls are implemented by GCC as native function calls (this is probably the
fastest implementation possible).
Now, let’s say we want to call the C function function_foo
from a Python
program. To do this, the Python interpreter needs to:
- Find where the function is located. This probably means finding the binary generated by the C compiler which exports this function.
- Load the code of the function in executable memory.
- Convert the Python parameters to C-compatible parameters before calling the function.
- Call the function with the right calling convention.
- Convert the return values of the C function to Python-compatible variables to return them to the Python code.
The process described above is pretty complex and there are a lot of ways to make it entirely automatic and transparent to C and Python programmers:
- The first solution is to write by hand a lot of glue code, once for each function exported or imported, which does the Python-to-C parameter conversion and the C-to-Python return value conversion. This glue code is then linked with the interpreter which allows Python programs to call Python functions which delegate work to C functions.
- Another, nicer solution is to automatically generate the glue code, once for each function exported or imported, with a special compiler which reads the original function signature.
The solution used by GLib is to use the GType library which holds at runtime a description of all the objects manipulated by the programmer. This so-called dynamic type library is then used by special generic glue code to automatically convert function parameters and function calling conventions between different runtime domains.
The greatest advantage of the solution implemented by GType is that the glue code sitting at the runtime domain boundaries is written once: the figure below states this more clearly.
Currently, there exist multiple generic glue code which makes it possible to use C objects written with GType directly in a variety of languages, with a minimum amount of work: there is no need to generate huge amounts of glue code either automatically or by hand.
Although that goal was arguably laudable, its pursuit has had a major influence on the whole GType/GObject library. C programmers are likely to be puzzled at the complexity of the features exposed in the following chapters if they forget that the GType/GObject library was not only designed to offer OO-like features to C programmers but also transparent cross-language interoperability.
The GLib Dynamic Type System
A type, as manipulated by the GLib type system, is much more generic than what is usually understood as an Object type. It is best explained by looking at the structure and the functions used to register new types in the type system.
typedef struct _GTypeInfo GTypeInfo;
struct _GTypeInfo
{
/* interface types, classed types, instantiated types */
guint16 class_size;
GBaseInitFunc base_init;
GBaseFinalizeFunc base_finalize;
/* classed types, instantiated types */
GClassInitFunc class_init;
GClassFinalizeFunc class_finalize;
gconstpointer class_data;
/* instantiated types */
guint16 instance_size;
guint16 n_preallocs;
GInstanceInitFunc instance_init;
/* value handling */
const GTypeValueTable *value_table;
};
GType
g_type_register_static (GType parent_type,
const gchar *type_name,
const GTypeInfo *info,
GTypeFlags flags);
GType
g_type_register_fundamental (GType type_id,
const gchar *type_name,
const GTypeInfo *info,
const GTypeFundamentalInfo *finfo,
GTypeFlags flags);
g_type_register_static()
, g_type_register_dynamic()
and
g_type_register_fundamental()
are the C functions, defined in gtype.h
and implemented in gtype.c
which you should use to register a new GType in
the program’s type system. It is not likely you will ever need to use
g_type_register_fundamental()
but in case you want to, the last chapter
explains how to create new fundamental types.
Fundamental types are top-level types which do not derive from any other type while other non-fundamental types derive from other types. Upon initialization, the type system not only initializes its internal data structures but it also registers a number of core types: some of these are fundamental types. Others are types derived from these fundamental types.
Fundamental and non-fundamental types are defined by:
- class size: the
class_size
field inGTypeInfo
. - class initialization functions (C++ constructor): the
base_init
andclass_init
fields inGTypeInfo
. - class destruction functions (C++ destructor): the
base_finalize
andclass_finalize
fields inGTypeInfo
. - instance size (C++ parameter to new): the
instance_size
field inGTypeInfo
. - instantiation policy (C++ type of new operator): the
n_preallocs
field inGTypeInfo
. - copy functions (C++ copy operators): the
value_table
field inGTypeInfo
. - type characteristic flags:
GTypeFlags
.
Fundamental types are also defined by a set of GTypeFundamentalFlags
which
are stored in a GTypeFundamentalInfo
. Non-fundamental types are
furthermore defined by the type of their parent which is passed as the
parent_type
parameter to g_type_register_static()
and
g_type_register_dynamic()
.
Copy functions
The major common point between all GLib types (fundamental and non-fundamental, classed and non-classed, instantiatable and non-instantiatable) is that they can all be manipulated through a single API to copy/assign them.
The GValue
structure is used as an abstract container for all of these
types. Its simplistic API (defined in gobject/gvalue.h
) can be used to
invoke the value_table
functions registered during type registration: for
example g_value_copy()
copies the content of a GValue
to another
GValue
. This is similar to a C++ assignment which invokes the C++ copy
operator to modify the default bit-by-bit copy semantics of C++/C structures/classes.
The following code shows how you can copy around a 64 bit integer, as well
as a GObject
instance pointer:
static void test_int (void)
{
GValue a_value = G_VALUE_INIT;
GValue b_value = G_VALUE_INIT;
guint64 a, b;
a = 0xdeadbeef;
g_value_init (&a_value, G_TYPE_UINT64);
g_value_set_uint64 (&a_value, a);
g_value_init (&b_value, G_TYPE_UINT64);
g_value_copy (&a_value, &b_value);
b = g_value_get_uint64 (&b_value);
if (a == b) {
g_print ("Yay !! 10 lines of code to copy around a uint64.\n");
} else {
g_print ("Are you sure this is not a Z80 ?\n");
}
}
static void test_object (void)
{
GObject *obj;
GValue obj_vala = G_VALUE_INIT;
GValue obj_valb = G_VALUE_INIT;
obj = g_object_new (VIEWER_TYPE_FILE, NULL);
g_value_init (&obj_vala, VIEWER_TYPE_FILE);
g_value_set_object (&obj_vala, obj);
g_value_init (&obj_valb, G_TYPE_OBJECT);
/* g_value_copy's semantics for G_TYPE_OBJECT types is to copy the reference.
* This function thus calls g_object_ref.
* It is interesting to note that the assignment works here because
* VIEWER_TYPE_FILE is a G_TYPE_OBJECT.
*/
g_value_copy (&obj_vala, &obj_valb);
g_object_unref (G_OBJECT (obj));
g_object_unref (G_OBJECT (obj));
}
The important point about the above code is that the exact semantics of the
copy calls is undefined since they depend on the implementation of the copy
function. Certain copy functions might decide to allocate a new chunk of
memory and then to copy the data from the source to the destination. Others
might want to simply increment the reference count of the instance and copy
the reference to the new GValue
.
The value table used to specify these assignment functions is documented in
GTypeValueTable
.
Interestingly, it is also very unlikely you will ever need to specify a
value_table
during type registration because these value_tables
are
inherited from the parent types for non-fundamental types.
Conventions
There are a number of conventions users are expected to follow when creating new types which are to be exported in a header file:
- Type names (including object names) must be at least three characters long and start with a–z, A–Z or ‘_’.
- Use the
object_method
pattern for function names: to invoke the method named ‘save’ on an instance of object type ‘file’, callfile_save
. - Use prefixing to avoid namespace conflicts with other projects. If your
library (or application) is named
Viewer
, prefix all your function names withviewer_
. For example:viewer_file_save
. - The prefix should be a single term, i.e. should not contain any capital
letters after the first letter. For example,
Exampleprefix
rather thanExamplePrefix
. - This allows the lowercase-with-underscores versions of names to be automatically and unambiguously generated from the camel-case versions. See Name mangling.
- Multiple-term prefixes can be supported if additional arguments are passed to type and introspection tooling, but it’s best to avoid the need for this.
- Object/Class names (such as
File
in the examples here) may contain multiple terms. For example,LocalFile
. - Create a macro named
PREFIX_TYPE_OBJECT
which always returns theGType
for the associated object type. For an object of typeFile
in theViewer
namespace, use:VIEWER_TYPE_FILE
. This macro is implemented using a function namedprefix_object_get_type
; for example,viewer_file_get_type
. - Use
G_DECLARE_FINAL_TYPE
orG_DECLARE_DERIVABLE_TYPE
to define various other conventional macros for your object: PREFIX_OBJECT (obj)
, which returns a pointer of typePrefixObject
. This macro is used to enforce static type safety by doing explicit casts wherever needed. It also enforces dynamic type safety by doing runtime checks. It is possible to disable the dynamic type checks in production builds (see Building GLib). For example, we would createVIEWER_FILE (obj)
to keep the previous example.PREFIX_OBJECT_CLASS (klass)
, which is strictly equivalent to the previous casting macro: it does static casting with dynamic type checking of class structures. It is expected to return a pointer to a class structure of typePrefixObjectClass
. An example is:VIEWER_FILE_CLASS
.PREFIX_IS_OBJECT (obj)
, which returns a boolean which indicates whether the input object instance pointer is non-NULL
and of typeOBJECT
. For example,VIEWER_IS_FILE
.PREFIX_IS_OBJECT_CLASS (klass)
, which returns a boolean if the input class pointer is a pointer to a class of typeOBJECT
. For example,VIEWER_IS_FILE_CLASS
.PREFIX_OBJECT_GET_CLASS (obj)
, which returns the class pointer associated to an instance of a given type. This macro is used for static and dynamic type safety purposes (just like the previous casting macros). For example,VIEWER_FILE_GET_CLASS
.
The implementation of these macros is pretty straightforward: a number of
simple-to-use macros are provided in gtype.h
. For the example we used above,
we would write the following trivial code to declare the macros:
#define VIEWER_TYPE_FILE viewer_file_get_type()
G_DECLARE_FINAL_TYPE (ViewerFile, viewer_file, VIEWER, FILE, GObject)
Unless your code has special requirements, you can use the G_DEFINE_TYPE
macro to define a class:
G_DEFINE_TYPE (ViewerFile, viewer_file, G_TYPE_OBJECT)
Otherwise, the viewer_file_get_type
function must be implemented manually:
GType viewer_file_get_type (void)
{
static GType type = 0;
if (type == 0) {
const GTypeInfo info = {
/* You fill this structure. */
};
type = g_type_register_static (G_TYPE_OBJECT,
"ViewerFile",
&info, 0);
}
return type;
}
Name mangling
GObject tooling, in particular introspection, relies on being able to
unambiguously convert between type names (in camel-case, such as
GNetworkMonitor
or MyViewerFile
) and function prefixes (in
lowercase-with-underscores, such as g_network_monitor
or my_viewer_file
).
The latter can then be used to prefix methods such as
g_network_monitor_can_reach()
or my_viewer_file_get_type()
.
The algorithm for converting from camel-case to lowercase-with-underscores is:
- The output is a lower-case version of the input, with zero or more ‘splits’.
- Wherever the input is ‘split’, insert an underscore in the output.
- Split the input on:
- Each character (after index zero) which is uppercase and the previous character is not uppercase
- The second character (index one) if it is uppercase and the first character (index zero) is uppercase
- Each character (after index two) which is uppercase and the previous two characters are also uppercase
Non-instantiatable non-classed fundamental types
A lot of types are not instantiatable by the type system and do not have a class. Most of these types are fundamental trivial types such as gchar
, and are already registered by GLib.
In the rare case of needing to register such a type in the type system, fill a GTypeInfo
structure with zeros since these types are also most of the time fundamental:
GTypeInfo info = {
.class_size = 0,
.base_init = NULL,
.base_finalize = NULL,
.class_init = NULL,
.class_finalize = NULL,
.class_data = NULL,
.instance_size = 0,
.n_preallocs = 0,
.instance_init = NULL,
.value_table = NULL,
};
static const GTypeValueTable value_table = {
.value_init = value_init_long0,
.value_free = NULL,
.value_copy = value_copy_long0,
.value_peek_pointer = NULL,
.collect_format = "i",
.collect_value = value_collect_int,
.lcopy_format = "p",
.lcopy_value = value_lcopy_char,
};
info.value_table = &value_table;
type = g_type_register_fundamental (G_TYPE_CHAR, "gchar", &info, &finfo, 0);
Having non-instantiatable types might seem a bit useless: what good is a
type if you cannot instantiate an instance of that type? Most of these types
are used in conjunction with GValue
s: a GValue
is initialized with an
integer or a string and it is passed around by using the registered type’s
value_table
. GValue
s (and by extension these trivial fundamental types)
are most useful when used in conjunction with object properties and signals.
Instantiatable classed types: objects
Types which are registered with a class and are declared instantiatable are
what most closely resembles an object. Although GObject
s are the most well
known type of instantiatable classed types, other kinds of similar objects
used as the base of an inheritance hierarchy have been externally developed
and they are all built on the fundamental features described below.
For example, the code below shows how you could register such a fundamental object type in the type system (using none of the GObject convenience API):
typedef struct {
GObject parent_instance;
/* instance members */
char *filename;
} ViewerFile;
typedef struct {
GObjectClass parent_class;
/* class members */
/* the first is public, pure and virtual */
void (*open) (ViewerFile *self,
GError **error);
/* the second is public and virtual */
void (*close) (ViewerFile *self,
GError **error);
} ViewerFileClass;
#define VIEWER_TYPE_FILE (viewer_file_get_type ())
GType
viewer_file_get_type (void)
{
static GType type = 0;
if (type == 0) {
const GTypeInfo info = {
.class_size = sizeof (ViewerFileClass),
.base_init = NULL,
.base_finalize = NULL,
.class_init = (GClassInitFunc) viewer_file_class_init,
.class_finalize = NULL,
.class_data = NULL,
.instance_size = sizeof (ViewerFile),
.n_preallocs = 0,
.instance_init = (GInstanceInitFunc) viewer_file_init,
};
type = g_type_register_static (G_TYPE_OBJECT,
"ViewerFile",
&info, 0);
}
return type;
}
Upon the first call to viewer_file_get_type
, the type named ViewerFile
will be registered in the type system as inheriting from the type G_TYPE_OBJECT
.
Every object must define two structures: its class structure and its instance structure. All class structures must contain as first member a GTypeClass
structure. All instance structures must contain as first member a GTypeInstance
structure. The declaration of these C types, coming from gtype.h
is shown below:
struct _GTypeClass
{
GType g_type;
};
struct _GTypeInstance
{
GTypeClass *g_class;
};
These constraints allow the type system to make sure that every object instance (identified by a pointer to the object’s instance structure) contains in its first bytes a pointer to the object’s class structure.
This relationship is best explained by an example: let’s take object B which inherits from object A:
/* A definitions */
typedef struct {
GTypeInstance parent;
int field_a;
int field_b;
} A;
typedef struct {
GTypeClass parent_class;
void (*method_a) (void);
void (*method_b) (void);
} AClass;
/* B definitions. */
typedef struct {
A parent;
int field_c;
int field_d;
} B;
typedef struct {
AClass parent_class;
void (*method_c) (void);
void (*method_d) (void);
} BClass;
The C standard mandates that the first field of a C structure is stored starting in the first byte of the buffer used to hold the structure’s fields in memory. This means that the first field of an instance of an object B is A’s first field which in turn is GTypeInstance
‘s first field which in turn is g_class
, a pointer to B’s class structure.
Thanks to these simple conditions, it is possible to detect the type of every object instance by doing:
B *b;
b->parent.parent.g_class->g_type
or, more compactly:
B *b;
((GTypeInstance *) b)->g_class->g_type
Initialization and destruction
Instantiation of these types can be done with g_type_create_instance()
, which
will look up the type information structure associated with the type
requested. Then, the instance size and instantiation policy (if the
n_preallocs
field is set to a non-zero value, the type system allocates the
object’s instance structures in chunks rather than mallocing for every
instance) declared by the user are used to get a buffer to hold the object’s
instance structure.
If this is the first instance of the object ever created, the type system
must create a class structure. It allocates a buffer to hold the object’s
class structure and initializes it. The first part of the class structure
(ie: the embedded parent class structure) is initialized by copying the
contents from the class structure of the parent class. The rest of class
structure is initialized to zero. If there is no parent, the entire class
structure is initialized to zero. The type system then invokes the
base_init
functions (GBaseInitFunc)
from topmost fundamental
object to bottom-most most derived object. The object’s class_init
(GClassInitFunc)
function is invoked afterwards to complete initialization
of the class structure. Finally, the object’s interfaces are initialized (we
will discuss interface initialization in more detail later).
Once the type system has a pointer to an initialized class structure, it
sets the object’s instance class pointer to the object’s class structure and
invokes the object’s instance_init
(GInstanceInitFunc)
functions, from
top-most fundamental type to bottom-most most-derived type.
Object instance destruction through g_type_free_instance()
is very simple:
the instance structure is returned to the instance pool if there is one and
if this was the last living instance of the object, the class is destroyed.
Class destruction (the concept of destruction is sometimes partly referred
to as finalization in GType) is the symmetric process of the initialization:
interfaces are destroyed first. Then, the most derived class_finalize
(GClassFinalizeFunc)
function is invoked. Finally, the base_finalize
(GBaseFinalizeFunc)
functions are invoked from bottom-most most-derived type
to top-most fundamental type and the class structure is freed.
The base initialization/finalization process is very similar to the C++
constructor/destructor paradigm. The practical details are different though
and it is important not to get confused by superficial similarities. GTypes
have no instance destruction mechanism. It is the user’s responsibility to
implement correct destruction semantics on top of the existing GType code.
(This is what GObject
does) Furthermore, C++
code equivalent to the base_init
and class_init
callbacks of GType is
usually not needed because C++ cannot really create object types at runtime.
The instantiation/finalization process can be summarized as follows:
Invocation time | Function invoked | Function’s parameters |
---|---|---|
First call to g_type_create_instance() for target type |
type’s base_init function |
On the inheritance tree of classes from fundamental type to target type. base_init is invoked once for each class structure. |
target type’s class_init function |
On target type’s class structure | |
interface initialization, see the section called “Interface Initialization” | ||
Each call to g_type_create_instance() for target type |
target type’s instance_init function |
On object’s instance |
Last call to g_type_free_instance() for target type |
interface destruction, see the section called “Interface Destruction” | |
target type’s class_finalize function |
On target type’s class structure | |
type’s base_finalize function |
On the inheritance tree of classes from fundamental type to target type. base_finalize is invoked once for each class structure. |
Non-instantiatable classed types: interfaces
GType’s interfaces are very similar to Java’s interfaces. They allow to describe a common API that several classes will adhere to. Imagine the play, pause and stop buttons on hi-fi equipment—those can be seen as a playback interface. Once you know what they do, you can control your CD player, MP3 player or anything that uses these symbols.
To declare an interface you have to register a non-instantiatable classed
type which derives from GTypeInterface
. The following piece of code declares
such an interface:
#define VIEWER_TYPE_EDITABLE viewer_editable_get_type ()
G_DECLARE_INTERFACE (ViewerEditable, viewer_editable, VIEWER, EDITABLE, GObject)
struct _ViewerEditableInterface {
GTypeInterface parent;
void (*save) (ViewerEditable *self,
GError **error);
};
void viewer_editable_save (ViewerEditable *self,
GError **error);
The interface function, viewer_editable_save
is implemented in a pretty
simple way:
void
viewer_editable_save (ViewerEditable *self,
GError **error)
{
ViewerEditableinterface *iface;
g_return_if_fail (VIEWER_IS_EDITABLE (self));
g_return_if_fail (error == NULL || *error == NULL);
iface = VIEWER_EDITABLE_GET_IFACE (self);
g_return_if_fail (iface->save != NULL);
iface->save (self);
}
viewer_editable_get_type
registers a type named ViewerEditable
which
inherits from G_TYPE_INTERFACE
. All interfaces must be children of
G_TYPE_INTERFACE
in the inheritance tree.
An interface is defined by only one structure which must contain as first
member a GTypeInterface
structure. The interface structure is expected to
contain the function pointers of the interface methods. It is good style to
define helper functions for each of the interface methods which simply call
the interface’s method directly: viewer_editable_save
is one of these.
If you have no special requirements you can use the G_IMPLEMENT_INTERFACE
macro to implement an interface:
static void
viewer_file_save (ViewerEditable *self)
{
g_print ("File implementation of editable interface save method.\n");
}
static void
viewer_file_editable_interface_init (ViewerEditableInterface *iface)
{
iface->save = viewer_file_save;
}
G_DEFINE_TYPE_WITH_CODE (ViewerFile, viewer_file, VIEWER_TYPE_FILE,
G_IMPLEMENT_INTERFACE (VIEWER_TYPE_EDITABLE,
viewer_file_editable_interface_init))
If your code does have special requirements, you must write a custom
get_type
function to register your GType which inherits from some GObject
and which implements the interface ViewerEditable
. For example, this code
registers a new ViewerFile
class which implements ViewerEditable
:
static void
viewer_file_save (ViewerEditable *editable)
{
g_print ("File implementation of editable interface save method.\n");
}
static void
viewer_file_editable_interface_init (gpointer g_iface,
gpointer iface_data)
{
ViewerEditableInterface *iface = g_iface;
iface->save = viewer_file_save;
}
GType
viewer_file_get_type (void)
{
static GType type = 0;
if (type == 0) {
const GTypeInfo info = {
.class_size = sizeof (ViewerFileClass),
.base_init = NULL,
.base_finalize = NULL,
.class_init = (GClassInitFunc) viewer_file_class_init,
.class_finalize = NULL,
.class_data = NULL,
.instance_size = sizeof (ViewerFile),
.n_preallocs = 0,
.instance_init = (GInstanceInitFunc) viewer_file_init
};
const GInterfaceInfo editable_info = {
.interface_init = (GInterfaceInitFunc) viewer_file_editable_interface_init,
.interface_finalize = NULL,
.interface_data = NULL,
};
type = g_type_register_static (VIEWER_TYPE_FILE,
"ViewerFile",
&info, 0);
g_type_add_interface_static (type,
VIEWER_TYPE_EDITABLE,
&editable_info);
}
return type;
}
g_type_add_interface_static()
records in the type system that the given
ViewerFile
type implements also ViewerEditable
(viewer_editable_get_type()
returns the type of ViewerEditable
). The
GInterfaceInfo
structure holds information about the implementation of the interface:
struct _GInterfaceInfo
{
GInterfaceInitFunc interface_init;
GInterfaceFinalizeFunc interface_finalize;
gpointer interface_data;
};
Interface initialization
When an instantiatable classed type which implements an interface (either directly or by inheriting an implementation from a superclass) is created for the first time, its class structure is initialized following the process described in the section called “Instantiatable classed types: objects”. After that, the interface implementations associated with the type are initialized.
First a memory buffer is allocated to hold the interface structure. The
parent’s interface structure is then copied over to the new interface
structure (the parent interface is already initialized at that point). If
there is no parent interface, the interface structure is initialized with
zeros. The g_type
and the g_instance_type
fields are then initialized:
g_type
is set to the type of the most-derived interface and g_instance_type
is set to the type of the most derived type which implements this interface.
The interface’s base_init
function is called, and then the interface’s
default_init
is invoked. Finally if the type has registered an
implementation of the interface, the implementation’s interface_init
function is invoked. If there are multiple implementations of an interface
the base_init
and interface_init
functions will be invoked once for each
implementation initialized.
It is thus recommended to use a default_init
function to initialize an
interface. This function is called only once for the interface no matter how
many implementations there are. The default_init
function is declared by
G_DEFINE_INTERFACE
which can be used to define the interface:
G_DEFINE_INTERFACE (ViewerEditable, viewer_editable, G_TYPE_OBJECT)
static void
viewer_editable_default_init (ViewerEditableInterface *iface)
{
/* add properties and signals here, will only be called once */
}
Or you can do that yourself in a GType function for your interface:
GType
viewer_editable_get_type (void)
{
static gsize type_id = 0;
if (g_once_init_enter (&type_id)) {
const GTypeInfo info = {
sizeof (ViewerEditableInterface),
NULL, /* base_init */
NULL, /* base_finalize */
viewer_editable_default_init, /* class_init */
NULL, /* class_finalize */
NULL, /* class_data */
0, /* instance_size */
0, /* n_preallocs */
NULL /* instance_init */
};
GType type = g_type_register_static (G_TYPE_INTERFACE,
"ViewerEditable",
&info, 0);
g_once_init_leave (&type_id, type);
}
return type_id;
}
static void
viewer_editable_default_init (ViewerEditableInterface *iface)
{
/* add properties and signals here, will only called once */
}
In summary, interface initialization uses the following functions:
Invocation time | Function Invoked | Function’s parameters | Remark |
---|---|---|---|
First call to g_type_create_instance() for any type implementing interface |
interface’s base_init function |
On interface’s vtable | Rarely necessary to use this. Called once per instantiated classed type implementing the interface. |
First call to g_type_create_instance() for each type implementing interface |
interface’s default_init function |
On interface’s vtable | Register interface’s signals, properties, etc. here. Will be called once. |
First call to g_type_create_instance() for any type implementing interface |
implementation’s interface_init function |
On interface’s vtable | Initialize interface implementation. Called for each class that that implements the interface. Initialize the interface method pointers in the interface structure to the implementing class’s implementation. |
Interface Destruction
When the last instance of an instantiatable type which registered an interface implementation is destroyed, the interface’s implementations associated to the type are destroyed.
To destroy an interface implementation, GType first calls the
implementation’s interface_finalize
function and then the interface’s
most-derived base_finalize
function.
Again, it is important to understand, as in the section called “Interface
Initialization”, that both interface_finalize
and base_finalize
are
invoked exactly once for the destruction of each implementation of an
interface. Thus, if you were to use one of these functions, you would need
to use a static integer variable which would hold the number of instances of
implementations of an interface such that the interface’s class is destroyed
only once (when the integer variable reaches zero).
The above process can be summarized as follows:
Invocation time | Function Invoked | Function’s parameters |
---|---|---|
Last call to g_type_free_instance() for type implementing interface |
interface’s interface_finalize function |
On interface’s vtable |
interface’s base_finalize function |
On interface’s vtable |
The GObject base class
The previous chapter discussed the details of GLib’s Dynamic Type System.
The GObject library also contains an implementation for a base fundamental
type named GObject
.
GObject
is a fundamental classed instantiatable type. It implements:
- memory management with reference counting
- construction/Destruction of instances
- generic per-object properties with set/get function pairs
- easy use of signals
All the GNOME libraries which use the GLib type system (like GTK and
GStreamer) inherit from GObject
which is why it is important to understand
the details of how it works.
Object instantiation
The g_object_new()
family of functions can be used to instantiate any
GType which inherits from the GObject base type. All these functions make
sure the class and instance structures have been correctly initialized by
GLib’s type system and then invoke at one point or another the constructor
class method which is used to:
- allocate and clear memory through
g_type_create_instance()
- initialize the object’s instance with the construction properties.
GObject explicitly guarantees that all class and instance members (except the fields pointing to the parents) to be set to zero.
Once all construction operations have been completed and constructor properties set, the constructed class method is called.
Objects which inherit from GObject
are allowed to override this
constructed class method. The example below shows how ViewerFile
overrides
the parent’s construction process:
#define VIEWER_TYPE_FILE viewer_file_get_type ()
G_DECLARE_FINAL_TYPE (ViewerFile, viewer_file, VIEWER, FILE, GObject)
struct _ViewerFile
{
GObject parent_instance;
/* instance members */
char *filename;
guint zoom_level;
};
/* will create viewer_file_get_type and set viewer_file_parent_class */
G_DEFINE_TYPE (ViewerFile, viewer_file, G_TYPE_OBJECT)
static void
viewer_file_constructed (GObject *obj)
{
/* update the object state depending on constructor properties */
/* Always chain up to the parent constructed function to complete object
* initialisation. */
G_OBJECT_CLASS (viewer_file_parent_class)->constructed (obj);
}
static void
viewer_file_finalize (GObject *obj)
{
ViewerFile *self = VIEWER_FILE (obj);
g_free (self->filename);
/* Always chain up to the parent finalize function to complete object
* destruction. */
G_OBJECT_CLASS (viewer_file_parent_class)->finalize (obj);
}
static void
viewer_file_class_init (ViewerFileClass *klass)
{
GObjectClass *object_class = G_OBJECT_CLASS (klass);
object_class->constructed = viewer_file_constructed;
object_class->finalize = viewer_file_finalize;
}
static void
viewer_file_init (ViewerFile *self)
{
/* initialize the object */
}
If the user instantiates an object ViewerFile
with:
ViewerFile *file = g_object_new (VIEWER_TYPE_FILE, NULL);
If this is the first instantiation of such an object, the
viewer_file_class_init
function will be invoked after any
viewer_file_base_class_init
function. This will make sure the class
structure of this new object is correctly initialized. Here,
viewer_file_class_init
is expected to override the object’s class methods
and setup the class’ own methods. In the example above, the constructed
method is the only overridden method: it is set to
viewer_file_constructed
.
Once g_object_new()
has obtained a reference to an initialized class
structure, it invokes its constructor method to create an instance of the
new object, if the constructor has been overridden in
viewer_file_class_init
. Overridden constructors must chain up to their
parent’s constructor. In order to find the parent class and chain up to the
parent class constructor, we can use the viewer_file_parent_class
pointer
that has been set up for us by the G_DEFINE_TYPE
macro.
Finally, at one point or another, g_object_constructor
is invoked by the
last constructor in the chain. This function allocates the object’s instance
buffer through g_type_create_instance()
which means that the
instance_init
function is invoked at this point if one was registered.
After instance_init
returns, the object is fully initialized and should be
ready to have its methods called by the user. When
g_type_create_instance()
returns, g_object_constructor
sets the
construction properties (i.e. the properties which were given to
g_object_new()
) and returns to the user’s constructor.
The process described above might seem a bit complicated, but it can be
summarized easily by the table below which lists the functions invoked by
g_object_new()
and their order of invocation:
Invocation time | Function invoked | Function’s parameters | Remark |
---|---|---|---|
First call to g_object_new() for target type |
target type’s base_init function |
On the inheritance tree of classes from fundamental type to target type. base_init is invoked once for each class structure. |
Never used in practice. Unlikely you will need it. |
target type’s class_init function |
On target type’s class structure | Here, you should make sure to initialize or override class methods (that is, assign to each class’ method its function pointer) and create the signals and the properties associated to your object. | |
interface’s base_init function |
On interface’s vtable | ||
interface’s interface_init function |
On interface’s vtable | ||
Each call to g_object_new() for target type |
target type’s class constructor method: GObjectClass->constructor
|
On object’s instance | If you need to handle construct properties in a custom way, or implement a singleton class, override the constructor method and make sure to chain up to the object’s parent class before doing your own initialization. In doubt, do not override the constructor method. |
type’s instance_init function |
On the inheritance tree of classes from fundamental type to target type. The instance_init provided for each type is invoked once for each instance structure. |
Provide an instance_init function to initialize your object before its construction properties are set. This is the preferred way to initialize a GObject instance. This function is equivalent to C++ constructors. |
|
target type’s class constructed method: GObjectClass->constructed |
On object’s instance | If you need to perform object initialization steps after all construct properties have been set. This is the final step in the object initialization process, and is only called if the constructor method returned a new object instance (rather than, for example, an existing singleton). |
Readers should feel concerned about one little twist in the order in which
functions are invoked: while, technically, the class’ constructor method is
called before the GType’s instance_init
function (since
g_type_create_instance()
which calls instance_init
is called by
g_object_constructor
which is the top-level class constructor method and
to which users are expected to chain to), the user’s code which runs in a
user-provided constructor will always run after GType’s instance_init
function since the user-provided constructor must (you’ve been warned) chain
up before doing anything useful.
Object memory management
The memory-management API for GObjects is a bit complicated but the idea behind it is pretty simple: the goal is to provide a flexible model based on reference counting which can be integrated in applications which use or require different memory management models (such as garbage collection). The methods which are used to manipulate this reference count are described below.
Reference count
The functions g_object_ref()
and g_object_unref()
increase and decrease
the reference count, respectively. These functions are thread-safe.
g_clear_object()
is a convenience wrapper around g_object_unref()
which
also clears the pointer passed to it.
The reference count is initialized to one by g_object_new()
which means
that the caller is currently the sole owner of the newly-created reference.
(If the object is derived from GInitiallyUnowned
, this reference is
“floating”, and must be “sunk”, i.e. transformed into a real reference.)
When the reference count reaches zero, that is, when g_object_unref()
is
called by the last owner of a reference to the object, the dispose()
and
the finalize()
class methods are invoked.
Finally, after finalize()
is invoked, g_type_free_instance()
is called
to free the object instance. Depending on the memory allocation policy
decided when the type was registered (through one of the g_type_register_*
functions), the object’s instance memory will be freed or returned to the
object pool for this type. Once the object has been freed, if it was the
last instance of the type, the type’s class will be destroyed as described
in the section called “Instantiatable classed types: objects” and the
section called “Non-instantiatable classed types: interfaces”.
The table below summarizes the destruction process of a GObject
:
Invocation time | Function invoked | Function’s parameters | Remark |
---|---|---|---|
Last call to g_object_unref() for an instance of target type |
target type’s dispose class function | GObject instance | When dispose ends, the object should not hold any reference to any other member object. The object is also expected to be able to answer client method invocations (with possibly an error code but no memory violation) until finalize is executed. dispose can be executed more than once. dispose should chain up to its parent implementation just before returning to the caller. |
target type’s finalize class function | GObject instance | Finalize is expected to complete the destruction process initiated by dispose. It should complete the object’s destruction. finalize will be executed only once. finalize should chain up to its parent implementation just before returning to the caller. See the section on “Reference counts and cycles” for more information. | |
Last call to g_object_unref() for the last instance of target type |
interface’s interface_finalize function |
On interface’s vtable | Never used in practice. Unlikely you will need it. |
interface’s base_finalize function |
On interface’s vtable | Never used in practice. Unlikely you will need it. | |
target type’s class_finalize function |
On target type’s class structure | Never used in practice. Unlikely you will need it. | |
type’s base_finalize function |
On the inheritance tree of classes from fundamental type to target type. base_init is invoked once for each class structure. |
Never used in practice. Unlikely you will need it. |
Weak References
Weak references are used to monitor object finalization:
g_object_weak_ref()
adds a monitoring callback which does not hold a
reference to the object but which is invoked when the object runs its
dispose method. Weak references on the object are automatically dropped when
the instance is disposed, so there is no need to invoke g_object_weak_unref()
from the GWeakNotify
callback. Remember that the object instance is not
passed to the GWeakNotify
callback because the object has already been
disposed. Instead, the callback receives a pointer to where the object
previously was.
Weak references are also used to implement g_object_add_weak_pointer()
and
g_object_remove_weak_pointer()
. These functions add a weak reference to
the object they are applied to which makes sure to nullify the pointer given
by the user when object is finalized.
Similarly, GWeakRef
can be used to implement weak references if thread
safety is required.
Reference counts and cycles
GObject’s memory management model was designed to be easily integrated in
existing code using garbage collection. This is why the destruction process
is split in two phases: the first phase, executed in the dispose()
handler is
supposed to release all references to other member objects. The second
phase, executed by the finalize()
handler is supposed to complete the object’s
destruction process. Object methods should be able to run without program
error in-between the two phases.
This two-step destruction process is very useful to break reference counting
cycles. While the detection of the cycles is up to the external code, once
the cycles have been detected, the external code can invoke
g_object_run_dispose()
which will indeed break any existing cycles since
it will run the dispose handler associated to the object and thus release
all references to other objects.
This explains one of the rules about the dispose()
handler stated earlier:
the dispose()
handler can be invoked multiple times. Let’s say we have a
reference count cycle: object A references B which itself references object
A. Let’s say we have detected the cycle and we want to destroy the two
objects. One way to do this would be to invoke g_object_run_dispose()
on one
of the objects.
If object A releases all its references to all objects, this means it releases its reference to object B. If object B was not owned by anyone else, this is its last reference count which means this last unref runs B’s dispose handler which, in turn, releases B’s reference on object A. If this is A’s last reference count, this last unref runs A’s dispose handler which is running for the second time before A’s finalize handler is invoked!
The above example, which might seem a bit contrived, can really happen if GObjects are being handled by language bindings—hence the rules for object destruction should be closely followed.
Object properties
One of GObject’s nice features is its generic get/set mechanism for object
properties. When an object is instantiated, the object’s class_init
handler should be used to register the object’s properties with
g_object_class_install_properties()
.
The best way to understand how object properties work is by looking at a real example of how it is used:
// Implementation
typedef enum
{
PROP_FILENAME = 1,
PROP_ZOOM_LEVEL,
N_PROPERTIES
} ViewerFileProperty;
static GParamSpec *obj_properties[N_PROPERTIES] = { NULL, };
static void
viewer_file_set_property (GObject *object,
guint property_id,
const GValue *value,
GParamSpec *pspec)
{
ViewerFile *self = VIEWER_FILE (object);
switch ((ViewerFileProperty) property_id)
{
case PROP_FILENAME:
g_free (self->filename);
self->filename = g_value_dup_string (value);
g_print ("filename: %s\n", self->filename);
break;
case PROP_ZOOM_LEVEL:
self->zoom_level = g_value_get_uint (value);
g_print ("zoom level: %u\n", self->zoom_level);
break;
default:
/* We don't have any other property... */
G_OBJECT_WARN_INVALID_PROPERTY_ID (object, property_id, pspec);
break;
}
}
static void
viewer_file_get_property (GObject *object,
guint property_id,
GValue *value,
GParamSpec *pspec)
{
ViewerFile *self = VIEWER_FILE (object);
switch ((ViewerFileProperty) property_id)
{
case PROP_FILENAME:
g_value_set_string (value, self->filename);
break;
case PROP_ZOOM_LEVEL:
g_value_set_uint (value, self->zoom_level);
break;
default:
/* We don't have any other property... */
G_OBJECT_WARN_INVALID_PROPERTY_ID (object, property_id, pspec);
break;
}
}
static void
viewer_file_class_init (ViewerFileClass *klass)
{
GObjectClass *object_class = G_OBJECT_CLASS (klass);
object_class->set_property = viewer_file_set_property;
object_class->get_property = viewer_file_get_property;
obj_properties[PROP_FILENAME] =
g_param_spec_string ("filename",
"Filename",
"Name of the file to load and display from.",
NULL /* default value */,
G_PARAM_CONSTRUCT_ONLY | G_PARAM_READWRITE);
obj_properties[PROP_ZOOM_LEVEL] =
g_param_spec_uint ("zoom-level",
"Zoom level",
"Zoom level to view the file at.",
0 /* minimum value */,
10 /* maximum value */,
2 /* default value */,
G_PARAM_READWRITE);
g_object_class_install_properties (object_class,
N_PROPERTIES,
obj_properties);
}
// Use
ViewerFile *file;
GValue val = G_VALUE_INIT;
file = g_object_new (VIEWER_TYPE_FILE, NULL);
g_value_init (&val, G_TYPE_UINT);
g_value_set_char (&val, 11);
g_object_set_property (G_OBJECT (file), "zoom-level", &val);
g_value_unset (&val);
The client code above looks simple but a lot of things happen under the hood:
g_object_set_property()
first ensures a property with this name was
registered in file’s class_init
handler. If so it walks the class
hierarchy, from bottom-most most-derived type, to top-most fundamental type
to find the class which registered that property. It then tries to convert
the user-provided GValue into a GValue whose type is that of the associated property.
If the user provides a signed char GValue, as is shown here, and if the
object’s property was registered as an unsigned int
, g_value_transform()
will try to transform the input signed char into an unsigned int. Of course,
the success of the transformation depends on the availability of the
required transform function. In practice, there will almost always be a
transformation which matches and conversion will be carried out if needed.
After transformation, the GValue is validated by g_param_value_validate()
which makes sure the user’s data stored in the GValue matches the
characteristics specified by the property’s GParamSpec. Here, the GParamSpec
we provided in class_init
has a validation function which makes sure that
the GValue contains a value which respects the minimum and maximum bounds of
the GParamSpec. In the example above, the client’s GValue does not respect
these constraints (it is set to 11, while the maximum is 10). As such, the
g_object_set_property()
function will return with an error.
If the user’s GValue had been set to a valid value,
g_object_set_property()
would have proceeded with calling the object’s
set_property
class method. Here, since our implementation of ViewerFile
did override this method, execution would jump to viewer_file_set_property
after having retrieved from the GParamSpec the param_id
which had been
stored by g_object_class_install_property()
.
Once the property has been set by the object’s set_property
class method,
execution returns to g_object_set_property()
which makes sure that the
“notify” signal is emitted on the object’s instance with the changed
property as parameter unless notifications were frozen by
g_object_freeze_notify()
.
g_object_thaw_notify()
can be used to re-enable notification of property
modifications through the “notify” signal. It is important to remember that
even if properties are changed while property change notification is frozen,
the “notify” signal will be emitted once for each of these changed
properties as soon as the property change notification is thawed: no
property change is lost for the “notify” signal, although multiple
notifications for a single property are compressed. Signals can only be
delayed by the notification freezing mechanism.
It sounds like a tedious task to set up GValues every time when one wants to
modify a property. In practice one will rarely do this. The functions
g_object_set_property()
and g_object_get_property()
are meant to be used
by language bindings. For application there is an easier way and that is
described next.
Accessing multiple properties at once
It is interesting to note that the g_object_set()
and
g_object_set_valist()
(variadic version) functions can be used to set
multiple properties at once. The client code shown above can then be
re-written as:
ViewerFile *file;
file = /* */;
g_object_set (G_OBJECT (file),
"zoom-level", 6,
"filename", "~/some-file.txt",
NULL);
This saves us from managing the GValues that we were needing to handle when
using g_object_set_property()
. The code above will trigger one notify
signal emission for each property modified.
Equivalent _get
versions are also available: g_object_get()
and
g_object_get_valist()
(variadic version) can be used to get numerous
properties at once.
These high level functions have one drawback — they don’t provide a return value. One should pay attention to the argument types and ranges when using them. A known source of errors is to pass a different type from what the property expects; for instance, passing an integer when the property expects a floating point value and thus shifting all subsequent parameters by some number of bytes. Also forgetting the terminating NULL will lead to undefined behaviour.
This explains how g_object_new()
, g_object_newv()
and
g_object_new_valist()
work: they parse the user-provided variable number
of parameters and invoke g_object_set()
on the parameters only after the
object has been successfully constructed. The “notify” signal will be
emitted for each property set.
The GObject messaging system
Closures
Closures are central to the concept of asynchronous signal delivery which is widely used throughout GTK and GNOME applications. A closure is an abstraction, a generic representation of a callback. It is a small structure which contains three objects:
- a function pointer (the callback itself) whose prototype looks like:
return_type function_callback (... , gpointer user_data);
- the
user_data
pointer which is passed to the callback upon invocation of the closure - a function pointer which represents the destructor of the closure: whenever the closure’s refcount reaches zero, this function will be called before the closure structure is freed
The GClosure
structure represents the common functionality of all closure
implementations: there exists a different closure implementation for each
separate runtime which wants to use the GObject type system. The GObject
library provides a simple GCClosure type which is a specific implementation
of closures to be used with C/C++ callbacks.
A GClosure
provides simple services:
- invocation (
g_closure_invoke()
): this is what closures were created for; they hide the details of callback invocation from the callback invoker. - notification: the closure notifies listeners of certain events such as
closure invocation, closure invalidation and closure finalization.
Listeners can be registered with
g_closure_add_finalize_notifier()
(finalization notification),g_closure_add_invalidate_notifier()
(invalidation notification) andg_closure_add_marshal_guards()
(invocation notification). There exist symmetric deregistration functions for finalization and invalidation events (g_closure_remove_finalize_notifier()
andg_closure_remove_invalidate_notifier()
) but not for the invocation process
C Closures
If you are using C or C++ to connect a callback to a given event, you will
either use simple GCClosures
which have a pretty minimal API or the even
simpler g_signal_connect()
functions (which will be presented a bit later).
g_cclosure_new()
will create a new closure which can invoke the
user-provided callback_func
with the user-provided user_data
as its last
parameter. When the closure is finalized (second stage of the destruction
process), it will invoke the destroy_data
function if the user has
supplied one.
g_cclosure_new_swap()
will create a new closure which can invoke the
user-provided callback_func
with the user-provided user_data
as its
first parameter (instead of being the last parameter as with
g_cclosure_new()
). When the closure is finalized (second stage of the
destruction process), it will invoke the destroy_data
function if the user
has supplied one.
Non-C closures (for the fearless)
As was explained above, closures hide the details of callback invocation. In C, callback invocation is just like function invocation: it is a matter of creating the correct stack frame for the called function and executing a call assembly instruction.
C closure marshallers transform the array of GValues which represent the parameters to the target function into a C-style function parameter list, invoke the user-supplied C function with this new parameter list, get the return value of the function, transform it into a GValue and return this GValue to the marshaller caller.
A generic C closure marshaller is available as
g_cclosure_marshal_generic()
which implements marshalling for all function
types using libffi. Custom marshallers for different types are not needed
apart from performance critical code where the libffi-based marshaller may
be too slow.
An example of a custom marshaller is given below, illustrating how GValues
can be converted to a C function call. The marshaller is for a C function
which takes an integer as its first parameter and returns void
.
g_cclosure_marshal_VOID__INT (GClosure *closure,
GValue *return_value,
guint n_param_values,
const GValue *param_values,
gpointer invocation_hint,
gpointer marshal_data)
{
typedef void (*GMarshalFunc_VOID__INT) (gpointer data1,
gint arg_1,
gpointer data2);
register GMarshalFunc_VOID__INT callback;
register GCClosure *cc = (GCClosure*) closure;
register gpointer data1, data2;
g_return_if_fail (n_param_values == 2);
data1 = g_value_peek_pointer (param_values + 0);
data2 = closure->data;
callback = (GMarshalFunc_VOID__INT) (marshal_data ? marshal_data : cc->callback);
callback (data1,
g_marshal_value_peek_int (param_values + 1),
data2);
}
There exist other kinds of marshallers, for example there is a generic Python marshaller which is used by all Python closures (a Python closure is used to invoke a callback written in Python). This Python marshaller transforms the input GValue list representing the function parameters into a Python tuple which is the equivalent structure in Python.
Signals
GObject’s signals have nothing to do with standard UNIX signals: they connect arbitrary application-specific events with any number of listeners. For example, in GTK, every user event (keystroke or mouse move) is received from the windowing system and generates a GTK event in the form of a signal emission on the widget object instance.
Each signal is registered in the type system together with the type on which it can be emitted: users of the type are said to connect to the signal on a given type instance when they register a closure to be invoked upon the signal emission. Users can also emit the signal by themselves or stop the emission of the signal from within one of the closures connected to the signal.
When a signal is emitted on a given type instance, all the closures connected to this signal on this type instance will be invoked. All the closures connected to such a signal represent callbacks whose signature looks like:
return_type
function_callback (gpointer instance,
...,
gpointer user_data);
Signal registration
To register a new signal on an existing type, we can use any of
g_signal_newv()
, g_signal_new_valist()
or g_signal_new()
functions:
guint
g_signal_newv (const gchar *signal_name,
GType itype,
GSignalFlags signal_flags,
GClosure *class_closure,
GSignalAccumulator accumulator,
gpointer accu_data,
GSignalCMarshaller c_marshaller,
GType return_type,
guint n_params,
GType *param_types);
The number of parameters to these functions is a bit intimidating but they are relatively simple:
signal_name
: is a string which can be used to uniquely identify a given signalitype
: is the instance type on which this signal can be emittedsignal_flags
: partly defines the order in which closures which were connected to the signal are invokedclass_closure
: this is the default closure for the signal: if it is notNULL
upon the signal emission, it will be invoked upon this emission of the signal. The moment where this closure is invoked compared to other closures connected to that signal depends partly on thesignal_flags
accumulator
: this is a function pointer which is invoked after each closure has been invoked. If it returnsFALSE
, signal emission is stopped. If it returnsTRUE
, signal emission proceeds normally. It is also used to compute the return value of the signal based on the return value of all the invoked closures. For example, an accumulator could ignoreNULL
returns from closures; or it could build a list of the values returned by the closuresaccu_data
: this pointer will be passed down to each invocation of the accumulator during emissionc_marshaller
: this is the default C marshaller for any closure which is connected to this signalreturn_type
: this is the type of the return value of the signaln_params
: this is the number of parameters this signal takesparam_types
: this is an array of GTypes which indicate the type of each parameter of the signal. The length of this array is indicated byn_params
.
As you can see from the above definition, a signal is basically a description of the closures which can be connected to this signal and a description of the order in which the closures connected to this signal will be invoked.
Signal connection
If you want to connect to a signal with a closure, you have three possibilities:
- you can register a class closure at signal registration: this is a system-wide operation. i.e.: the class closure will be invoked during each emission of a given signal on any of the instances of the type which supports that signal
- you can use
g_signal_override_class_closure()
which overrides the class closure of a given type. It is possible to call this function only on a derived type of the type on which the signal was registered. This function is of use only to language bindings - you can register a closure with the
g_signal_connect()
family of functions. This is an instance-specific operation: the closure will be invoked only during emission of a given signal on a given instance
It is also possible to connect a different kind of callback on a given
signal: emission hooks are invoked whenever a given signal is emitted
whatever the instance on which it is emitted. Emission hooks are connected
with g_signal_add_emission_hook()
and removed with
g_signal_remove_emission_hook()
.
Signal emission
Signal emission is done through the use of the g_signal_emit()
family of functions.
void
g_signal_emitv (const GValue instance_and_params[],
guint signal_id,
GQuark detail,
GValue *return_value);
- the
instance_and_params
array of GValues contains the list of input parameters to the signal. The first element of the array is the instance pointer on which to invoke the signal. The following elements of the array contain the list of parameters to the signal signal_id
identifies the signal to invokedetail
identifies the specific detail of the signal to invoke. A detail is a kind of magic token/argument which is passed around during signal emission and which is used by closures connected to the signal to filter out unwanted signal emissions. In most cases, you can safely set this value to zero. See the section called “The detail argument” for more information about this parameterreturn_value
holds the return value of the last closure invoked during emission if no accumulator was specified. If an accumulator was specified during signal creation, this accumulator is used to calculate the return value as a function of the return values of all the closures invoked during emission. If no closure is invoked during emission, thereturn_value
is nonetheless initialized to zero/NULL
Signal emission can be decomposed in 6 steps:
RUN_FIRST
: if theG_SIGNAL_RUN_FIRST
flag was used during signal registration and if there exists a class closure for this signal, the class closure is invoked.EMISSION_HOOK
: if any emission hook was added to the signal, they are invoked from first to last added. Accumulate return values.HANDLER_RUN_FIRST
: if any closure were connected with theg_signal_connect()
family of functions, and if they are not blocked (with theg_signal_handler_block()
family of functions) they are run here, from first to last connected.RUN_LAST
: if theG_SIGNAL_RUN_LAST
flag was set during registration and if a class closure was set, it is invoked here.HANDLER_RUN_LAST
: if any closure were connected with theg_signal_connect_after()
family of functions, if they were not invoked duringHANDLER_RUN_FIRST
and if they are not blocked, they are run here, from first to last connected.RUN_CLEANUP
: if theG_SIGNAL_RUN_CLEANUP
flag was set during registration and if a class closure was set, it is invoked here. Signal emission is completed here.
If, at any point during the emission (except in the RUN_CLEANUP
or
EMISSION_HOOK
states), one of the closures stops the signal emission with
g_signal_stop_emission()
, the emission jumps to the RUN_CLEANUP
state.
If, at any point during emission, one of the closures or emission hook emits
the same signal on the same instance, emission is restarted from the
RUN_FIRST
state.
The accumulator function is invoked in all states, after invocation of each
closure (except in RUN_EMISSION_HOOK
and RUN_CLEANUP
). It accumulates
the closure return value into the signal return value and returns TRUE
or
FALSE
. If, at any point, it does not return TRUE
, emission jumps to
RUN_CLEANUP
state.
If no accumulator function was provided, the value returned by the last
handler run will be returned by g_signal_emit()
.
The detail argument
All the functions related to signal emission or signal connection have a parameter named the detail. Sometimes, this parameter is hidden by the API but it is always there, in one form or another.
Of the three main connection functions, only one has an explicit detail
parameter as a GQuark: g_signal_connect_closure_by_id()
.
The two other functions, g_signal_connect_closure()
and
g_signal_connect_data()
hide the detail parameter in the signal name
identification. Their detailed_signal
parameter is a string which
identifies the name of the signal to connect to. The format of this string
should match signal_name::detail_name
. For example, connecting to the
signal named notify::cursor_position
will actually connect to the signal
named notify
with the cursor_position
detail. Internally, the detail
string is transformed to a GQuark if it is present.
Of the four main signal emission functions, one hides it in its signal name
parameter: g_signal_emit_by_name()
. The other three have an explicit
detail parameter as a GQuark again: g_signal_emit()
, g_signal_emitv()
and g_signal_emit_valist()
.
If a detail is provided by the user to the emission function, it is used during emission to match against the closures which also provide a detail. If a closure’s detail does not match the detail provided by the user, it will not be invoked (even though it is connected to a signal which is being emitted).
This completely optional filtering mechanism is mainly used as an optimization for signals which are often emitted for many different reasons: the clients can filter out which events they are interested in before the closure’s marshalling code runs. For example, this is used extensively by the notify signal of GObject: whenever a property is modified on a GObject, instead of just emitting the notify signal, GObject associates as a detail to this signal emission the name of the property modified. This allows clients who wish to be notified of changes to only one property to filter most events before receiving them.
As a simple rule, users can and should set the detail parameter to zero: this will disable completely this optional filtering for that signal.