inria inria inria
dossier thématique

Research

Standardization

Without realizing it, many of the technologies that we use on a daily basis are in fact based on standards that are subject to a broad consensus generally on a global level : we cannot imagine exchanging documents via the Internet without having recourse to common conventions, such as Internet (IP) routing protocols or web presentation formats (HTML or XML). How can we share these common resources via telecommunications networks or IT grids ? This special feature underline INRIA research teams who have contributed to these standards with several industrial and academic partners within international organization for standardization and normalization.

Standardization: maximising the impact of research results

stand

We cannot imagine exchanging documents via the Internet without having recourse to common conventions, such as Internet (IP) routing protocols or Web presentation formats (HTML or XML). How can we share these common resources via telecommunications networks or IT grids?
In the IT world, rapid changes in technologies and the desire on the part of companies to provide products integrating new solutions in a timely fashion have led, over the past two decades, to the creation of specialised consortia, often unincorporated, uniting players from public and private research. These committees issue recommendations, that is, a set of specifications describing the implementation principle for the technique with a view to interoperability between the products manufactured by the players. Each company is free to follow these recommendations or not, but generally they are adopted and thus become a standard. The term standardization therefore implies the adoption, by most of the participants, of specifications drawn up within a collective working framework. All the same time, the players are aware of the advantage of being involved in the activities of standardization organisations (ISO, ETSI, etc.) that issue standards that are more regulatory in nature.
At INRIA (National Institute for Research in Computer Science and Automatic Control), for a number of reasons, this standardization culture has quickly entered into the everyday practices of a number of research teams. For these researchers, it is sometimes the best way to distribute and sustain their solutions. For some time now, the Institute itself has travelled this path by hosting the W3C (World wide web consortium), the consortium responsible for the development and promotion of Web standards, from as early as its creation in 1994 up to 2002. In this way, some 20 INRIA members participate in the W3C technical team. Moreover, an INRIA researcher, Jean-François Abramatic, presided over the W3C for 4 years, up to 2001.

An approach unequivocally supported by the Institute
INRIA solidly supports the standardization approach, says Gérard Giraudon, Director of Development and industrial relations: « In certain sectors, standardization is the only way to widely disseminate the work of researchers. For example, in the world of networks, irrespective of the merits of a technical solution, a solution will never be adopted by economic players if it is not recognised as a standard. Standardization is therefore indispensable to ensure the future existence of these technologies. It also offers worldwide scientific recognition of the work of researchers and in doing so, raises the profile of the Institute. This comes hand in hand with economic benefits: companies dedicated to the growth of technological developments and reinforced cooperation with the industrial companies concerned ».
Assisting researchers who wish to promote their solutions by providing financial and human resources is in particular indispensable in the telecommunications and middleware domains where the scientific, economic and social stakes are enormous.

A number of spin-off benefits
It is clear that the determination on the part of the institute and scientists conducting INRIA research projects, manifested by their efforts on these standardization committees, has contributed to international recognition of French research in STIC. These organisations provide moreover both a vehicle where researchers can showcase their solutions and also an area for testing, which often enables participants to anticipate technical and scientific developments. For participants, in addition to being an expert in a field, it also means having the opportunity to discover new fields of research, which in turn provides a way of monitoring technological developments.
At times, spin-off benefits can be as far-reaching as the creation of innovative companies, such as UDcast, which was founded as a result of work conducted by a research project to standardise the integration of satellite links in the Internet infrastructure (UDLR protocol, RFC 3077 at IETF) and Luceor, which is based on the OLSR protocol (RFC at IETF). In other cases, industrial partnerships and joint projects are initiated or reinforced such as within the framework of the Carroll research programme uniting INRIA, Thalès and the CEA in the software engineering and middleware technological fields.

Afnor - Standarmedia: an inventory of the standardization committees in the TIC field: http://www.standarmedia.com/

Internet: protocols on the front line

The protocols which are the keystone of the Internet must be adapted to face the explosive growth in numbers of connected machines. In addition, the components that integrate these protocols must be reliable, capable of high performance and compatible with one another.

We all know the success of the Internet that started at the beginning of the 1990s. The deployment of the Internet allowed access to numerous new online services and applications. The price to pay for such enthusiasm was that the limits of the worldwide network were soon brought to light. The Internet protocols, a set of rules, conventions and mechanisms that make it possible for the network to work correctly, are on the front line. These protocols require in particular that each machine be allocated an address—its IP address—an indispensable open sesame prior to all network communication. The IP address is a number of fixed size, and the number of connected machines is now exceeding the maximum allocation capacity (slightly more than 4 billion distinct addresses).

At the present time, the network primarily uses the IPv4 protocol (Internet Protocol version 4) that was standardized by the IETF (Internet Engineering Task Force), the Internet protocol standardization body. However, just like phone numbers had to go from 8 to 10 digits in order to increase the connection capabilities of the French telephone network, a new addressing mechanism allowing for a larger number of Internet addresses is currently being deployed. This mechanism called IPv6 should soon become the standard used by the new Internet. The first IETF specifications were published in December 1995.

Very early on, the major French players regrouped within the G6 association to foster the development and deployment of the new version. The ARMOR team (Rennes) was very active in it from the beginning. The team gathers together researchers from a department of ENST Bretagne (National Superior School of Telecommunications of Brittany) specializing in network protocols and researchers from INRIA Rennes well-known for their competence in network modeling, test and evaluation.

Preparing the Internet of the future
The team proposed a solution to the addressing problem at the IETF. The solution involves a mechanism that can be used temporarily to solve the lack of IPv4 Internet addresses. Moreover, much of the team's work concerns the transition from IPv4 to IPv6, via the temporary allocation of IPv4 addresses only when an application requires them. This makes it possible to use the new version of the protocol whenever possible and the “old” applications can nonetheless benefit from the advantages of IPv6 (such as mobility, self-configuring). ARMOR research scientists proposed solution for the protocol itself and its impact on applications, such as the DSTM technology (Dual Stack Transition Mechanism) which was submitted at the IETF in 1999 to let IPv4 and IPv6 coexist on the same machine. DSTM is in competition with other solutions. Three implementations have already been effected, at ENST Bretagne, by the ETRI (Electronics and Telecommunications Research Institute) in Korea and by Hewlett Packard.
In addition, problems that were not addressed by IPv4, such as router configuration (routers are the communication nodes of the network), are likely to take on increasing importance with IPv6, due to the explosion in numbers of technologies involved, machines connected to the network, and growing complexity thereof, since the network topology changes with user mobility. Concretely, if it is now possible to automatically configure machines, this is not the case for routers, which require the intervention of a network administrator. The latter has to configure the various links and routers of the network, and renew the configuration every time the network topology changes. This problem occurs in particular on the local scale, when a small company needs to deploy an internal network, or in the home where various wireless technologies have the wind in their sails.

In 2002, researchers of projects ARMOR and ARES (Lyon) proposed a protocol?l called NAP (No Administration Protocol) at the IETF, for the self-configuration of IPv6 routers and network. It was the first proposal on the subject. It did not however attract enough attention at that time to warrant the creation of a dedicated work group. The researchers nonetheless continued their work in the framework of research projects with France Telecom and Alcatel. There is little doubt that the topic will soon be hot again at the IETF.

Testing the interoperability of new components
In parallel, in order to ensure a reliable deployment of the network, it is crucial to check that the new IPv6 routers conform to the specifications already defined by the IETF and are compatible with one another, even coming from different manufacturers (Cisco, Hewlett Packard, Samsung,...). To achieve this goal, each new product must be tested for compatibility with the specifications and interoperability.

This is one of the activities of ARMOR research scientists who have been developing protocol testing methods for four years in collaboration with the ETSI (European Telecommunications Standards Institute), the European standardization body in matters of telecommunications. Comparable testing processes developed by American and Japanese departments have been implemented in the United States and in Asia. In order to ensure an international validation process, called the IPv6 ready logo program, the different players regrouped last year at the instigation of the IPv6 Forum, the international organization promoting IPv6. The program is based on three test sessions a year, successively in Japan, the United States and Europe, during which all the companies involved in IPv6 deployment have their components tested. ARMOR was instrumental in launching this worldwide certification program and is its European representative. Eight researchers and engineers are working on it today around César Viho. About a hundred IPv6 components coming mostly from American (Cisco, Microsoft,...) and Asian companies (Samsung...) have obtained this certification. In Europe, the first certified companies are 6wind (FR) and Ericsson (SE).

Improving Internet security and mobility

Several INRIA teams are developing solutions adapted to new Internet uses while ensuring exchange security.

The Internet has been constantly changing since the beginning of the 1990s and its incredible worldwide development. It must continuously face new challenges due to evolutions in the way it is implemented. One of the most worrying of these challenges today is communication security: it relatively easy to spoof the identity of a user and retrieve information in his or her name. Much research is devoted to plugging this hole.

Another concern has assumed growing importance in recent years: the adaptation of the network to ceaselessly growing user mobility. Like cell phones, computers also are becoming more and more mobile. The challenge is to maintain their Internet connection during any kind of journey, either inside the usual connecting network for the device (also called its mother network) or over very large distances, from one country to the next for example. With third generation cell phones, billions of mobile devices will be connected to the network. Mobility also poses specific security problems.
These different aspects are the object of standardization proposals at the IETF (Internet Engineering Task Force), the Internet protocol standardization body, especially in the context of the new IPv6 Internet protocol. Like its predecessor IPv4, this protocol automatically allocates addresses to each machine (the IP address is a long list of numbers), an indispensable open sesame prior to all network communication. In order for users not to have to memorize the IP addresses of the machines, a name is given to each of them. The correspondence between machine addresses and names is stored in a large database called DNS (Domain Name System).

Improving transmission security
However, access to this base is not secure, and for the time being, the identity of each machine is not authenticated. An identity can thus easily be usurped during a connection request and data can be hacked. Researchers from project ARMOR in Rennes have been participating in an IETF work group called DNSext on the topic since 2002. They are proposing several methods to make IP address requests secure.

The IETF is also thinking about a scheme to make Internet communications secure. Researchers from project PLANETE in Grenoble are defending a solution of cryptographic IPv6 addresses called CGA (Cryptographically Generated Addresses), a secure identifier, together with Sun Microsystems researchers. The solution makes it possible for a machine to prove that it is using an address that was allocated to it. There are many applications. The address spoofing problem can thus be solved, as well as the IPv6 mobile connection highjacking problem. The protocol used by the machine to configure its IPv6 address can also be made secure. Such cryptographic addresses are in the process of being standardized by the IET.

The ensuing security mechanism is called HIP (Host Identity Protocol). It introduces a new naming space to securely identify the extremities, i.e. the terminals, in a communication. Concretely, each extremity or machine is allocated a secure identifier obtained from its public key that will then be used by applications in order to identify the extremities during a communication. In this way, such upper layers become independent from the IP (v4 or v6) addresses, and thus from the localization, and use secure identifiers. All this is made possible by the HIP protocol that performs the conversion between identifiers and IP addresses. Research scientists from project RESO (INRIA Rhône Alpes) are participating in this work in the framework of the HIP work group, in collaboration with Sun Microsystems. Project RESO is studying data transmission solutions adapted to computing grids in which hundreds or thousands of computers are pooled together over the network to supply large computing capacities. Security problems are at the forefront of the team's concerns for obvious reasons of confidentiality and protection of interconnected resources. Several of the team's proposals have been accepted and are on the way to being standardized: one of them consists in creating an extension of the DNS database for the HIP protocol, another one in bypassing the DNS. Two implementations have already been completed, one at INRIA and the other one at HIIT, a Finnish research department.

HIPE
HIP separates end-point identifier and location

Taking mobility into account
Concerning mobility, the solution currently favored by the IETF is a protocol called Mobile IP. With Mobile IP, mobile devices have a permanent IP address known to all, and a temporary address in connection with its displacements. All outside communications arrive at the permanent address and are then forwarded to the temporary address. All these exchanges increase the risk of attack through hacking of the signalization messages. Since the beginning of 2000, research scientists of project ARMOR, with contributions from PLANETE, have been supporting a solution to strengthen the security of communications between a mobile device and its mother network at the IETF. This solution makes it possible for a device using the Mobile IPv6 protocol to move without unveiling its permanent IPv6 address. The ARMOR proposition was accepted and standardized since June 2004 under RFC 3776 (Request For Comments). Research and implementation work are continuing.

Another problem linked to Mobile IP protocols is that they process micro and macro-mobility in the same way. A machine must communicate its new temporary address every time it moves, irrespective of how far it moved, even though the majority of displacements are local. Obviously, the resulting quantity of messages generated is likely to crash the network. Researchers from project PLANETE are proposing to adopt a hierarchical approach to the problem. The idea is to maintain the principle of communication with the mother network and the Mobile IP protocol for large displacements, but to manage local mobility without systematically sending the information back to the mother network. An IETF work group was created on the topic in 2000. Among the various solutions proposed, INRIA's was accepted and is now defended by Ericsson. This solution is called HMIPv6 (Hierarchical Mobile IPv6). It uses an internal protocol for local movements that hides them from other users. In august 2005, the solution achieve the experimental RFC status 4140. nother characteristic of the HMIPv6 protocol is that it makes it possible to hide the geographical position of Internet mobile devices. As a matter of fact, it only reveals a global address that supplies very little information on the geographical location of the device. This is a sometimes useful feature.

Casebook standardization

This is the story of researchers who solved the problem of integrating satellites into the Internet infrastructure, created a startup and are now developing new applications of their standard.

In 1996, Eutelsat, a worldwide operator in satellite infrastructure, approached INRIA in order to study the possibility of integrating satellites into the Internet and thus transmit broadband data. Their idea was simple: geostationary satellites put into orbit in view of the deployment of digital TV (using the DVB standard, Digital Video Broadcast) have a huge unused transmission capacity. Why not try and use them just like the fiber optics infrastructure to transmit broadband Internet data (at that time the most powerful modems connected to the telephone network were liberally transmitting 56 kilobits per second)? INRIA research scientists rapidly identified the technological deadlocks—the routing protocols used to transmit the information and data. These protocols work through an exchange of data between routers—the communication nodes—that let them know about the network topology and find transmission paths. This functioning mode assumes a two-way communication: point B is assumed to be reachable by router A if router A is receiving information from B.

Now in the case of a satellite, due to economic reasons, a home user may only receive data: reception antennas are cheap as opposed to transmitter antennas which are very costly. The idea was thus to use satellites to receive broadband data and keep the telephone network and modems to send data, without modifying the functioning of the network. In other words, the point was to integrate satellites using one-way communications into a network, the routing protocols of which assume two-way links.

A routing protocol dedicated to satellites
Eutelsat was convinced that satellites were an interesting solution for broadband data transmission. The company thus funded a study carried out by the team of project RODEO of Sophia Antipolis. A transmitter antenna was purchased and Eutelsat set aside hours of satellite transmission for INRIA during a few years to perform experiments. Within six months, the researchers had devised a first, theoretical solution. The trick was to make believe that the satellite link was two-way.

In December 1996, INRIA created a dedicated work group at the IETF (Internet Engineering Task Force), the Internet protocol standardization body. The group was called UDLR (Unidirectional Link Routing) and co-chaired by Walid Dabbous, Head of project RODEO (renamed PLANETE since 2001) and Yongguang Zhang (Hughes Research Labs). “We needed one year of theoretical discussions before we could even think about the first developments,” remembers Walid Dabbous, “Then our team worked for two years on software, computer boards, experiments and development.” In addition to RODEO members, Japanese researchers from project WIDE (a group of Japanese scientists and industry representatives) participated in the design of the standard.

The protocol, also called UDLR, and the deployment of services were tested on satellite links independently by INRIA and a Japanese team. In April 2001, the researchers' efforts were crowned with success. After four years of validation by the international community, their protocol was recognized by the IETF under RFC 3077 (Request For Comments), an indispensable open sesame prior to any standardization. The protocol is now validated as a “proposed standard.”

UDLR
UDLR protocol dedicated to unidirectional routing

A startup to boot
In June 2000, bolstered by their success, four INRIA researchers-engineers (Emmanuel Duros, Luc Ottavj, Patrick Cipière and Antoine Clerget) together with Didier Tymen launched themselves in industry in order to market and integrate the UDLR protocol into ready for use software. They founded the UDcast company at Sophia Antipolis.
In the meantime, the advent of broadband, especially that of ADSL over phone lines, had considerably diminished the interest of satellite Internet integration. In effect, the ADSL technology is cheaper and offers better performance with shorter transmission delays, which allows for better interactivity. Nonetheless, satellites remain an interesting solution to establishing broadband links in rural areas, or in countries with problematic connectivity even in urban areas. They also make it possible to secure terrestrial corporate networks against natural or criminal risks. Emergency satellite links offer the guarantee of a separate, direct and immune path in case of failure of terrestrial equipment.

Satellites and interactive digital TV
In addition, the UDLR protocol as it was designed is not specifically dedicated to satellite links, but above all to unidirectional routing. Today, it finds an increasing number of applications in entirely different contexts. This is the case for example in terrestrial digital audio streaming or in digital television for portable devices, in particular to make such broadcasted services fully interactive. The UDLR protocol also paradoxically regains interest in terrestrial networks of TV on demand contents distribution over fiber optics, which are characterized by heavy downlink throughput towards the user and light uplink throughput. The protocol makes it possible to practically divide the number of fibers by two and reduce the cost of coupling devices (the interfaces used to send data into the fiber optics).

The excellent health of UDcast is proof of the multiple applications of UDLR. The company now has a staff of 22 with an office in Paris and one in Washington D.C. The company focused on solving performance and security problems concerning the distribution of IP protocols and offers Internet access and application providers products, hardware and software. It has established a strong partnership with Nokia to distribute data or TV on mobile phones using the frequencies of digital terrestrial TV.

Sending data reliably and massively

Two INRIA teams are contributing to group distribution, that is to say sending data to a large number of addressees at the same time.

In certain cases, it is necessary to send the same set of data to millions of addressees over the Internet. It is possible to do this very economically with a single operation on the part of the sender by using what is called multicast distribution. This process is of interest to software publishers to inform their numerous clients that an update is available, or to inform subscribers that such and such new service can now be accessed. Similarly, when several persons wish to participate in a videoconference, the same set of audio or video data must be sent to a large number of addressees.

In order to cut distribution costs down to a minimum, the network itself must be capable of duplicating the information and relaying it to the addressees. The parts of the networks and machines that are not concerned by the distribution must not be congested by traffic that is of no interest to them. In addition, the data must arrive without corruption, which is what specialists call transmission reliability. In the case of one addressee, reliability is ensured by a two-way exchange: the addressee sends back control messages to the source until complete reception of the data.

Similarly, for the time being, most reliable multicast Internet protocols work bidirectionally. However, on a large scale and for a large number of addressees, this solution is not satisfactory. It entails the management of considerable quantity of back messages that flood the source and network. In this case, the solution consists in establishing a unidirectional link, without back route, and to define specific transmission control mechanisms to ensure reliability.

The solution to reliable transfer of large files
A work group called RMT (Reliable Multicast Transport) was created at the IETF (Internet Engineering Task Force), the Internet protocol standardization body, in order to deal with the problem. Members of project PLANETE have been participating in RMT since 2002.
This work group develops a reliable unidirectional protocol for large scale data distribution called ALC (Asynchronous Layered Coding) that makes it possible to avoid data loss problems. The data is sent redundantly until the user terminates the reception once all the data is received.
INRIA researchers participated in the development of an ALC-based solution adapted to the specific problem of file transfers. This solution is called FLUTE (File delivery over unidirectional transport). It explains how to best make use of ALC in order to provide a file distribution service, in particular for “large” files. FLUTE defines the mechanisms necessary to transmit files, transport metadata (information pertaining to the data, such as the name, size, encoding, etc.) and set up filters on the receiving end.
FLUTE was tested and validated in December 2003 with Nokia and the University of Tempere in Finland. It was then standardized by the IETF (RFC 3926, Request For Comments) in October 2004. FLUTE is already a key element in the distribution of multimedia data over radio networks, in the process of being standardized at the 3GPP (3rd Generation Partnership Project). The 3GPP is the organization that proposes standards for GSM, GPRS and UMTS devices, as well as for Internet distribution on telephones and other portable terminals, in the framework of the DVB-H (Digital Video Broadcasting, Handheld) standardization. The DVB-H is an industrial consortium dedicated to standards for digital television on small mobile screens.

Routing protocols adapted to multicast distribution

multicast
Multi-cast routing tree

Multicast distribution poses specific problems from the point of view of routing, either to ensure data transmission quality, or to optimize the transmission. The ARMOR team in Rennes (especially Bernard Cousin) has proposed three multicast protocols at the IETF since 2001. These protocols are compatible with the IPv4 and IPv6 data transmission Internet protocols. The first protocol makes it possible to force distributed packets to go through a specific path in the network, thus avoiding congestion points (something similar to "alternate routes” to go around traffic jams on the road). The second protocol accelerates the distribution of data packets to the addressees by diminishing the load of network equipment. The network equipment located on the path but not in charge of duplicating packets does not have to process these packets. Lastly, the third protocol is specifically intended for small groups. This is typical of the case of networked games where the number of players for each game may be small, but the number of simultaneous games may be very large.

INRIA at the heart of wireless network technology

INRIA researchers were among the first to think up broadband wireless communication networks between mobile computers. They are still on the forefront today in the standardization of the protocols necessary for a new generation of wireless networks, the ad hoc networks.

Let’s go back in time. Before wireless networks, corporate network architectures were especially complex and fixed. Each computer was connected to the others via a quantity of wires managed by a powerful network server. In a second stage around twenty years ago, a revolutionary solution was developed, considerably lighter and economical—Ethernet. A single coaxial cable connected every machine to the server. This time, several communications had to share the cable and computer resources.

A team of a dozen INRIA researchers got interested in the subject of how to improve wire networks and decentralize the management of communications, in order to avoid having to go through this central “policeman” that was the server to go in or out of the network. “We rapidly realized that the last element penalizing these networks was... the wire,” says Philippe Jacquet, Head of project HIPERCOM at Rocquencourt. “We started to think about a high speed wireless Ethernet network, possibly with mobile users. We felt there could be many applications in offices, vehicles...”

As surprising as it may seem today, at that time Ethernet networks were just starting to be deployed and most of the industry was very dubious about the idea of wireless computer networks. Some companies were interested however, such as Dassault for military applications and Apple for personal computers. “We continued our work and participated in the development of an original standard called Hiperlan, defended at the ETSI, the European Telecommunications Standards Institute. Hiperlan brought two concepts together, one developed for Ethernet and the other (a self-routing principle) stemming from the Internet world,” continues Philippe Jacquet.

Hiperlan: a standard ahead of its time
Self-routing between computers was designed for the Internet at the beginning of the 1990s. It looked like a solution to increase the range of wireless network communications. Indeed, quite surprisingly, physics commands that the higher data rate, the shorter range, due to signal scrambling problems caused by the echoes engendered by such obstacles as walls and furniture.

HIPERCOM researchers started to work on this solution in the framework of a 1994-1996 European project with Dassault, Electronica (I), Symbionics (UK) and the Universities of Bradford and Bristol (UK). This concept of distributed network with internal routing was also at the heart of the Hiperlan European standard (cf. What is a wireless telecommunication standard?) defended by several companies including Apple, Motorola, ATT, Canon and Daimler-Benz. Hiperlan was however probably too ambitious. It aimed at transfer rates of 25 Mbits/s. “It was kind of the Concorde of the wireless networks,” remembers Philippe Jacquet with nostalgia. At the time, no company dared develop a product.

Wireless networks, the infamous Wi-Fi, were finally deployed in 1996 according to a competing standard called 802.11, defended at the IEEE by some of the same companies that were working on Hiperlan. The IEEE (Institute of Electrical and Electronics Engineers) edicts international standards, in particular for telecommunications. When Europeans were trying to develop a totally new standard and products, the Americans modified their products, and were able this time to impose industrial solutions more rapidly.

Nonetheless, the competition with Hiperlan certainly influenced the Wi-Fi standard significantly, in particular concerning the type of network created, easy user access and routing mode. Wi-Fi networks are thus designed around a single access point with which several computers can communicate using a very simple distributed protocol, just as Hiperlan proposed, that makes it possible for users to freely enter and leave the network.

Another consequence inherited from Hiperlan concerns the ability of the network to function in ad hoc mode—a mobile network in which the various computers connected at a given moment can serve as relays between one another and automatically route data. This type of functionality had been developed in Hiperlan to alleviate the short range problem due to the high rates envisioned. In the same way, ad hoc networks can be used to extend the range of Wi-Fi networks or to design a wireless network with an off-center access point, as is often the case. Thus, mobiles that are out of range of the access point can use other mobiles as relays to forward their communication to the access point.

Concepts that are now applied to the new wireless networks
In parallel with the development of these radio technologies, the Internet protocols also had to be adapted to computer physical mobility that causes frequent changes in network topology. The interoperability of the various networks had to be ensured as well as their stability in order to avoid interruptions in transmissions, without eating up all the available bandwidth with the traffic necessary to describe this topology. In 1996, the Internet protocol international standardization body, the IETF (Internet Engineering Task Force), launched a work group on the topic called MANET (Mobile Ad hoc Network). “They were interested in the concepts we had developed in Hiperlan, so they quickly got in touch with us,” remembers Philippe Jacquet. “They wanted to propose our OLSR (Optimized Link State Routing) routing protocol, adapted to the Internet and optimized for mobile networks. OLSR was the first protocol that made it possible to experiment on plug and play ad hoc networks.” The protocol was an instant success and was downloaded more than 2,500 times! For the time being, ad hoc networks are primarily of interest to the military, automobile manufacturers and Wi-Fi enthusiasts. The latter are carrying experiments between several buildings in Berlin and in Paris, using INRIA's protocol.

Today, about fifteen researchers keep working on OLSR. The protocol is experimental (RFC 3626, Request For Comments), as are the three other competing protocols at the IETF, all of which are American. “The breakthrough of OLSR at the IETF in 2000 owes a lot to Thomas Clausen, a Danish doctoral candidate who joined us in 2000 and who knew how to communicate adequately in such forums.” OSLR is probably the most implemented protocol presently, with around fifty implementations in France, the United States, Japan, Canada, Norway and Finland, among others, including by several companies such as Boeing and Cisco. Cisco is planning to market hybrid OSLR-based servers within a few months that will be able to operate with or without wires. The ideas developed in OLSR have also been reused in other IETF works, and several standard proposals by large American corporations include OSLR concepts.

Planning for the integration of third generation devices
In addition to all of the above, wireless networks must face new needs, such as transmitting multimedia data to cell phones or PDAs, the so-called 3rd generation devices. The companies and research departments that develop such technology participate in the standardization of such networks within the 3GPP (3rd Generation Partnership Project). The 3GPP was created in 1999 and regroups several organizations such as the ETSI. It defines the standards adapted to mobile telephony: GSM, GPRS and UMTS.

One part of the ARMOR team at Rennes is working on the mechanisms required to manage the displacements of mobile terminals in such a way that the latter may be able to transparently use several types of radio networks (UMTS, Wi-Fi, Wi-Max, and so on). In this way, it will become possible to use multimedia applications—voice over IP, videoconference, video on demand, etc.—through the best communication means available at every instant and every place. The different technologies thus cooperate with each other in order to supply the best possible covering.

Researchers are also developing data compression techniques for efficient transmission of multimedia information on GPRS and UMTS networks. More precisely, they are proposing an IPv6 header compression method at the 3GPP. The method is already standardized at the IETF under the name of ROHC (Robust Header Compression).

What is a wireless telecommunication standard?

By definition, wireless communications imply radio wave transmissions. The radio resource is very much in demand by radio stations, TV channels, satellite links and the military. To develop wireless telecommunication equipment, it is thus compulsory to go through standardization in order to obtain a specific frequency band. The ETSI is in charge of this in Europe, in agreement with the various national jurisdictions, such as the AFNOR, the French standardization body.

Concretely, each standard (GSM, Wi-Fi, Bluetooth, OLSR)must specify the power, modulation mode and protocols to be implemented in dedicated equipment in order to use this frequency and interoperate without interfering with other devices.

No Web without standardization

Several INRIA research projects are developing languages to improve the design and processing of Web documents, not only in terms of structure, but also in terms of meaning, in order to create what is known as the semantic Web. Such work is carried out in the framework of the W3C international consortium.

When the Web got started, the problem of page representation rapidly arose. A standard on the subject called SGML had been published by the ISO (International Standardization Organisation) in 1986. This was a first step toward a structured approach to documents, a logical, explicit organization into chapters, sections, subsections, and so on. Such an organization makes it possible to process the document's contents, for example to search by keyword, and to more easily navigate it.

Researchers of the WAM team (formerly known as OPERA) of INRIA Rhône Alpes had been working on such logical document representations for about ten years. They thus naturally got involved in the work concerning the Web page representation format. In order to experiment on these formats, they had developed a prototype software. “When the W3C (World Wide Web Consortium, in charge of developing Web standards) was set up in 1994,” recalls Vincent Quint, Head of project WAM, “it got immediately interested in our software. The W3C needed such a tool to validate, experiment and demonstrate new Web technology, especially from the point of view of the users who either produce or make use of documents.

Still up to date software
Two team members thus joined the W3C technical team (see box “INRIA, a major W3C player”) to pursue the development of the software, Amaya. Amaya thus progressively integrated the new languages created by the W3C. As time went by, it became an operational authoring tool that not only makes it possible to use the most recent Web technologies, but also to produce complex Web pages containing text, graphics and mathematical expressions, that conform to W3C specifications. A brand new version of this Web editor was released at the end of 2004.

INRIA also widely contributed to developing some of the new formalisms, such as MathML, which is the W3C standard to represent mathematics in XML documents. XML is the new Web page representation format, the successor of HTML. MathML makes it possible for teachers, students, researchers and engineers to put math on their Web pages and to exchange it by email or from one software to another.

MathML is the result of a work group created in 1997 by the W3C. Researchers from project CAFE (formerly SAFIR of Sophia Antipolis) participated in it from the start. In the 1990s, they had already developed a standard in this field called OpenMath based on SGML, in collaboration with other research institutions. Actually, MathML makes it possible to use OpenMath to describe mathematical objects that are more complex than those natively represented in MathML.
The first MathML recommendation was issued in 1999. The second version dates from 2003. A certain number of documents are already using MathML, including the American patents of the US Patents and Trademarks Office. Large scientific publishing houses such as Elsevier and Springer, as well as online education publishers, are expressing interest and will use it as soon as they start encoding their documents in XML.

The multimedia puzzle
Another concern rapidly assumed considerable importance on the Web—the development of multimedia, either static as with images and text, or dynamic with sound and video. Such documents had to be guaranteed to remain usable in such heterogeneous environments as a computer, a cell phone and a TV set. Several W3C work groups were set up to devise solutions. Nabil Layaïda, a project WAM researcher, has been working for several years in one of these groups called “Synchronous multimedia”. The group was created at the beginning of 1997 with the goal of adapting multimedia documents to the Web, defining temporal relations between the different information elements in a document and planning how sound, images and video will fit together within the space of the screen as well as in time. This work group develops a language called SMIL, to which INRIA researchers greatly contributed, especially through concepts developed by Nabil Layaïda during his doctoral thesis. The first SMIL version was standardized in June 1998, the second one in August 2001. Version 2.1 is done since may 2005. The format is already widely used, for example by Realplayer and by MMS multimedia messages that succeed SMS messages in a version adapted to cell phones.

Nabil Layaïda also coordinated the development of software implementing SMIL. One of these Web tools called Limsee has been available since the summer of 2004. It makes it possible to create adaptable multimedia presentations in the SMIL format. Another one called PocketSMIL is dedicated to PDAs and portable devices.

In the same spirit, the W3C created another work group called “Device independence” the goal of which is to make sure that the Web remains independent from the devices used to access it. A doctoral candidate of project WAM, Tayeb Lemlouma, has participated in this group until 2004. The research concern for example the transformation and adaptation of a multimedia document including video, sound and text to a cell phone. The solutions consist in replacing the video by still images, or to restructure the documents to display them sequentially

layercake
The different data levels accessible in the web

When computers start reasoning...
Nonetheless, beyond such needs for document structuring and data processing, we still must face the barrage of information coming from the Web. One of the solutions prepared by the W3C since the end of the 1990s, intends to make document contents more intelligible, to give meaning to the information stored on Web pages in HTML. This is what is called the semantic Web. XML standardization then is a first step: it defines the document and data structure syntax. To access the meaning, semantic Web languages then make it possible to organize and prioritize the concepts used to describe Web resources into ontologies. Ontologies are logical structures that capture a certain number of logical relations between the concepts. Such languages are for example capable of deducing from the fact that “you need a ticket to take a train” and the fact that “the TGV is a train”, that then “you need a ticket to take the TGV”. Ontologies organize the description of concepts, such as “ticket” or “train”. From then on, information search can be carried out intelligently by the computer itself, automatically and without user intervention, simultaneously on several sites and independently of data format. For example, a computer can plan a trip for a given destination involving planes, trains and hotels. In fact, the semantic Web makes it possible for computers to reason, associate neighboring concepts together for a given request, all things that are impossible with today's search engines. The answers will be more precise, more relevant and the information retrieved will be correct.

The first semantic Web language called RDF (Resource Description Framework) was standardized by the W3C in 1999 and 2004, followed by RDF Schema standardized in 2004. RDF allows simple semantic descriptions, and RDF Schema supplies a basic vocabulary to describe the meaning of the concepts used. Two INRIA projects are particularly involved in semantic Web work, EXMO in Grenoble and ACACIA at Sophia Antipolis.

Since 2001, EXMO researchers who had been working on the design of knowledge representation languages, naturally contributed to the “WebOnt” W3C work group dedicated to developing a third semantic language that is more expressive than RDF Schema. This language is called OWL and was standardized in February 2004. It is the first language capable of defining ontologies. Several software packages using OWL are under development; Operational systems are being produced by Hewlett Packard and the Universities of Manchester and Karlsruhe.

From the first applications to tomorrow's search engines
These languages, especially RDF and RDF Schema, are beginning to be used. The main application in the world is called FOAF (Friend Of A Friend). It was created to connect people, create acquaintance networks and partnerships. Everyone describes his or her profile (name, email, interest, profession, etc.) and the computer does the rest. The semantic Web is also of interest to companies to manage their knowledge.

To ensure the development of RDF and OWL, the W3C launched a work group to define the “Semantic Web Best Practices” in 2004. The goal of the group is to define the languages, to offer methodological elements, answer utilization questions and provide pedagogical material. Fabien Gandon of project ACACIA participates in the work group. Project ACACIA is interested in methods and tools for knowledge management. Since their goal is to ensure the interoperability between different solutions, their work is also in the context of the semantic Web. A platform called CORESE has been developed since 1999 that makes it possible to design servers dedicated to the semantic Web. These servers are based on a search engine that exploits descriptions of the semantic contents of documents. CORESE implements a translator to read and produce RDF descriptions by interpreting them in the conceptual graph formalism, a method to represent knowledge and reasoning that benefits from twenty years of research. The CORESE platform is available on the Web.

Finally, for the semantic Websemantic Web to really be operational, especially for search engines, request languages for RDF and OWL must also be designed, in order for example to simultaneously exploit two different ontologies and make sure they are interoperable. A W3C work group called “RDF Data Access Group” is devoted to the problem. Researchers from project EXMO belong to this group. It is in this spirit that Olivier Corby of project ACACIA is evaluating the performance of the request language he designed for CORESE.

INRIA, major W3C player

INRIA has been one of the pillars of the W3C (World wide web consortium), an international consortium that ensures the development and promotion of Web standards. The institute was the first European host site, from 1995 to 2002, along with MIT (Massachusetts institute of technology) for the American continent and the University of Keio in Japan for Asia. Since 2003, ERCIM (European Research Group for Computer Science and Mathematics) has taken the relay from INRIA in Europe.

During these 8 years, 20 people from INRIA participated in the W3C technical team (which counted some 60 members in all). Jean-François Abramatic moreover presided over the consortium for 4 years until 2001. Vincent Quint was responsible for one of the four W3C technical fields, that is, the format of documents used on the Web and user interfaces. "This was above all a guiding role, he explained. This involves keeping an open mind to needs, coordinating efforts, suggesting the creation of working groups and ensuring the participation of researchers and industrial companies."

Currently, 6 or 7 INRIA researchers participate in working groups. Vincent Quint, director of research for INRIA, has been co-chair of the "Technical Architecture Group" (TAG) for W3C since February 1, 2005.

International standardization recognition in the field of natural languages

Following is how the LORIA (Lorraine Research Laboratory in Computer Science and its Applications), a joint department of INRIA, CNRS and the three Nancy universities, became a national and now international reference site in the field natural language computer processing, and especially in standardization, which is the work topic of a dozen researchers.

Linguistic computing was invented practically at the same time as computer science. Right from the start, the idea appeared of using computers for automatic translation, a feat however still out of reach today. The deployment of the Internet and the multiplication of electronic documents then only increased opportunities and needs in indexing, classification, text search and dialog transcription in any language and independently of their evolution over time.

The whole difficulty is to achieve generic solutions that are applicable on an international scale, and that can be specifically parametrized for a given language or a special need. In order to get an idea of the complexity of this task, it is enough to take the example of a lexicon (a dictionary) and the notion of adjective with all possible declension forms: in French, you will have the “masculine” and “feminine” genders and the “singular” and “plural” numbers, whereas in Japanese you will also have to consider the possibility of negation and tense agreement. It thus appeared clearly since the beginning of the 1990s that the only solution to achieve a perpetual management of the world's linguistic resources was to go through standardization.

A way of speaking the same language
The first international standard in the field concerned terminology, that is to say the vocabulary that is specific to this and that industry, science or institution. The ISO, the International Organization for Standardization, has been thinking about this problem since its inception in 1947, within a dedicated technical committee (TC 37). Indeed, by definition, any standard uses a specific terminology and its multiple translations. It is also easy to understand the interest of such an approach in view of the 360.000 pages of European Community institutions texts that now have to be translated into 25 languages... a titanic task.
“The ISO called on us in 2000 on the basis of our work in linguistic modeling, in order to try and find a standardization solution in terminology,” says Laurent Romary, Head of project Langue et Dialogue at LORIA. “At the time, two standards were in competition, an American and a European one. Using our more abstract approach, we were able to unify the two within a common specification platform, which is now a reference.”Laurent Romary is the editor of this standard (ISO 16642, or TMF for Terminological Markup Framework) that was published in 2003, only three years after his first dealings with the ISO.

The interest of a standardized platform is that it makes it possible for a given user, say a company, to create their own terminological format, specific to their activity, and to exchange documents on this basis with for example subcontractors, service providers or clients who adopt the same format—in a sense it provides a way of speaking the same language. The ISO 16642 standard was very successful and has already been implemented many times, including by IBM and by Daimler-Benz. A white paper published in 2005 describes how to use it with concrete application recommendations. This work will result from a collaboration between LORIA, such industry partners as EDF and EADS and institutional partners such as the INIST (Institute for Scientific and Technical Information, the CNRS information center), in the framework of a national INRIA research and development initiative called SYNTAX

Dealing with large volumes of electronic documents
Another international organization took an interest in INRIA work, the TEI (Text Encoding Initiative) consortium. The TEI is a gathering of international institutional partners interested in managing large quantities of electronic archives. The consortium was created in 1987 for purposes of defining perpetual text formats for libraries, universities, museums, publishers and so on. Given the quality of the work carried out by the consortium, INRIA first took inspiration from its representation directives to publish its own written documents or dialog transcriptions. “We progressively integrated our own tools to annotate the texts and prioritize the information, etc.” says Laurent Romary. “Our work then attracted interest on the part of the consortium and in 2000 we were asked to participate in its Scientific Board, which we still do today.”
What is even more rewarding is that the LORIA and two CNRS units, the INIST and the ATILF (Computer Analysis and Processing of the French Language), constitute one of the four TEI host sites, next to the University of Virginia and the University of Providence in the United States, and Oxford University in Great Britain. Nancy research scientists will bring their skills in data modeling to define more generic text formats. On the national scale, this collaboration also makes it possible to deploy TEI standards in specific contexts, for example to standardize gray literature (scientific production, activity reports, doctoral dissertations, and so on).

Standardizing lexicons and contents
There is thus nothing surprising for a LORIA researcher, Laurent Romary, to be called on to chair the new ISO subcommittee dedicated to the standardization of linguistic resources that was founded in 2002. Its objective is to standardize all the information necessary for linguistic engineering, for example for orthographic or grammatical correcting, automatic translation, information extraction, etc. In this framework, a team associating INRIA (Gil Francopoulo) and the U.S. Department of Defense is developing a standardized platform called LMF (Lexical Markup Framework), this time to represent wide spectrum lexicons rather than terminologies. They plan to co-publish a standard that models the representations associated with words. The work involves in France a national network of more than fifty industry and institutional contributors.

The subcommittee is also working on another standard format to represent contents called MLIF (Multilingual Information Framework). The standard would be adapted for example to translation memories (typical phrases created by translators when frequently occurring), the localization of certain key messages in software, DVD subtitling, among others.

Software that is as usable as possible

An INRIA research team (MERLIN) has been participating for many years in standardization activities in software ergonomics.

The numerous benefits of computers and more generally of information and communications sciences must not overshadow the fact the primarily concerned party are... the users. A computer system, as significant and advanced as it may be, offers little interest if it does not obey certain criteria of usefulness, health and security, but also comfort and user-friendliness, which all are usability characteristics that must be taken into account from the design stage.

Computer technology has now invaded the private and professional spheres—work, transportation, services, leisure. They are part of all aspects of life, especially in developed countries. It is thus not surprising that the ergonomics of such systems is becoming a growing concern. This means better understanding human-machine interaction in order to improve user comfort and security as well as system efficiency.

Computer ergonomics has been taken into account by the ISO, the international standardization organization, and its national authorities such as the Afnor in France (French Standardization Association), since 1980. The software aspects (design process, dialog techniques, multimedia,...) are concerned, as well as the ergonomics of the work station, its surroundings and of the hardware. Computer ergonomics problems are even increasing. The new usages and issues of information technology require adaptation. In fact, more and more experts are appointed to participate in the ISO international meetings. Governments, institutions and software publishers are increasingly concerned.

“The topics we are working on now concern the Web and graphical interfaces, virtual reality, for example,” explains Dominique Scapin, Scientific Head of project MERLIN (INRIA Rocquencourt and Lorraine) that has been contributing to software ergonomics standardization since 1988. In France, twenty-five standards and specifications concern computing ergonomics, and fifteen are more specific to software ergonomics. MERLIN researchers published many articles and manuals for software designers. Recently, Dominique Scapin who organizes the “Software Ergonomics” group of the AFNOR, coordinated the publication of the first AFNOR compendium of standards, specifically on computer ergonomics.

New usages, new populations
In practice, what is meant by software ergonomics? In fact, there are many facets to this issue. For example, design processes are concerned in order to help take into account user characteristics (correctly taking into account the usage context and user experience and knowledge). The best practices for dialog between user and software interface are also defined (degree of adaptation, conformity to user expectations, easy learning, type of interactive menus, etc.). Rules to present information and guide users right are proposed. Methods to evaluation software usability are also defined. All these topics are at the heart of project MERLIN research work, with the goal of improving the ergonomic quality of interactive software. On the one hand, the team works on integrating ergonomics results into new software design, and on the other hand, it is interested in new computing applications such as multimedia and new populations of users.

This preoccupation actually concerns a large number of research works in the world, especially about the concept of accessibility, that is to say adapting computer systems to the largest number of individuals, irrespective of their capabilities, or sensory, motor or cognitive impairments (see box). Among other things, this includes dealing with an aging population. A law on this subject was voted in France in July 2004 by the Senate. Several laws were also adopted in the United States and European directives are in preparation. Four standards are being finalized in Japan. Consistently with these reflections and upcoming laws, the ISO is working on a technical specification on the subject. A first standard (ISO/TS 16071) was published in 2003. Its goal is to guide developers in human-machines interface design that propose the highest possible level of accessibility.

" Our team is working in all these areas ", says Dominique Scapin, “especially in the new areas of knowledge (for example, understanding the problems raised by Web software or virtual reality software) and on interactive software design and evaluation methodology.” Even if it is difficult to ascertain the importance of such and such contribution, this research has a recognized impact on the publication of standards in this field.

Accessibility: everyone can be concerned

Accessibility is a concept that encompasses the differences in capabilities dues to age, illness or handicap. It concerns persons who suffer from physical, sensory or cognitive deficiencies from birth or acquired during their life, elderly persons who may benefit from new products and services but who have reduced physical, sensory and cognitive capabilities, persons with a temporary impairment, for example someone with a broken arm or people who have forgotten their glasses, and persons faced with difficulties in certain situations, for example a person working in a noisy environment or whose two hands are occupied by another task.

Software gets adapted to the complexity of new usages

In the last ten years, free (de facto standards) or standardized software technology has changed significantly. Such approaches as component models and model engineering now are the core of much research, especially at INRIA.

Software design and everything that goes with it (development tools, assembly, tests, deployment, execution, administration, and so on), which is globally referred to as software engineering, is changing following the increasing complexity of developed systems. Such systems are more and more often embedded, distributed over several machines and work in real time. It is now commonplace for a company to develop software with several million lines of code. On the other hand, productivity, quality and flexibility demands have continuously risen. All these changes entail a definite need for automation.

The main current developments in software engineering partly stem from the deployment of object technologies, which are now widespread in computer system analysis, design and implementation. In 1991, the OMG (Object Management Group), the standardization organization dedicated to such technology, defined a reference software architecture for distributed applications—several components that cooperate over several sites—using object technology. This architecture called Corba makes it possible for applications developed in different languages to communicate with each other, even if they are not on the same computer. Corba is a kind of middleware, a class of software that takes charge of intermediate functions between data transport and the applications, such as communication, task allocation functions and such properties as security. This standard is widely used in industry, especially for corporate server information systems and in command and control military applications, as well as for certain applications in industry and telecommunications.

Automatically deploying software for distributed applications
Being interested in the design of complex distributed applications with specific constraints, research scientists of project JACQUARD (INRIA Futurs) rapidly took an interest in Corba. In order to solve certain problems in the use of the software, they developed a scripting language for Corba (a programming language with specific properties) in the context of Philippe Merle's doctoral thesis. Thus, in 1997 when the OMG issued a call for proposals on the topic, the team proposed its solution called Corba Scripting Language, which was accepted in 2000. Researchers got familiar with the functioning of the OMG on this occasion and followed the work carried out in the other work groups, concerning in particular models based on software components, an approach initiated by the OMG in 1997 that is especially adapted to distributed applications. As a matter of fact, object models make it possible to build software but cannot for example automatically deploy software over a set of machines, as is necessary in distributed applications. Component based models transparently support this functionality. JACQUARD researchers soon joined the dedicated work group and finally took the helm thereof in 2000. The Corba standard integrating the component model was issued in 2003.

Up to 2002 they were simultaneously experimenting on component models in their own prototype software, a platform called OpenCCM developed in the framework of the ObjectWeb consortium —a consortium created by INRIA, France Telecom and Bull to develop freely available, quality middleware. Research and development was then continued in the framework of national (RNTL IMPACT, RNRT COMPiTV, ACI GRID RMI) and European projects (IST COACH, ITEA OSMOSE), especially with Thales and a Greek company called Intracom. OpenCCM has been freely available over the Internet since 2003. It was downloaded more than 2,500 times and is one of the two OMG reference platforms to implement the OMG component model standard.

At the beginning of the 1990s, no less than about fifty object modeling methods had been developed. At the end of 1997, in order to maintain a certain uniformity, the OMG standardized the first unified object modeling formalism, UML (Unified Modeling Language). Industry rapidly adopted this code generation technique from models, an approach based on graphical representations. Each company developed its UML variant, dedicated to its applications. These are called dedicated languages and UML has become one dedicated language among others. Currently, the two main players in the field are IBM and Microsoft. Both have developed their own implementation platform and associated tools: Eclipse for IBM and Visual Studio for Microsoft.

mda
The MDA approach

The ambitious concepts of model engineering
One of the solutions that could make it possible to harmonize such slightly anarchic developments and ensure the compatibility and interoperability of these dedicated languages is model engineering. This software design methodology is the object of much research throughout the world. It was accepted by the OMG in November 2000 as a major approach called MDA™ (Model Driven Architecture). The objectives are multiple and ambitious: to automatically generate code, to easily and automatically pass from one model to another, from an abstraction level to another, or from a given application to another in order to automatically generate software component tests, for example. The approach is based on several already established technical standards: such languages as UML, modeling languages such as MOF (Meta Object Facility) and above all model transformation languages such as QVT (Query View Transformation), which is in the process of being standardized. QVT is a model transformation language specification that makes it possible to adapt a model to new constraints, and to transform any dedicated language into another. This language lies at the heart of the success of the MDA approach. The OMG is planning to standardize QVT in december 2005.

Two INRIA teams, TRISKELL in Rennes and ATLAS in Nantes, are carrying out research in model engineering. They explored two different and complementary directions concerning model transformation to adapt it to the users, and developed two model transformation languages, which are in a way QVT implementations: ATL (ATLAS Transformation Language) by ATLAS and MTL (Model Transformation Language) by TRISKELL.

ATL is a model transformation language intended for data engineering, in other words for information systems and database management systems. In effect, the size of such systems keeps growing, the data they contain is increasingly heterogeneous and complex, and the applications making use of them more and more varied. ATL stems from research carried out in collaboration by INRIA, the University of Nantes and company located in Brest, TNI-Software. ATL and its programming environment on IBM's Eclipse platform have been officially available as free software since October 2004, on the Generative Model Transformer site. ATL was awarded the IBM Eclipse Innovation Prize that same year. Simultaneously, the team is implementing the same principles following Microsoft's approach, in the framework of a research agreement, and is extending the functionalities of the operators beyond transformation by taking into account model weaving for example. This work is mainly done in industrial collaboration projects, such as the ModelWare European project steered by Thales. Close partnerships with other French companies, such as the Sodifrance group, made it possible to develop other tools that complement ATL in the framework of a research platform in model engineering (AMMA).

MTL is a generic object language, a basic building block for developing frameworks (a kind of application sketch) of model transformation. TRISKELL is interested in frameworks allowing for powerful semantic operations based on algebraic operators. MTL is for example especially well adapted to the control of software components, with the goal of ensuring the reliability of their assembly in the context of distributed applications. The language makes it possible to automatically generate software component tests. It is another way of taking advantage of model engineering, by changing representation mode. As a matter of fact, testing software components and verifying their reliability before assembly has become a sensitive and often very costly step before the deployment of an application. TRISKELL researchers had already worked on the topic in the framework of a national project with France Telecom, Gemplus and Softeam to define test techniques in UML. MTL is available online as free software (modelware.inria.fr).

Proposing solutions for different application areas
In addition, distributed applications are more and more embedded and in real time as in cell phones or car and aircraft systems. To model such complex embedded systems, AOSTE researchers (Sophia Antipolis) have been developing for twenty years so-called reactive synchronous formalisms, called RTE-ML (Real Time Embedded-Meta Language) such as Esterel and the SyncCharts. In the final analysis, such languages are rather close to UML state and activity behavioral diagrams that are indispensable in the context of distributed applications. The approach was of immediate interest to the companies that were developing UML for embedded systems, led by Thales. In 2001, AOSTE researchers designed an UML-compatible RTE-ML profile in the framework of a European project with Thales, Esterel Technologies (an INRIA startup), Nokia and Finnish universities.

ATLAS researchers are also interested in a particular application of model engineering: finding generic solutions to automatically renew the old computer systems programmed several dozens of years ago (for example in Cobol, PL/1 or Ada) in order to adapt them to modern technological environments. The OMG launched an initiative of the subject in 2003 to solve this large scale industrial problem, the importance of which was already highlighted on the occasion of the year 2000 and that of switching to the euro. The AMMA model engineering experimental platform with its various components (transformation, weaving, projection, global management of models) is the basis of such reverse-engineering for information systems which often entails hard data engineering problems. Certain tools developed by ATLAS should be directly applicable to this especially important industrial problem.Carroll: INRIA, a partner that cannot be ignored in software engineering

In March 2003, in the framework of the development of their electronic defense systems (sea, land and air), Thales initiated a three year partnership called Carroll with the CEA and INRIA in the fields of model software engineering and middleware technology. INRIA naturally contributes to this partnership due to its recognized competence in the fields of embedded, real time computing, distributed applications and middleware. INRIA projects ATLAS, TRISKELL and AOSTE initiated three of the four Carroll projects. In particular, ATLAS and TRISKELL are contributing to the development of a programming language compatible with QVT. In addition, TRISKELL develops software component test automatic generation methods, and AOSTE researchers aim at standardizing a solution stemming from their RTE-ML profile at the OMG, that is adapted to embedded, real time systems. They proposed a Request for Proposals (RFP) on the needs in the field voted on in January 2005. They should be in a favorable position to answer this RFP based on their previous work.

Carroll: INRIA, a partner that cannot be ignored in software engineering

In March 2003, in the framework of the development of their electronic defense systems (sea, land and air), Thales initiated a three year partnership called Carroll with the CEA and INRIA in the fields of model software engineering and middleware technology. INRIA naturally contributes to this partnership due to its recognized competence in the fields of embedded, real time computing, distributed applications and middleware. INRIA projects ATLAS, TRISKELL and AOSTE initiated three of the four Carroll projects. In particular, ATLAS and TRISKELL are contributing to the development of a programming language compatible with QVT. In addition, TRISKELL develops software component test automatic generation methods, and AOSTE researchers aim at standardizing a solution stemming from their RTE-ML profile at the OMG, that is adapted to embedded, real time systems. They proposed a Request for Proposals (RFP) on the needs in the field voted on in January 2005. They should be in a favorable position to answer this RFP based on their previous work.

A standard for computation consumers

It is easy to forget but numerical computation is one of the basic building blocks of computing. The precision of the operators and programs, their reliability and their speed are at the heart of a large number of more or less critical industrial applications.

It is enough to remember a few of the most famous computer errors of the past years, such as the Pentium bug in 1994, or the explosion of the first Ariane 5 rocket in 1996, to realize how reliability and precision are crucial in machine computations. The primary culprits of these infamous failures are microprocessors and the so-called numerical algorithms that use floating point arithmetic. Floating point arithmetic is the format implemented in computers to approximately represent real numbers (see box).

At the end of the 1970s, engineers and researchers were able to impose a uniform standardized solution, in spite of divergent commercial interests. At that time, each manufacturer (Digital, Cray, Hewlett Packard,...) had their own floating point number representation, but certain choices were only meant to speed up computations, maybe with less transistors, even if that meant producing notoriously false results or go contrary to programming usages. Even the simplest arithmetic operations led to variable results from one machine to the next. When personal computers appeared, the idea of standardizing floating point arithmetic had gained ground. A newcomer in the field, Intel, wanted to carve out a place on a market in which it had no legitimacy and chose to rely on the work of an influential scientist, William Kahan, Professor at the University of California at Berkeley. The rest is history. Kahan's ideas strongly influenced the topic and he then was awarded the Turing Award for his contributions. Almost all of his recommendations have been accepted by the IEEE (Institute of Electrical and Electronic Engineers), the learned society that is an authority in the fields of computer engineering. The IEEE 754 standard was thus adopted in 1985 by the ANSI (American National Standards Institute). The standard precisely defines the correct round-off for the four arithmetical operations (addition, subtraction, multiplication and division), and for the square root. Practically all current computers obey this standard, even if certain functionalities are only implemented via libraries during execution or compilation.

The best way to be at the heart of a topic
Project ARENAIRE of INRIA Rhône Alpes has been working in this field since its inception in 1999, and completely integrates the standardization approach. Its researchers are naturally involved in the ongoing discussions concerning the revision of the IEEE 754 standard. “Our technical ideas and our research work may thus be taken into account in the future standard,” explains Marc Daumas, a CNRS researchers working in project ARENAIRE. “However, for us, to participate is first of all a way of exchanging ideas with the best specialists worldwide, to establish links that will then have scientific consequences, to see where the wind blows, which all are plusses to advisedly develop our ideas.” As a matter of fact, the functioning of this IEEE technical committee is especially pragmatic. The standard revision committee of about twenty persons meets every month in California, and for more efficiency, a subcommittee works weekly to prepare the proposals to be discussed. ARENAIRE members visit two or three times a year and then pursue discussions remotely. In addition to INRIA, only a few British and Israeli scientists participate in a limited fashion in these meetings, which are open and free of access.

“In this field as in many others, there is no point in developing cool techniques in your own little corner,” says Marc Daumas. “You have to take into account the experience of scientists, but also the point of view of industry, and interferences with the work carried out by other standardization committees.” Hence the interest of being at the heart of these expert exchanges, which are at the same time a source of inspiration for research, and a guarantee of quality for the work. It is however difficult to isolate direct scientific consequences. One thing is certain, project ARENAIRE is reputed in its field. ARENAIRE researchers worked for example with Aerospatiale on the choice of floating point use for the Airbus A380. Other researchers from the project developed complex algorithms to reduce division computing times for STMicroelectronics. Since 2002, they have been developing a floating point library for another ST processor.

Multiple positive consequences
The consequences are sometimes surprising. Thus, following a consensus reached between several members of the technical committee in 2004, it was decided to start working on integrating arbitrary precision floating point arithmetic into the future standard. This is a field in which another INRIA team, SPACES of Nancy, has unique know-how. This multiprecision mode of computation is more precise than floating point computation, but also slower. The SPACES team developed software called MPFR (MultiPrecision Floating-point arithmetic with exact Rounding) that could be compatible with the IEEE standard when it is completed, for a very reasonable overhead in terms of computing time. The experience acquired by SPACES clearly sets it ahead of the competition, including the experts of the standard revision committee.
Another key area to improve computation precision is the certification of programs and algorithms with automatic tools based on formal proofs. Such techniques are beginning to impose themselves in cases where human lives might be threatened by malfunctioning. They also are increasingly interesting for industry because of their growing power and automation. Due to ARENAIRE developments, such techniques can now be applied to programs that resort to numerical computations. The objective of this approach is to validate the programs and make sure that their implementation is correct. In 2000, INRIA launched a cooperative research initiative on the subject called AOC (Certified Computer Arithmetic) involving several projects (LEMME, ARENAIRE and SPACES). INRIA has long been developing a proof assistant software called Coq in the context of formal methods, that checks the detail of proofs. Since 2002, ARENAIRE researchers have been working on these topics in collaboration with Nasa researchers of the Langley Research Center and of the National Institute of Aerospace (Virginia). Historically, Nasa is strongly supportive of another formal proof assistant called PVS, which is widely used in the United States and in industry. Floating point arithmetic is used in a project to automate small airport air control. Due to a collaboration funded by CNRS and Nasa, there will soon be a joint Coq and PVS specification, which will ipso facto become a primary world standard. Even if this specification has little chance of being accepted by the IEEE, because it is expressed in a formalism that most standard revision committee experts do not master, its noted contributions once more make the Institute work better known.

Processing very large and very small numbers at the same time, using floating point arithmetic

To represent real numbers, which are not integers, most computer systems use a so-called floating point format. This format has been used for decades in scientific computing. It consists in defining a real number with a triple: its sign, mantissa and exponent. Changing the latter causes the decimal point to “float”. This representation is different from the fixed point representation in which the exponent is fixed.

Thus, in base 10, the electron mass is 9.109381 x 10-31 kg, that is to say a plus sign, a mantissa of 9.109381 and an exponent of -31.

An open standard for the development of algorithms

On August 9 2005, INRIA (host of the Scilab consortium), Maplesoft, Mathsoft and National Instruments (Nasdaq : NATI) announced the creation of the Numerical Mathematics Consortium (NMC). Associated with participants from the industrial and academic world, these mathematical software publishers joined forces to establish consistent and concrete bases for numerical programming. The initial goal of the consortium was the establishment of an open standard for the semantics of mathematical functions used to develop algorithms for use in a wide range of disciplines and in various hardware and software environments.

« There has long been a need in our industry for a unified and standardised base, explains Ali Maleki, Director of the Brake and Chassis Electronics programme for ArvinMeritor. Today, each tool has its own function set, often requiring extensive training, which has led us to develop algorithms and know-how that is not easily transferable to the rest of the industry. We must rewrite these algorithms for new projects or when implementing new technologies, which leads to additional costs. A standard set of mathematical functions based on semantics accepted by the industry would be a great leap forward in the creation of transferable techniques and ready-to-use libraries and tools that can be used instantly in a number of environments, thus resulting in financial savings. »

The purpose of this organisation is to create specifications to define the mathematical functions the most frequently used in numerical algorithms. These algorithms can then be integrated in applications specific to a number of activity fields such as industrial control, the development of embedded software and numerous research fields. They can therefore be easily transferred between researchers and engineers in industrial and academic fields.

« By using the industrial standards developed by the Numerical Mathematics Consortium, students can create algorithms compatible with functions shared by traditional tools and be assured that their work will be correctly integrated in other mathematical environments, states Robert H. Bishop, Professor and President of the Aerospatial Engineering and Mechanics Department for the University of Texas. Furthermore, with a proven standard for numerical mathematics, I can be sure that my students will be trained on tools and approaches that they will encounter in the industry. »

The Numerical Mathematics Consortium will create a mathematical community within which the exchange of ideas and information will be facilitated by a shared vocabulary.

The NMC in brief

The Numerical Mathematics Consortium is a non-profit organisation made up of industrial companies and persons from the industrial sector and universities with a view to defining an open standard for the semantics of mathematical functions used in the development of numerical algorithms.

The main objective of the Consortium is to reduce the global costs of developing algorithms and facilitating their use in a wide range of disciplines and in various hardware and software environments.

Increasing the level of security in the world of smart cards

Promoting the use of formal methods for software security, in particular within the context of small secure portable objects and ubiquitous computing, is the main goal of the INRIA EVEREST research project.

PUCE

© GlobalPlatform

By offering Globalplatform, world leader for the development and deployment of secure smart card solutions, a formal model for the GlobalPlatform Card Specification v2.1.1 specification, the research project provides an IT solution designed to reinforce the security assessment of industrial products, accelerate the process for compliance testing and facilitate the development of subsequent versions of GlobalPlatform products. Boasting 57 members that include Visa and Mastercard International, IBM, Hitachi, Thales, STMicroelectronics, Sun, Gemplus, etc., the representatives of this consortium stress the added value of this type of model that provides « a rigorous and unambiguous specification » for those wishing to implement the v2.1.1.
Formal verification techniques will thus contribute to a deeper understanding of complex security architectures that are widely used in the industry. Emerging from fundamental research, it will also increase confidence in the solutions implemented by encouraging adoption of the most pertinent standards.

Valid XHTML 1.0! Valid CSS!   © INRIA - 2005.