GSM 03.40


GSM 03.40 or 3GPP TS 23.040 is a mobile telephony standard describing the format of the Transfer Protocol Data Units part of the Short Message Transfer Protocol used in the GSM networks to carry Short Messages. This format is used throughout the whole transfer of the message in the GSM mobile network. In contrast, application servers use different protocols, like Short Message Peer-to-Peer or Universal Computer Protocol, to exchange messages between them and the Short Message service center.
GSM 03.40 is the original name of the standard. Since 1999 has been developed by the 3GPP under the name 3GPP TS 23.040. However, the original name is often used to refer even to the 3GPP document.

Usage

The GSM 03.40 TPDUs are used to carry messages between the Mobile Station and Mobile Switching Centre using the Short Message Relay Protocol, while between MSC and Short Message Service Centre the TPDUs are carried as a parameter of a Mobile Application Part package.
In emerging networks which use IP Multimedia Subsystem Short Messages are carried in the MESSAGE command of Session Initiation Protocol. Even in these IP-based networks an option exists which defines transfer of Short Messages in the GSM 03.40 format embedded in 3GPP 24.011 as Content-Type: application/vnd.3gpp.sms.

TPDU Types

GSM 03.40 defines six types of messages between Mobile Station and SMS Center, which are distinguished by the message direction and the two least significant bits in the first octet of SM-TP message :
TP-MTIdirectionmessage type
0 0MS → SCSMS-DELIVER-REPORT
0 0SC → MSSMS-DELIVER
0 1MS → SCSMS-SUBMIT
0 1SC → MSSMS-SUBMIT-REPORT
1 0MS → SCSMS-COMMAND
1 0SC → MSSMS-STATUS-REPORT
1 1anyReserved

SMS-SUBMIT is used to submit a short message from a mobile phone to a short message service centre.
SMS-SUBMIT-REPORT is an acknowledgement to the SMS-SUBMIT; a success means that the message was stored in the SMSC, a failure means that the message was rejected by the SMSC.
SMS-COMMAND may be used to query for a message buffered in the SMSC, to modify its parameters or to delete it.
SMS-DELIVER is used to deliver a message from SMSC to a mobile phone. The acknowledgement returned by the mobile phone may optionally contain a SMS-DELIVER-REPORT. When home routing applies, SMS-DELIVER is used to submit messages from an SMSC to another one.
SMS-STATUS-REPORT may be sent by the SMSC to inform the originating mobile phone about the final outcome of the message delivery or to reply to a SMS-COMMAND.

TPDU Fields

The fields of SM-TP messages, including their order and size, are summarized in the following table, where M means a mandatory field, O an optional field, E is used for fields which are mandatory in negative responses and not present in positive responses, x is a field present elsewhere:
The first octet of the TPDU contains various flags including the TP-MTI field described above:
bitMeaning
1-0TP-Message-Type-Indicator
2TP-More-Messages-to-Send in SMS-DELIVER
2TP-Reject-Duplicates in SMS-SUBMIT
3TP-Loop-Prevention in SMS-DELIVER and SMS-STATUS-REPORT
4-3TP-Validity-Period-Format in SMS-SUBMIT
5TP-Status-Report-Indication in SMS-DELIVER
5TP-Status-Report-Request in SMS-SUBMIT and SMS-COMMAND
5TP-Status-Report-Qualifier in SMS-STATUS-REPORT
6TP-User-Data-Header-Indicator
7TP-Reply-Path in SMS-DELIVER and SMS-SUBMIT

By setting the TP-More-Messages-to-Send bit to 0, the SMSC signals it has more messages for the recipient. The MSC usually does not close the connection to the mobile phone and does not end the MAP dialogue with the SMSC, which allows faster delivery of subsequent messages or message segments. If by coincidence the further messages vanish from the SMSC in the meantime, the SMSC terminates the MAP dialogue with a MAP Abort message.
The TP-Loop-Prevention bit is designed to prevent looping of SMS-DELIVER or SMS-STATUS-REPORT messages routed to a different address than is their destination address or generated by an application. Such message may be sent only if the original message had this flag cleared and the new message must be sent with the flag set.
By setting the TP-Status-Report-Indication bit to 1, the SMSC requests a status report to be returned to the SME.
By setting the TP-Status-Report-Request bit to 1 in a SMS-SUBMIT or SMS-COMMAND, the mobile phone requests a status report to be returned by the SMSC.
When the TP-SRQ has value of 1 in an SMS-STATUS-REPORT message, the message is the result of an SMS-COMMAND; otherwise it is a result of an SMS-SUBMIT.
When TP-UDHI has value 1, the TP-UD field starts with User Data Header.
Setting the TP-RP bits turns on a feature which allows to send a reply for a message using the same path as the original message. If the originator and the recipient home networks differ, the reply would go through another SMSC then usually. The mobile operator must take special measures to charge such messages.
Both SM-RP and MAP used to transmit GSM 03.40 TPDU carry enough information to return acknowledgement—the information whether a request was successful or not. However, a GSM 03.40 TPDU may be included in the acknowledgement to carry even more information. The GSM 03.40 has undergone the following development:
  • Up to GSM 03.40 5.2.0 SMS-DELIVER-REPORT and SMS-SUBMIT-REPORT was sent only in the case of an error. Since 5.3.0 they are sent in case of success as well. MO-ForwardSM-Res was introduced back in GSM 09.02 5.6.0 August 1997
  • Up to GSM 03.40 6.0.0 SMS-DELIVER-REPORT and SMS-SUBMIT-REPORT sent in case of an error contained only TP-MTI and TP-FCS fields and the last field in SMS-STATUS-REPORT was TP-ST. Since version 6.1.0 these TPDUs has format shown in the table above.
Although these changes are ancient, old formats of MAP are frequently seen even in today's networks.

Message Content

The content of the message is carried in the TP-UD field. Its size may be up to 160 × 7 = 140 × 8 = 1120 bits. Longer messages can be split into multiple parts and sent as a Concatenated SMS. The length of message content is given in the TP-UDL field. When the message encoding is GSM 7-bit default alphabet, the TP-UDL gives length of TP-UD in 7-bit units; otherwise TP-UDL gives length of the TP-UD in octets.
When TP-UDHI is 1, the TP-UD starts with User Data Header ; in this case the first octet of the TP-UD is User Data Header Length octet, containing the length of the UDH in octets without UDHL itself. UDH eats room from the TP-UD field. When the message encoding is GSM 7-bit default alphabet and a UDH is present, fill bits are inserted to align start of the first character of the text after UDH with septet boundary. This behaviour was designed for older mobile phones which don't understand UDH; such mobile phones might display the UDH as a jumble of strange characters; if the first character after UDH was Carriage Return, the mobile phone would rewrite the message with the rest of the message.

Addresses

A GSM 03.40 message contains at most one address: destination address in SMS-SUBMIT and SMS-COMMAND, originator address in SMS-DELIVER and recipient address in SMS-STATUS-REPORT. Other addresses are carried by lower layers.
The format of addresses in the GSM 03.40 is described in the following table:
octetMeaning
0address length in nibbles
1EXT, TON, NPI
2-11address digits

Type of number :
Bit
6 5 4
Meaning
0 0 0Unknown
0 0 1International number
0 1 0National number
0 1 1Network specific number
1 0 0Subscriber number
1 0 1Alphanumeric,
1 1 0Abbreviated number
1 1 1Reserved for extension

If a subscriber enters a telephone number with `+' sign at its start, the `+' sign will be removed and the address gets TON=1, NPI=1. The number itself must always start with a country code and must be formatted exactly according to the E.164 standard.
In contrast, for numbers written without `+' sign the address gets TON=0, NPI=1. In this case the number must adhere to the mobile operator's dial plan, which means that international numbers must have the international prefix before the country code and numbers for long-distance calls must start with the trunk prefix followed by a trunk code.
Numbering plan identification :
Bits
3 2 1 0
Meaning
0 0 0 0Unknown
0 0 0 1ISDN/telephone numbering plan
0 0 1 1Data numbering plan
0 1 1 0Service Centre Specific plan 2)
1 0 0 0National numbering plan
1 0 0 1Private numbering plan
1 0 1 0ERMES numbering plan
1 1 1 1Reserved for extension

Telephone numbers should have NPI=1. Application servers may use alphanumeric addresses which have TON=5, NPI=0 combination.
The EXT bit is always 1 meaning "no extension".

Address examples

U.S. number +1 555 123 4567 would be encoded as 0B 91 51 55 21 43 65 F7.
Alphanumeric address is at first put to the GSM 7-bit default alphabet, then encoded the same way as any message text in TP-UD field and then the address is supplied with the "number" length and TON and NPI.
For example, a fictional alphanumeric address Design@Home is converted to the GSM 7-bit default alphabet which yields 11 bytes 44 65 73 69 67 6E 00 48 6F 6D 65, the 7-bit packing transforms it to 77 bits stored in 10 octets as C4 F2 3C 7D 76 03 90 EF 76 19; 77 bits is 20 nibbles which is the value of the first octet of the address. The second octet contains TON and NPI, which yields D0 hex. The complete address in the GSM format is 14 D0 C4 F2 3C 7D 76 03 90 EF 76 19.

Time Format

A date and time used in TP-SCTS, TP-DT and in Absolute format of TP-VP is stored in 7 octets:
octetContent
0Last two digits of the year
1Month
2Day
3Hour
4Minute
5Second
6Time zone

In all octets the values are stored in binary coded decimal format with switched digits.
Time zone is given in quarters of an hour. If the time zone offset is negative bit 3 of the last octet is set to 1.
23:01:56 Mar 25th 2013 PST would be encoded as 31 30 52 32 10 65 8A.
In this example, the time zone, 8A is binary 1000 1010. Bit 3 is 1, therefore the time zone is negative. The remaining number is 1000 0010, hexadecimal 82. Treat this as any previous element in the sequence,. Finally the time zone offset is given by 28 × 15 minutes = 420 minutes.

Validity Period

An SMS-SUBMIT TPDU may contain a TP-VP parameter which limits the time period for which the SMSC would attempt to deliver the message. However, the validity period is usually limited globally by the SMSC configuration parameter— often to 48 or 72 hours. The Validity Period format is defined by the Validity Period Format field:
TP-VPFTP-VP formatTP-VP length
0 0TP-VP not present0
0 1Enhanced format7
1 0Relative format1
1 1Absolute format7

Relative format

TP-VP valueValidity periodPossible validity periods
0–143 x 5 minutes5, 10, 15 minutes... 11:55, 12:00 hours
144–167 hours12:30, 13:00,... 23:30, 24:00 hours
168–196 days2, 3, 4,... 30 days
197–255 weeks5, 6, 7,... 63 weeks

Absolute format

The absolute format is identical to the other time formats in GSM 03.40.

Enhanced format

Enhanced format of TP-VP field is seldom used. It has always 7 octets, although some of them are not used. The first octet is TP-VP Functionality Indicator. Its 3 least significant bits have the following meaning:
2 1 0Meaning
0 0 0No validity period specified
0 0 1The following octet is a relative validity period as described in the Relative Validity Period Values table
0 1 0The following octet contains a relative validity period in seconds in the range 0 to 255
0 1 1The following 3 octets contain a relative validity period in hours, minutes and seconds as the 3rd to 5th octet of time format
1 X XReserved

The value of 1 in the bit 6 of the first octet means that the message is Single-shot. The value of 1 in the bit 7 of the first octet indicates that TP-VP functionality indicator extends to another octet. However, no such extensions are defined.

Protocol Identifier

TP-PID either refers to the higher layer protocol being used, indicates interworking with a certain type of telematic device, specifies replace type of the message or allows download of configuration parameters to the SIM card. Plain MO-MT messages have PID=0.
TP-PIDmeaning
0Default store and forward short message
1–31no telematic interworking, but SME to SME protocol
32implicit telemetic device
33Telex or teletex reduced to telex format
34Group 3 telefax
35Group 4 telefax
36Voice telephone
37ERMES
38National Paging system
39Videotex
40Teletex, carrier unspecified
41Teletex, in PSPDN
42Teletex, in CSPDN
43Teletex, in analog PSTN
44Teletex, in digital ISDN
45UCI
46–47Reserved
48A message handling facility
49Any public X.400 based message handling system
50Internet Electronic Mail
51–55Reserved
56–62SC-specific; usage based on mutual agreement between the SME and the SC
63A GSM/UMTS mobile station.
64Short Message Type 0
65Replace Short Message Type 1
66Replace Short Message Type 2
67Replace Short Message Type 3
68Replace Short Message Type 4
69Replace Short Message Type 5
70Replace Short Message Type 6
71Replace Short Message Type 7
72Device Triggering Short Message
73–93Reserved
94Enhanced Message Service
95Return Call Message
96–123Reserved
124ANSI-136 R-DATA
125ME Data download
126ME De personalization Short Message
127SIM Data download
128–191reserved
192–255Assigns bits 0 5 for SC specific use

For TP-PID = 63 the SC converts the SM from the received TP Data Coding Scheme to any data coding scheme supported by that MS.
Short Message Type 0 is known as a silent SMS. Any handset must be able to receive such short message irrespective of whether there is memory available in the SIM or ME or not, must acknowledge receipt of the message, but must not indicate its receipt to the user and must discard its contents, so the message will not be stored in the SIM or ME.

Data Coding Scheme

A special 7-bit encoding called GSM 7 bit default alphabet was designed for Short Message System in GSM. The alphabet contains the most-often used symbols from most Western-European languages. Some ASCII characters and the Euro sign did not fit into the GSM 7-bit default alphabet and must be encoded using two septets. These characters form GSM 7-bit default alphabet extension table. Support of the GSM 7-bit alphabet is mandatory for GSM handsets and network elements.
Languages which use Latin script, but use characters which are not present in the GSM 7-bit default alphabet, often replace missing characters with diacritic marks with corresponding characters without diacritics, which causes a not entirely satisfactory user experience, but is often accepted. For best look the 16-bit UTF-16 encoding may be used at price of reducing length of a message from 160 to 70 characters.
The messages in Chinese, Korean or Japanese languages must be encoded using the UTF-16 character encoding. The same was also true for other languages using non-Latin scripts like Russian, Arabic, Hebrew and various Indian languages. In 3GPP TS 23.038 8.0.0 published in 2008 a new feature, an extended National language shift table was introduced, which in the version 11.0.0 published in 2012 covers Turkish, Spanish, Portuguese, Bengali, Gujarati, Hindi, Kannada, Malayalam, Oriya, Punjabi, Tamil, Telugu and Urdu languages. The mechanism replaces GSM 7-bit default alphabet code table and/or extended table with a national table according to special information elements in User Data Header. The non-segmented message using national language shift table may carry up to 155 7-bit characters.
The Data Coding Scheme field contains primarily information about message encoding. GSM recognizes only 2 encodings for text messages and 1 encoding for binary messages:
  • GSM 7-bit default alphabet
  • UCS-2
  • 8-bit data
The TP-DCS octet has a complex syntax to allow carrying of other information; the most notable are message classes:
ValueMessage Class
0 00 - Flash messages
0 11 - ME-specific
1 02 - SIM / USIM specific
1 13 - TE-specific

Flash messages are received by a mobile phone even though it has full memory. They are not stored in the phone, they just displayed on the phone display.
Another feature available through TP-DCS is Automatic Deletion: after reading the message is deleted from the phone.
Message Waiting Indication group of DCS values can set or reset flags of indicating presence of unread voicemail, fax, e-mail or other messages.
A special DCS values also allows message compression, but it perhaps is not used by any operator.
The values of TP-DCS are defined in GSM recommendation 03.38. Messages sent via this encoding can be encoded in the default GSM 7-bit alphabet, the 8-bit data alphabet, and the 16-bit UCS-2 alphabet.

Discharge Time

The TP-DT field indicates the time and date associated with a particular TP-ST outcome:
  • if the message has been delivered or, more generally, other transaction completed, the TP-DT is the time of the completion of the transaction
  • if the SMSC is still trying to deliver the message, the TP-DT is the time of the last delivery attempt
  • if the SMSC is not making any more delivery attempts, the TP-DT is either the time of the last delivery attempt or the time at which the SMSC disposed the message

Parameter Indicator

The TP-PI field indicates presence of further fields in the SUBMIT-REPORT, DELIVER-REPORT or SMS-STATUS-REPORT TPDU.
bitMeaning
0TP-PID
1TP-DCS
2TP-UDL and TP-UD
8another TP-PI octet

As currently there are still four free bits in TP-PI, it can be expected that the extension bit will be zero even in the future, which helps to distinguish TP-PI field from TP-FCS field when information whether TPDU is part of positive or negative response is not available: if the most significant bit of the second octet of TPDU is 1, the second octet is TP-FCS, otherwise it is TP-PI.