TOC 
Network Working GroupR. Sparks
Internet-Draftdynamicsoft
Expires: January 13, 2005July 15, 2004

Actions addressing identified issues with the Session Initiation Protocol's non-INVITE Transaction

draft-sparks-sip-nit-actions-01

Status of this Memo

By submitting this Internet-Draft, I certify that any applicable patent or other IPR claims of which I am aware have been disclosed, and any of which I become aware will be disclosed, in accordance with RFC 3668.

Internet-Drafts are working documents of the Internet Engineering Task Force (IETF), its areas, and its working groups. Note that other groups may also distribute working documents as Internet-Drafts.

Internet-Drafts are draft documents valid for a maximum of six months and may be updated, replaced, or obsoleted by other documents at any time. It is inappropriate to use Internet-Drafts as reference material or to cite them other than as "work in progress."

The list of current Internet-Drafts can be accessed at http://www.ietf.org/ietf/1id-abstracts.txt.

The list of Internet-Draft Shadow Directories can be accessed at http://www.ietf.org/shadow.html.

This Internet-Draft will expire on January 13, 2005.

Copyright Notice

Copyright (C) The Internet Society (2004). All Rights Reserved.

Abstract

This draft describes modifications to the Session Initiation Protocol (SIP) to address problems that have been identified with the SIP non-INVITE transaction. These modifications reduce the probability of messages losing the race condition inherent in the non-INVITE transaction and reduce useless network traffic. They also improve the robustness of SIP networks when elements stop responding. These changes update behavior defined in RFCs 3261.



Table of Contents

1.  Introduction
2.  Improving the situation when responses are only delayed
    2.1  Action 1: Make the best use of provisional responses
    2.2  Action 2: Remove the useless late-response storm
3.  Improving the situation when an element is not going to respond
4.  Normative Updates to RFC 3261
    4.1  Action 1
    4.2  Action 2
§.  References
§  Author's Address
§  Intellectual Property and Copyright Statements




 TOC 

1. Introduction

There are a number of unpleasant edge conditions created by the SIP non-INVITE transaction model's fixed duration. The negative aspects of some of these are exacerbated by the effect provisional responses have on the non-INVITE transaction state machines. These problems are documented in [3]Sparks, R., Problems identified associated with the Session Initiation Protocol's non-INVITE Transaction, February 2004.. In summary:

A non-INVITE transaction must complete immediately or risk losing a race

Losing the race will cause the requester to stop sending traffic to the responder (the responder will be temporarily blacklisted)

Provisional responses can delay recovery from lost final responses

The 408 response is useless for the non-INVITE transaction

As non-INVITE transactions through N proxies time-out, there can be an O(N^2) storm of the useless 408 responses

This draft specifies updates to RFC 3261Rosenberg, J., Schulzrinne, H., Camarillo, G., Johnston, A., Peterson, J., Sparks, R., Handley, M. and E. Schooler, SIP: Session Initiation Protocol, June 2002.[1] to improve the behavior of SIP elements when these edge conditions arise.



 TOC 

2. Improving the situation when responses are only delayed

There are two goals to achieve when we constrain the problem to those cases where all elements are ultimately responsive and networks ultimately deliver messages:

2.1 Action 1: Make the best use of provisional responses

Since non-INVITE transactions must complete rapidly ([3]Sparks, R., Problems identified associated with the Session Initiation Protocol's non-INVITE Transaction, February 2004.), any information beyond "I'm here" (which can be provided by a 100 Trying) can be just as usefully delayed to the final response. Sending non-100 provisionals wastes bandwidth.

As shown in [3]Sparks, R., Problems identified associated with the Session Initiation Protocol's non-INVITE Transaction, February 2004., sending any provisional response inside a NIT before Timer E reaches T2 damages recovery from failure of an unreliable transport.

Without a provisional, a late final response is the same as no response at all and will likely result in blacklisting the late responding element ([3]Sparks, R., Problems identified associated with the Session Initiation Protocol's non-INVITE Transaction, February 2004.), If an element is delaying its final response at all, sending a 100 Trying after Timer E reaches T2 prevents this blacklisting without damaging recovery from unreliable transport failure.

Blacklisting on a late response occurs even over reliable transports. Thus, if an element processing a request received over a reliable transport is delaying its final response at all, sending a 100 Trying well in advance of the timeout will prevent blacklisting. Sending a 100 Trying immediately will not harm the transaction as it would over UDP, but a policy of always sending such a message results in unneccessary traffic. A policy of sending a 100 Trying after the period of time in which Timer E reaches T2 had this been a UDP hop is one reasonable compromise.

2.2 Action 2: Remove the useless late-response storm

A 408 to non-INVITE will always arrive too late to be useful ([3]Sparks, R., Problems identified associated with the Session Initiation Protocol's non-INVITE Transaction, February 2004.), The client already has full knowledge of the timeout. The only information this message would convey is whether or not the server believed the transaction timed out. However, with the current design of the NIT, a client can't do anything with this knowledge. Thus the 408 simply wasting network resources and contributes to the response bombardment illustrated in [3]Sparks, R., Problems identified associated with the Session Initiation Protocol's non-INVITE Transaction, February 2004..

Late non-INVITE responses by definition arrive after the client transaction's Timer F has fired and the client transaction has entered the Terminated state. Thus, these responses cannot be distinguished from strays. Changing the protocol behavior to prohibit forwarding non-INVITE stray responses stops the late response storm. It also improves the proxy's defenses against malicious users counting on the RFC 3261 requirement to forward such strays.



 TOC 

3. Improving the situation when an element is not going to respond

When we expand the scope of the problem to also deal with element or network failure, we have more goals to achieve:

Action 1 helps with the first two goals, dramatically improving an element's ability to distinguish between failure and delayed response from the next downstream element. Some response, either provisional or final, will almost certainly be received before the transaction times out. So, an element can more safely assume that no response at all indicates the peer is not available and follow the existing requirements in [1]Rosenberg, J., Schulzrinne, H., Camarillo, G., Johnston, A., Peterson, J., Sparks, R., Handley, M. and E. Schooler, SIP: Session Initiation Protocol, June 2002. and [2]Rosenberg, J. and H. Schulzrinne, Session Initiation Protocol (SIP): Locating SIP Servers, June 2002. for that case.

Achieving the third goal requires more agressive changes to the protocol. As noted in [3]Sparks, R., Problems identified associated with the Session Initiation Protocol's non-INVITE Transaction, February 2004., future non-invite transactions are likely to fail again unless the implementation takes steps beyond what is defined in [1]Rosenberg, J., Schulzrinne, H., Camarillo, G., Johnston, A., Peterson, J., Sparks, R., Handley, M. and E. Schooler, SIP: Session Initiation Protocol, June 2002. and [2]Rosenberg, J. and H. Schulzrinne, Session Initiation Protocol (SIP): Locating SIP Servers, June 2002. to remember non-responsive destinations between transactions. Standardizing these extra steps is left to future work.



 TOC 

4. Normative Updates to RFC 3261

4.1 Action 1

A SIP element MUST NOT send any provisional response with a Status-Code other than 100 to a non-INVITE request.

A SIP element MUST NOT respond to a request with a Status-Code of 100 over any unreliable transport, such as UDP, before the amount of time it takes a client transaction's Timer E to be reset to T2.

A SIP element MAY respond to a request with a Status-Code of 100 over an unreliable transport after the amount of time it takes a client transaction's Timer E to be reset to T2.

A SIP element MAY respond to a request with a Status-Code of 100 over a reliable transport at any time.

4.2 Action 2

A transaction-stateful SIP element MUST NOT send a response with Status-Code of 408 to a non-INVITE request. As a consequence, an element that can not respond before the transaction expires will not send a final response at all.

A transaction-stateful SIP proxy MUST NOT send any response to a non-INVITE request unless it has a matching server transaction that is not in the Terminated state. As a consequence, this proxy will not forward any "late" non-INVITE response.



 TOC 

5 References

[1] Rosenberg, J., Schulzrinne, H., Camarillo, G., Johnston, A., Peterson, J., Sparks, R., Handley, M. and E. Schooler, "SIP: Session Initiation Protocol", RFC 3261, June 2002.
[2] Rosenberg, J. and H. Schulzrinne, "Session Initiation Protocol (SIP): Locating SIP Servers", RFC 3263, June 2002.
[3] Sparks, R., "Problems identified associated with the Session Initiation Protocol's non-INVITE Transaction", draft-sparks-sip-nit-problems (work in progress), February 2004.


 TOC 

Author's Address

  Robert J. Sparks
  dynamicsoft
  5100 Tennyson Parkway
  Suite 1200
  Plano, TX 75024
EMail:  rsparks@dynamicsoft.com


 TOC 

Intellectual Property Statement

Disclaimer of Validity

Copyright Statement

Acknowledgment