Freescale Semiconductor Application Note AN2254 Rev. 1, 11/2004 Scrambling Code Generation for WCDMA on the StarCore™ SC140/SC1400 Cores By Imran Ahmed In a Wideband Code Division Multiple Access (WCDMA) environment, each user is assigned a unique complex scrambling sequence to encode its information-bearing signal. The receiver has the scrambling code of the user, unscrambles the received signal, and recovers the original data [1]. This application note presents a method for complex pseudo-random sequence (PN code) generation and complex scrambling of an I/Q code multiplexed signal on a StarCore® SC140 digital signal processor (DSP). The PN codes in this application note are generated for a WCDMA Universal Mobile Telecommunications Systems (UMTS) uplink (signal from handset to base station) according to the third-generation partnership project (3GPP) specifications. This application note provides practical information to help users understand PN code generation and complex scrambling, which are required in the WCDMA standards. Typically, these operations are performed on Architecture-Specific Integrated Circuits (ASICs), but here we explore the use of the Freescale StarCore™-based DSPs to accomplish the same task. © Freescale Semiconductor, Inc., 2002, 2004. All rights reserved. CONTENTS 1 1.1 1.2 2 2.1 2.2 3 3.1 3.2 3.3 3.4 4 5 Pseudo-Random Sequences ....................................2 Randomness Properties ........................................... 2 Generating Pseudo-Random Sequences ..................2 Scrambling Codes for WCDMA .............................2 Generating Long Complex Scrambling Codes ....... 3 Scrambling an I-Q/Code Multiplexed Sign ......... al 6 Software Implementation on the StarCore SC140/SC1400 Cores ............................................. 6 Allocating Memory Space ...................................... 6 Binary PN Code and Complex Scrambling Sequences................................................................ 7 Forming the Complex Scrambling Sequences ........9 Complex Scrambling of an IQ/Code Multiplexed Signal ................................................ 13 Results ...................................................................16 References .............................................................19 Pseudo-Random Sequences 1 Pseudo-Random Sequences Pseudo-random sequences or PN codes are sequences of 1s and 0s generated by an algorithm so that the resulting numbers look statistically independent and uniformly distributed. A random signal differs from a pseudo-random signal in that a random signal cannot be predicted. A pseudo-random signal is not random at all; it is a deterministic, periodic signal that is known to both the transmitter and the receiver. Even though the signal is deterministic, it appears to have the statistical properties of sampled white noise. To an unauthorized listener, it appears to be a truly random signal. 1.1 Randomness Properties CDMA systems achieve their multiple access capability using large sets of sequences with three basic properties that are applied to a periodic binary sequence as a test for the appearance of randomness [2]: • Balance Property. In each period of the sequence, the number of binary 1s must differ from the number of binary 0s by at most one digit. In other words, the sequences are balanced so that each element of the sequence alphabet occurs with equal frequency. • Run Property. A run is defined as a sequence of the same binary digit. The appearance of a different binary digit marks the start of a new run. The length of the run is the number of digits in the run. For the randomness run property, in each period, about one-half the runs of each binary digit should be of length 1, about one-fourth of length 2, one-eighth of length 3, and so on. • Correlation Property. Random sequences are often described in terms of their correlation properties. A scrambling sequence in a CDMA system must have small off-peak autocorrelation values to allow for rapid sequence acquisition at the receiver and to minimize self interference due to multipath acquisitions. Furthermore, the cross correlations are small enough among such sequences at all delays to minimize multiple-access interference. 1.2 Generating Pseudo-Random Sequences Pseudo-random binary codes are typically generated using a system of linear feedback shift registers (LFSRs). The LFSR generators produce a sequence that depends on the number of stages, the feedback tap connections, and the initial conditions. The output sequences can be classified as either maximal length (m-sequence) or nonmaximal length. The m-sequences have the property that for an n-stage LFSR the sequence repetition period in clock pulses, p, is as shown in Equation 1. Equation 1 P = 2n–1 Thus, if the sequence length is less than the maximum period of (2n–1), the sequence is classified as a nonmaximal length sequence. In fact, all the m-sequences are generated by primitive polynomials of degree n over Galois Field 2 (GF(2)). 2 Scrambling Codes for WCDMA In a CDMA scheme, all users transmit on the same frequency and are differentiated by their unique scrambling codes. The receiver correlates the received signal with a synchronously generated replica of the scrambling code to recover the original information-bearing signal. The third-generation partnership project (3GPP) specifications define how these uplink complex scrambling codes are generated. Part of the process in the transmitter, in addition Scrambling Code Generation for WCDMA on the StarCore™ SC140/SC1400 Cores, Rev. 1 2 Freescale Semiconductor Scrambling Codes for WCDMA to spreading, is the scrambling operation. Because scrambling is used on top of spreading as shown Figure 1, it does not change the bandwidth of the signal, but only makes the signals from different users separable from each other [3]. Symbol Rate Chip Rate Chip Rate Data Channelization Code Scrambling Code Figure 1. Relation Between Spreading and Scrambling With I-Q/code multiplexing, also called dual-channel quaternary phase shift keying (QPSK) modulation, the power levels of the dedicated physical data channel (DPDCH) and the dedicated physical control channel (DPCCH) typically differ. This is especially true as data rates increase and can lead in extreme cases to binary phase shift keying (BPSK) type transmission when the branches are independently transmitted. This situation is avoided by using a scrambling operation after the spreading with channelization codes. The transmission of two parallel channels, DPDCH and DPCCH, leads to multicode transmission, which increases the peak-to-average power ratio [3]. The spreading modulation solution shown in Figure 2 keeps the transmitter power amplifier efficiency the same as for normal balanced QPSK transmission in general. CD CSCRAMB I DPDCH IQ Multiplex I+jQ Q DPCCH To QPSK Modulation CC Figure 2. I-Q/Code Multiplexing With Complex Scrambling. 2.1 Generating Long Complex Scrambling Codes All uplink physical channels are subjected to scrambling with a complex-valued scrambling code. In WCDMA uplink transmissions, the scrambling code can either be short or long. There are 224 long uplink scrambling codes, and these codes are assigned by higher layers. The long codes are essentially Gold codes. Large sets of Gold codes have low cross-correlation properties so that as many users as possible can use the channel with minimum mutual interference. According to 3GPP specifications, Gold codes are generated with a system of 25-stage linear feedback shift registers, as shown in Figure 3 [4]. Scrambling Code Generation for WCDMA on the StarCore™ SC140/SC1400 Cores, Rev. 1 Freescale Semiconductor 3 Scrambling Codes for WCDMA MSB LSB C1,n C2,n Figure 3. Uplink Long Scrambling Code Generator These 25-degree generator polynomials are truncated to the 10 ms frame length that results in 38400 chips at the rate of 3.84 Mcps. The long scrambling sequences, c1,n and c2,n, are constructed from a position-wise modulo 2 sum of 38400 chip segments of the two binary m-sequences. The two binary m-sequences are constructed using the following primitive polynomial over GF(2), as show in Figure 3. Furthermore, sequence c2,n is a 16,777,232 chip delayed version of sequence c1,n. Equation 2 X25 + X3 + 1 Equation 3 X25 + X3 + X2 + X + 1 Let x, and y be the two m-sequences that are constructed from primitive polynomials of Equation 2 and Equation 3, respectively. The resulting sequences constitute segments of a set of Gold sequences. Now, let n23 ... n0 be the 24-bit binary representation of the scrambling sequence number n with n0 as the least significant bit. The x sequence depends on the chosen scrambling sequence number n and is denoted as xn in the sequel. Furthermore, let xn(i) and y(i) denote the i:th symbol of the sequences xn and y, respectively. The m-sequences xn and y are constructed as follows: 1. Initial conditions: Equation 4 xn(0) = n0, xn(1) = n1, ..., xn(22) = n22, xn(23) = n23, xn(24) = 1 Equation 5 y(0) = y(1) = ... = y(23) = y(24) = 1 2. Recursive definition of subsequent symbols: Scrambling Code Generation for WCDMA on the StarCore™ SC140/SC1400 Cores, Rev. 1 4 Freescale Semiconductor Scrambling Codes for WCDMA Equation 6 xn(i+25) = xn(i+3) + xn(i) modulo 2, i=0, ..., 225-27 . Equation 7 y(i+25) = y(i+3) + y(i+2) + y(i+1) + y(i) modulo 2, i=0, ..., 225-27 3. Binary Gold sequence zn: Equation 8 zn(i) = xn(i) + y(i) modulo 2, i=0, ..., 225-2 4. Real-valued Gold sequence: Equation 9 +1 if zn(i) = 0 for i = 0, 1, 2, ..., 225-2. Zn(i) = -1 if zn(i) = 1 a. The real-valued long scrambling sequences c1,n and c2,n are defined as follows: Equation 10 c1,n(i) = Zn(i), i=0, ..., 225-2 Equation 11 c2,n(i) = Zn(i+16777232) modulo (225–1), i=0, ..., 2 25–2 b. The complex-valued long scrambling sequence Cn, is defined as follows, where i = 0, 1, ..., 225-2 and denotes rounding to the nearest lower integer: Equation 12 i Cn(i) = c1,n(i) ( 1 + j( -1 ) c2,n( 2 * FLOOR(i/2) ) ) A more intuitive way of forming the complex-valued scrambling code from two real-valued codes, c1,n and c2,n, with the decimation principle is: Equation 13 Cscrambling = c1,n( w0 + jc2,n (2k) w1 ), k = 0, 1, 2, ... with sequences w0 and w1 given as chip rate sequences: Equation 14 w0 = {1 1}, w1 = {1 -1} The decimation factor for the second sequence is 2. Ultimately this way of creating the scrambling sequence reduces the zero crossings in the constellation and further reduces the amplitude violations in the modulation process. In conclusion, Equation 13 and Equation 14 give the same complex scrambling code as is achieved through Equation 12. Scrambling Code Generation for WCDMA on the StarCore™ SC140/SC1400 Cores, Rev. 1 Freescale Semiconductor 5 Software Implementation on the StarCore SC140/SC1400 Cores 2.2 Scrambling an I-Q/Code Multiplexed Signal Figure 2 shows that before the data signal is QPSK modulated, the I-Q/code multiplexed data signal is multiplied with the complex scrambling code. In this step, the two complex signals are multiplied together as shown in the following equations, where DI = the real part of the incoming data: Equation 15 ( DIi + jDQi ) x ( SIi + jSQi ) Where: • DI = real part of the incoming data • DQ = complex part of the incoming data • SI = real part of the scrambling code • SQ = complex part of the scrambling code • I = 0, 1, 2, . . . , 38399 Equation 15 implies the final result, as follows: Equation 16 (( DIi*SIi ) – ( DQi*SQi )) + j(( DIi*SQ i ) + ( DQi*SIi )) 3 Software Implementation on the StarCore SC140/SC1400 Cores This section describes how the algorithms in Section 2, Scrambling Codes for WCDMA, are implemented on the StarCore SC140/SC1400 DSP cores. For ease of implementation, the algorithms slightly differ from the theory presented in Section 2. The first part of the program generates the PN code, and the second part performs the actual scrambling of the incoming signal. First, the memory space required for these calculations is specified. 3.1 Allocating Memory Space The assembly code assumes that required memory space has been allocated before the assembly routine is called. This memory space is 16-bit aligned. Table 1 lists the exact amount of space required for different global variables. Table 1. Memory Allocation Global Variable Name Description Number of Bytes REG1 Holds the starting phase value for PN code generation. As shown in Equation 3, the PN code generated depends on the initial value of the 25-stage LFSR. The most significant bit of the upper 25-stage LFSR is always one (1), and the initial value for this register is passed to the assembly code. The lower 25-stage LFSR does not require initialization because all of its 25 bits are always configured to a value of one (1) at the start of a new sequence. 4 Scrambling Code Generation for WCDMA on the StarCore™ SC140/SC1400 Cores, Rev. 1 6 Freescale Semiconductor Software Implementation on the StarCore SC140/SC1400 Cores Table 1. Memory Allocation Global Variable Name Description Number of Bytes CODE_IQ Points to the interleaved complex scrambling code. This scrambling code is stored as a real-valued code scaled down by a factor of 2 (+1 as +0.5 or -1 as -0.5). Since the code length for a frame is 38400 chip segments, each sample is stored as a 16 bit sample, and each chip segment contains an I and Q part, a buffer of 38400*2*2 bytes is assigned. 38400 × 2 × 2 INPUT_IQ Points to the buffer in the memory where the interleaved input data to be scrambled is stored. This assembly code assumes that the data samples are 16-bits wide and are held in the memory buffer as I/Q interleaved samples. 38400 × 2 × 2 Points to the memory buffer where the interleaved scrambled data is stored for one frame. 38400 × 2 × 2 OUTPUT_IQ 3.2 Binary PN Code and Complex Scrambling Sequences For optimal implementation of the algorithm to generate the complex binary PN code, 16 stacked-bit samples are generated in one iteration rather than generating the PN code one bit at a time. Since a PN code is essentially a system of LFSRs, the last 16 bits are processed in one operation to give 16 samples of PN code. Example 1 shows the pseudo code for this implementation. Example 1. Pseudo Code for 16-Bit Vector Processing X= Upper LFSR Y= Lower LFSR for (i = 0; i < 2400; i++) X0 X3 = = (X >> 0) (X >> 3) ;//(38400/16) = 2400 { ;//X0 holds the lower 16 bits of X non-shifted, reqd for c1 & X25 feedback ;//X3 holds the lower 16 bits of X shifted by 3,reqd for X25 feedback poly. X4 = X7 = X25 = (X >> 4) (X >> 7) (X3 ^ X0) ;//X4 holds the lower 16 bits of X shifted by 4, reqd for c2 ;//X7 holds the lower 16 bits of X shifted by 7, reqd for c2 ;//feedback polynomial, accodring to eqn. 6, most significant 16 bits ;//for next iteration, 9 from previous iteration, as old 16 shifted out X = (X >> 16) ;//lower 16 bits shifted out X = (X | (X25 << 9)) ;//most sig. 9 bits from prev iteration & 16 sig bits from this iteration ;//X is ready for next X18 = (X >> 2) ;//X18 holds the lower 16 bits of X shifted by 18, reqd for c2 Y0 Y1 = (Y >> 0) = (Y >> 1) ;//Y0 holds the lower 16 bits of Y non-shifted, reqd for c1 & Y25 feedback ;//Y1 holds the lower 16 bits of Y shifted by 1, reqd for Y25 feedback Y2 = (Y >> 2) ;//Y2 holds the lower 16 bits of Y shifted by 2, reqd for Y25 feedback Y3 = (Y >> 3) ;//Y3 holds the lower 16 bits of Y shifted by 3, reqd for Y25 feedback poly. poly. poly. Y4 = (Y >> 4) ;//Y4 holds the lower 16 bits of Y shifted by 4, reqd for c2 Y6 = (Y >> 6) ;//Y5 holds the lower 16 bits of Y shifted by 6, reqd for c2 Y25 = (Y3 ^ Y2 ^ Y1 ^ Y0) ;//feedback polynomial, accodring to eqn. 7, most significant 16 bits ;//for next iteration, 9 from previous iteration, as old 16 shifted out Y = (Y >> 16) ;//lower 16 bits shifted out Y = (Y | (Y25 << 9)) ;//most sig. 9 bits from prev iteration & 16 sig bits from this iteration ;//Y is ready for next Y17 = (Y >> 1) ;//X17 holds the lower 16 bits of Y shifted by 17, reqd for c2 Scrambling Code Generation for WCDMA on the StarCore™ SC140/SC1400 Cores, Rev. 1 Freescale Semiconductor 7 Software Implementation on the StarCore SC140/SC1400 Cores c1 = (X0 ^ Y0) ;//16-stacked bit c1 according to figure 3 configuration c2 = (X4 ^ X7 ^ X18 ^ Y4 ^ Y6 ^ Y17) ;//16-stacked bit c2 according to figure 3 configuration } Now, the formation of the complex scrambling codes begins. In this part of the code, the C/Assembly calling function enters the assembly code, and the data variables and pointers are put into the appropriate registers for the assembly function to use. The pn_generation subroutine includes the pn_generation_param.asm parameter file, which defines the local constants used by this subroutine: • REG2_INIT holds the value 0x01FFFFFF for initializing the lower 25-stage LFSR, as shown in Figure 3. • MASK16 holds the value 0x000000FFFF for masking the lower 16-bits of a data register. • NUM_ITER specifies the number of times the main loop in the function iterates. It is initialized to 2400. Data registers D6 and D7 are the two 25-stage LFSRs. Since the LFSRs are only 25-stage, only the lower 25 bits of the data registers are used for this purpose. The most significant 15-bits are set to zero (data registers are 40 bits wide). The first three instructions initialize the lower LFSR and the upper LFSR as shown in Figure 3 according to the initialization value that is stored in global variable REG1 for the upper LFSR (see Example 2). The last instruction loads address register R0 to point to the memory buffers to store the interleaved scrambling code samples I and Q. Example 2. Setting Data and Address Registers move.l move.l move.l move.l will be #REG1,r0 #REG2_INIT,d7 (r0),d6 #CODE_IQ,r0 stored ;//R0 ;//D7 ;//D6 ;//R0 points is the is the points to initial value of upper LFSR lower LFSR upper LFSR to where IQ scrambling code The program can be divided into two main parts: 1. Generating the binary PN code. 2. Forming the complex scrambling sequence. 3.2.1 Generating the Binary PN Code Generating the binary PN codes as stacked bits is accomplished following the algorithm shown in Example 1. The mainloop in the program generates the PN codes. The mainloop produces 16-bit stacked c1 and c2 PN code samples, as shown in Figure 3. As the routine starts, it executes instructions to set up the address and data registers before the code jumps into mainloop. The code sets up mainloop and the loop counter for the loop to perform 2400 iterations, as described in Example 3 (which shows a complete assembly code listing for generating the PN codes and the function for forming complex scrambling sequences, pn_generation.asm). Following is a step-by-step description of one iteration of the StarCore DSP code to demonstrate how it executes: 1. To determine c1and c2 for the PN code, we must determine the polynomials that are required. The c1 part of the PN code is a modulo 2 sum of the least significant bits of the X and Y registers. a. The first 16-bit c1 sample is determined in instruction set ‘b’ of Example 3. b. Inside the mainloop, it is calculated in instruction set ‘j’ and stored into the memory buffer in instruction set ‘d’. 2. Determining c2 requires a modulo 2 sum of several shifted polynomials: — 4-bit shifted D6 (X4-instruction set ‘c’ and ‘k’) Scrambling Code Generation for WCDMA on the StarCore™ SC140/SC1400 Cores, Rev. 1 8 Freescale Semiconductor Software Implementation on the StarCore SC140/SC1400 Cores — 7-bit shifted D6 (X7-instruction set ‘d’) — 18-bit shifted D6 (X18-instruction set ‘h’) — 4-bit shifted D7 (Y4-instruction set ‘d’) — 6-bit shifted D7 (Y6-instruction set ‘e’) — 17-bit shifted D7 (Y17-instruction set ‘i’) The first 16-bit sample for c2 is determined in instruction set ‘j’ of code listing 3 and then stored in the memory buffer in the very next instruction set ‘k’. 3. Since the algorithm determines 16-bit samples and then shifts out the lower 16 bits from data registers D6 and D7, the determination of the feedback polynomials, X25 and Y25, is required: a. The feedback polynomial X25 is a modulo 2 sum of the non-shifted lower 16 bits of D6 (X0 instruction set ‘a’ and ‘h’) and a 3-bit shifted version of D6 (X3 instruction set ‘b’ and ‘i’). b. The first feedback X25 polynomial is determined in instruction set ‘c’ and then in instruction set ‘k’ in mainloop and is stored in register D1 in the same instruction set ‘k.’ c. The feedback polynomial Y25 is a modulo 2 sum of the non-shifted lower 16 bits of D7 (Y0 instruction set ‘a’ and ‘i’), 1-bit shifted D7 (Y1 instruction set ‘a’ and ‘i’), 2-bit shifted D7 (Y2 instruction set ‘b’ and ‘j’), and a 3-bit shifted D7 (Y3 instruction set ‘c’ and ‘k’). d. The first feedback Y25 polynomial is determined in instruction set ‘d’ and stored in register D9 in the same cycle. 4. After the feedback polynomials (X25 and Y25) have been determined and the original registers (D6 and D7) are shifted by 16 bits, we put the significant 16 bits of the 25-stage LFSRs into place. This occurs in cycles ‘e,’ ‘f,’ ‘g,’ and ‘h:’ a. In instruction set ‘e,’ the lower 16 bits of the feedback polynomials (X25 and Y25) are extracted and stored in D1 and D9. b. In instruction set ‘f,’ the lower 16 bits of D1 and D9 are shifted to the left by 9, so that they become the higher 16 bits of a 25-stage LFSR. c. In instruction set ‘g,’ D6, which by now has shifted out its lower 16 bits and has only 9 bits located in its least significant part, gets the higher 16 bits from D1. d. Similarly, in instruction set ‘h,’ D7 gets its higher 16 bits from D9 for its 25-stage LFSR without affecting its lower 9 bits. These are the overall steps performed to generate the binary PN code in mainloop. The mainloop iterates 2400 times, producing 16-bit samples of c1 and c2 in each iteration. As a result, 38400 chip segments are produced. 3.3 Forming the Complex Scrambling Sequences Once the binary PN code is generated, the next step is the formation of complex scrambling sequences from the binary PN code. Complex scrambling code is formed according to Equation 12 or Equation 13 and Equation 14. According to these equations, every other sample of c2 binary PN code is selected before the formation of complex scrambling code. After a 16-bit binary scrambling sequence is formed, it is mapped into a real-valued code according to Equation 9 on page 5, one bit at a time. This occurs in the mappingloop section of the program. This part of the code takes the 16-bit c1 and c2 samples and forms complex scrambling codes, 16 bits at a time. According to Equation 12, the real part of the scrambling sequence is c1 itself, and no change is required for calculating the real part of the scrambling sequence. The complex part of the scrambling sequence is a multiplicative result of the real valued code of c1,c2 and +1 or -1, depending on whether it is an even or odd Scrambling Code Generation for WCDMA on the StarCore™ SC140/SC1400 Cores, Rev. 1 Freescale Semiconductor 9 Software Implementation on the StarCore SC140/SC1400 Cores sample. Moreover, before the multiplication to calculate the complex part of the scrambling sequence, the c2 used is a decimated version of the original c2 by 2. Decimation of c2 by a factor of 2 is accomplished by ANDing c2 with 0x5555, shifting the result to the left by one bit and then ORing the shifted result with itself. The next step is the multiplication of c1 and decimated c2 using an exclusive-or (EOR) operation. The final step is the multiplication by +1 or -1, depending on whether it is an even or odd sample. This step is also performed using an EOR operation with 0xAAAA. Thus, for each 16-bit sample of PN code, 16 chip segments of complex scrambling code are formed. Finally, the complex scrambling code is mapped into real values and stored into the memory buffer as interleaved IQ samples. To prevent overflow, a scaled-down version of the real-valued code (+1 or –1 to +0.5 or –0.5) is stored in memory. The mappingloop program iterates 15 times for one iteration of the mainloop program because one iteration of the code is performed while mappingloop is being set up. Example 3. Generating PN Codes ;******************************************************************************* ;* File: pn_generation.asm ;* Function: binary pn code generation for WCDMA ;* Author: Imran Ahmed ;* Version/Date: 1.0 Oct 10 2001 ;* ;* Target Processor: Star*Core 140 ;* ;* Description: ;* Module Details: ;* Registers Used: ;* d0,d1,d2,d3,d4,d5,d6,d7,d8,d9,d10,d11,d12,d13,d14,d15 ;* r0 ;* entry : jsr ;******************************************************************************* ;* ;* Revision History: Date Change Details Initials ;* -----------------------;* ;******************************************************************************* ;----------------------------pn_generation_param.asm---------------------------;******************************************************************************* ; ;MASK16 EQU $000000FFFF ;REG2_INIT EQU $0001FFFFFF ;NUM_ITER EQU #2400 ;MASKONE EQU $0000000001 ;MASK_DECM2 EQU $0000005555 ;MASK_PN1 EQU $000000AAAA ; ;******************************************************************************* ;------------------------------WCDMA PN GENERATION-----------------------------;******************************************************************************* section .data local include ’pn_generation_param.asm’ endsec section .text local global main_pn_generation main_pn_generation type func Scrambling Code Generation for WCDMA on the StarCore™ SC140/SC1400 Cores, Rev. 1 10 Freescale Semiconductor Software Implementation on the StarCore SC140/SC1400 Cores [ push d6 push d7 ] [ push r6 push r7 ] move.l #REG1,r0 move.l #REG2_INIT,d7 move.l (r0),d6 move.l #CODE_IQ,r0 [ move.f #0.5,d13 move.f #-0.5,d14 ;//R0 ;//D7 ;//D6 ;//R0 points is the is the points to initial value of upper LFSR lower LFSR upper LFSR to where IQ scrambling code will be stored ;//1 scaled down by a factor of 2 to 0.5 to avoid overflow ;//-1 scaled down by a factor of 2 to -0.5 to avoid ;//overflow ;//used to keep track for decimation of c2 code clr d4 ] dosetup0 mainloop doen0 #NUM_ITER ;-------------------------generation of binary PN codes (c1 & c2) starts here-----------------a [ move.w #9,d15 ;used for offset purposes in shifting registers tfr d6,d0 ;//c1 = x0 tfr d7,d9 ;//y25 = y0 lsr d7 ;//y1 tfr d6,d1 ;//x25 = x0 ] b [ eor d7,d9 ;//y25 = yo^y1 eor d9,d0 ;//c2 = x0^y0 lsr d7 ;//y2 lsrr #3,d6 ;//x3 = x >> 3 ] c [ lsr d7 ;//y3 eor d7,d9 ;//y25 = y0^y1^y2 eor d6,d1 ;//x25 = x0^x3 lsr d6 ;//x4 ] ;---------------------------mainloop main kernel-------------------------------------falign: loopstart0 mainloop d [ eor d7,d9 lsr d7 lsrr #3,d6 tfr d6,d8 move.w #9,d15 ] ;//y25 = y0^y1^y2^y3 ;//y4 ;//x7 ;//c2 = x4 ;//used as offset in shifting [ eor d7,d8 lsrr #2,d7 and #MASK16,d1,d1 and #MASK16,d9,d9 ] ;//c2 = x4^y4 ;//y6 ;//get lower 16 bits of x25, zero high bits ;//get lower 16 bits of y25, zero high bits e f [ eor d6,d8 lsll d15,d1 lsll d15,d9 lsrr #9,d6 ;//c2 = y4^x4^x7 ;//x25 =<<9 ;//y25 =<<9 ;//x >> 16 Scrambling Code Generation for WCDMA on the StarCore™ SC140/SC1400 Cores, Rev. 1 Freescale Semiconductor 11 Software Implementation on the StarCore SC140/SC1400 Cores ] g [ or d1,d6 lsrr #10,d7 eor d7,d8 ] ;//x |= x25 ;//y >> 16, was at y6 ;//c2 = y4^x4^x7 ^y6 [ tfr d6,d1 tfr d6,d0 or d9,d7 lsrr #2,d6 ] ;//x25 = x0 ;//c1 = x0 ;//y |= y25 ;//x18 [ eor lsr tfr lsr ] d6,d8 d6 d7,d9 d7 ;//c2 = y4^x4^y6^x7^x18 ;//x3 ;//y25 = y0 ;//y17 = y16 >> 1 (y1) [ eor eor eor lsr ] d7,d8 d7,d9 d9,d0 d7 ;//c2 = y4^x4^y6^x7^x18^y17 ;//y25 = y0^y1 ;//c2 = x0^y0 ;//y2 [ eor lsr eor lsr ] d6,d1 d6 d7,d9 d7 ;//x25 = x0^x3 ;//x4 ;//y25 = y0^y1^y2 ;//y3 h i j k ;--------------------mapping into real values sarts here------------------------[ and #MASK_DECM2,d8.l ;//decimation of every other sample of c2 dosetup1 mappingloop ;//setup mappingloop ] asl d8,d4 ;//left shift decimated version of c2 or d4,d8 ;//or with itself, repeats one sample twice [ eor d12,d8 ;//Q part of scrambling = c1(i)*c2(i) -- eqn. 12 doen1 #15 ;//set mappingloop counter to 15 and #MASKONE,d12,d2 ;//extract c1’s least sig. bit asr d12,d12 ;//shift out the c1 bit already checked ] [ eor #MASK_PN1,d8.l ;//Q part of scrambling [c1(i)*c2(i)] * +1 and -1 respecively tsteq d2 tfr d13,d10 ;//-- eqn. 12 ;//test c1’s bit for 0 or 1 ;//I part of scram. code, assume c1==0, map into real value 1, and #MASKONE,d8,d3 ] [ asr d8,d8 tfrf d14,d10 ;//i.e. put 0.5 ;//extracts c2’s least sig. bit ;//extracts c2’s least sig. bit ;//I part of scram. code, if c1==1, map into real value - 1, tsteq d3 tfr d13,d11 ;//i.e. put -0.5 ;//test c2’s bit for 0 or 1 ;//assume c2==0, map into real value 1, i.e. put 0.5 Scrambling Code Generation for WCDMA on the StarCore™ SC140/SC1400 Cores, Rev. 1 12 Freescale Semiconductor Software Implementation on the StarCore SC140/SC1400 Cores ] loopstart1 mappingloop [ tfrf d14,d11 and #MASKONE,d12,d2 asr d12,d12 ] [ moves.2f d10:d11,(r0)+ ;//if c2==1, map into real value -1, i.e. put 0.5 ;//extract c1’s least sig. bit ;//shift out the c1 bit already checked ;//move I and Q scrambling code into memory buffer, increment tsteq d2 tfr d13,d10 ;//buffer ;//test c1’s bit for 0 or 1 ;//I part of scram. code, assume c1==0, map into real value 1, and #MASKONE,d8,d3 asr d8,d8 ] [ tfrf d14,d10 ;//i.e. put 0.5 ;//extracts c2’s least sig. bit ;//extracts c2’s least sig. bit ;//I part of scram. code, if c1==1, map into real value - 1, tsteq d3 tfr d13,d11 ] loopend1 tfrf d14,d11 moves.2f d10:d11,(r0)+ ;//i.e. put -0.5 ;//test c2’s bit for 0 or 1 ;//assume c2==0, map into real value 1, i.e. put 0.5 ;//if c2==1, map into real value -1, i.e. put 0.5 ;//move I and Q scrambling code into memory buffer, increment ;//buffer loopend0 [ pop r6 pop r7 ] [ pop d6 pop d7 ] rts endsec 3.4 Complex Scrambling of an IQ/Code Multiplexed Signal This section describes in detail how complex scrambling code is formed on the SC140 DSP core, and also describes the process of actual complex scrambling of an I/Q code multiplexed signal. With its four ALUs, the SC140 core can compute complex numbers and perform several different operations very efficiently. Imposing one constraint on the incoming complex signal is required to ensure that all entries of the incoming I-Q/code multiplexed signal are less than one to help prevent overflow. After complex scrambling, the final output signal is scaled down by a factor of 2 and stored in memory. After the complex signal has been formed, it is time for scrambling the received data. This function carries out the complex scrambling operation according to Equation 16. The received I-Q/code multiplexed signal is multiplied by the complex scrambling code, and the Output_IQ is stored in memory. Following is the flow of the assembly code in Example 4 for generating the complex scrambling sequence from previously-generated binary PN code and scrambling the received data: 1. START mainloop #38400. 2. Read the I and Q, complex scrambling code from memory, 1-word sample at a time. Scrambling Code Generation for WCDMA on the StarCore™ SC140/SC1400 Cores, Rev. 1 Freescale Semiconductor 13 Software Implementation on the StarCore SC140/SC1400 Cores 3. Read the I and Q, input signal data from memory, 1-word sample at a time. 4. Perform the scrambling of the input data signal, according to Equation 16. 5. Store the I-Q/code interleaved complex scrambled signal into memory. 6. END mainloop. Example 4. Complex Scrambling of an I-Q/code Multiplexed Signal ;******************************************************************************* ;* ;* File: cmplx_scrambling.asm ;* Function: formation of complex scrambling code and scrambling of ;* received I-Q/code multiplexed signal for WCDMA ;* Author: Imran Ahmed ;* Version/Date: 1.0 Oct 10 2001 ;* ;* Target Processor: Star*Core 140 ;* ;* Description: ;* Module Details: ;* Registers Used: ;* d0,d1,d2,d3,d4,d5,d6,d7,d8,d9,d10,d11,d15 ;* r0,r2,r4 ;* entry : jsr ;******************************************************************************* ;* ;* Revision History: Date Change Details Initials ;* -----------------------;* ;******************************************************************************* ;--------------------------cmplx_scrambling_param.asm--------------------------;******************************************************************************* ; ;NUM_ITEREQU19199 ;//(38400-2)/2 ;INV_SQRT2 EQU #0.70710678;//1/sqrt(2) ; ;******************************************************************************* ;---------------------------WCDMA COMPLEX SCRAMBLING---------------------------;******************************************************************************* section .data local include ’cmplx_scrambling_param.asm’ endsec section .text local global main_cmplx_scrambling main_cmplx_scrambling type func [ push d6 push d7 ] [ push r6 push r7 ] Scrambling Code Generation for WCDMA on the StarCore™ SC140/SC1400 Cores, Rev. 1 14 Freescale Semiconductor Software Implementation on the StarCore SC140/SC1400 Cores move.l move.l move.l move.f #INPUT_IQ,r0 #CODE_IQ,r4 #OUTPUT_IQ,r2 #INV_SQRT2,d15 ;//R0 -> received input signal ;//R4 -> complex scrambling signal ;//R2 -> IQ complex scrambled signal to be stored ;//(1/sqrt(2)), required for scrambling to keep the dosetup1 mainloop [ move.4f (r0)+,d0:d1:d2:d3 move.4f (r4)+,d4:d5:d6:d7 ;//energy of the srambled signal constant doen1 #NUM_ITER ;//move 2 input IQ samples from memory to data ;//registers ;//move 2 scrambling IQ code samples from memory to ;//data registers ] [ mpy mpy mpy mpy ] [ mac mac mac mac ] d0,d4,d8 d0,d5,d9 d2,d6,d10 d2,d7,d11 ;//(DI*SI) ;//(DI*SQ) ;//(DI*SI) ;//(DI*SQ) part part part part from from from from eqn. eqn. eqn. eqn. 16, 16, 16, 16, 1st 1st 2nd 2nd sample sample sample sample -d1,d5,d8 d1,d4,d9 -d3,d7,d10 d3,d6,d11 ;//(-(DQ*SQ)) part from eqn. 16, 1st sample ;//(DQ*SI) part from eqn. 16, 1st sample ;//(-(DQ*SQ)) part from eqn. 16, 2nd sample ;//(DQ*SI) part from eqn. 16, 2nd sample ;-------------------code and scaling to preserve the energy of the constellation---------------------[ ;-------------------code and scaling to preserve the energy of the constellation-------mpy d15,d8,d8 ;//(1/sqrt(2)) x (scrambled output I), 1st sample mpy d15,d9,d9 ;//(1/sqrt(2)) x (scrambled output Q), 1st sample mpy d15,d10,d10 ;//(1/sqrt(2)) x (scrambled output I), 2nd sample mpy d15,d11,d11 ;//(1/sqrt(2)) x (scrambled output Q), 2nd sample ] [ asl d8,d8 ;//output I scaling factor change from 4 to 2, 1st ;//sample asl d9,d9 ;//output Q scaling factor change from 4 to 2, 1st ;//sample asl d10,d10 ;//output I scaling factor change from 4 to 2, 2nd ;//sample asl d11,d11 ;//output Q scaling factor change from 4 to 2, 2nd sample ] ;----------------------------end of code to preserve energy of constellation------------------[ move.4f (r0)+,d0:d1:d2:d3 move.4f (r4)+,d4:d5:d6:d7 ;//move 2 input IQ samples from memory to data ;//registers ;//move 2 scrambling IQ code samples from memory to ;//data registers ] falign loopstart1 mainloop [ moves.4f d8:d9:d10:d11,(r2)+ ;//move 2 complex scrambled IQ samples into memory ;//buffer Scrambling Code Generation for WCDMA on the StarCore™ SC140/SC1400 Cores, Rev. 1 Freescale Semiconductor 15 Results mpy d0,d4,d8 mpy d0,d5,d9 mpy d2,d6,d10 mpy d2,d7,d11 ] [ mac -d1,d5,d8 mac d1,d4,d9 mac -d3,d7,d10 mac d3,d6,d11 move.4f (r0)+,d0:d1:d2:d3 move.4f (r4)+,d4:d5:d6:d7 ;//(DI*SI) ;//(DI*SQ) ;//(DI*SI) ;//(DI*SQ) part part part part from from from from eqn. eqn. eqn. eqn. 16, 16, 16, 16, 1st 1st 2nd 2nd sample sample sample sample ;//(-(DQ*SQ)) part from eqn. 16, 1st sample ;//(DQ*SI) part from eqn. 16, 1st sample ;//(-(DQ*SQ)) part from eqn. 16, 2nd sample ;//(DQ*SI) part from eqn. 16, 2nd sample ;//move 2 input IQ samples from memory to data ;//registers ;//move 2 scrambling IQ code samples from memory to ;//data registers ] ;-------------------code and scaling to preserve the energy of the constellation--------------[ mpy d15,d8,d8 ;//(1/sqrt(2)) x (scrambled output I), 1st sample mpy d15,d9,d9 ;//(1/sqrt(2)) x (scrambled output Q), 1st sample mpy d15,d10,d10 ;//(1/sqrt(2)) x (scrambled output I), 2nd sample mpy d15,d11,d11 ;//(1/sqrt(2)) x (scrambled output Q), 2nd sample ] [ asl d8,d8 ;//output I scaling factor change from 4 to 2, 1st ;//sample asl d9,d9 ;//output Q scaling factor change from 4 to 2, 1st ;//sample asl d10,d10 ;//output I scaling factor change from 4 to 2, 2nd ;//sample asl d11,d11 ;//output Q scaling factor change from 4 to 2, 2nd sample ] ;----------------------------end of code to preserve energy of constellation------------------loopend1 moves.4f d8:d9:d10:d11,(r2)+ ;//move 2 complex scrambled IQ samples into memory ;//buffer [ pop pop ] [ pop pop ] rts r6 r7 d6 d7 endsec 4 Results The plots in Figure 4 and Figure 5 show the corresponding Matlab and StarCore DSP results for the complex scrambled signal. As these figures indicate, the StarCore DSP and the Matlab results agree. Scrambling Code Generation for WCDMA on the StarCore™ SC140/SC1400 Cores, Rev. 1 16 Freescale Semiconductor Results Real part of Scrambled Signal 2 Green − DSP output Blue −− Matlab output 1.5 value of chip in constellation 1 0.5 0 −0.5 −1 −1.5 −2 1.915 1.916 1.917 1.918 1.919 1.92 1.921 chip segment number 1.922 1.923 1.924 1.925 4 x 10 Figure 4. Real Part of the Complex Scrambled Signal (Chips 19150–19250) Imaginary part of Scrambled Signal 2 Green − DSP output Blue −− Matlab output 1.5 value of chip in constellation 1 0.5 0 −0.5 −1 −1.5 −2 1.915 1.916 1.917 1.918 1.919 1.92 1.921 chip segment number 1.922 1.923 1.924 1.925 4 x 10 Figure 5. Imaginary Part of Complex Scrambled Signal (Chips 19150–19250) In Figure 4 and Figure 5, the x-axis represents the number of the chip, and the y-axis represents the magnitude of each of the chips. The StarCore DSP output is scaled up by a factor of 2 to account for the scaling factors used by the DSP in an implementation of complex scrambling code. The complex scrambled signal obtained from the DSP implementation matches the Matlab result. Figure 6 shows the signal constellation for the I-Q/code multiplexed Scrambling Code Generation for WCDMA on the StarCore™ SC140/SC1400 Cores, Rev. 1 Freescale Semiconductor 17 Results signal before complex scrambling, and Figure 7 shows the signal constellation after the complex scrambling operations. The I-Q/code multiplexed signal with complex scrambling results in a rotated QPSK constellation. Figure 7 shows the resulting constellation achieved by both the Matlab and the StarCore DSP implementations. 2 1.5 1 0.5 0 −0.5 −1 −1.5 −2 −2 −1.5 −1 −0.5 0 0.5 1 1.5 2 Figure 6. QPSK Constellation Before Complex Scrambling 2 X (Blue) − DSP output 1.5 O (Green) − Matlab output 1 0.5 0 −0.5 −1 −1.5 −2 −2 −1.5 −1 −0.5 0 0.5 1 1.5 2 Figure 7. QPSK Constellation Map after Complex Scrambling Table 2 shows the assembly code results for PN code generation and formation of the pn_generation complex scrambling sequence function for one frame. The second row of the table shows the results for scrambling of an IQ/code multiplexed signal in the cmplx_scrambling function for one frame. Scrambling Code Generation for WCDMA on the StarCore™ SC140/SC1400 Cores, Rev. 1 18 Freescale Semiconductor References Table 2. Assembly Code Results Function pn_generation cmplx_scrambling 1 Code Size (Bytes) Cycles per Frame MIPS 330 153618 15.36 124 38411 3.84 NOTES: 1. If scaling to preserve the energy of the constellation before and after complex scrambling is included, it requires 7.6 MIPS with a code size of 180 bytes. 5 References [1] R. Prasad, “An Overview of CDMA Evolution Toward Wideband CDMA,” IEEE Communications Surveys, vol. 1, no. 1, Fourth Quarter 1998. [2] B. Sklar, DIGITAL COMMUNICATIONS Fundamentals and Applications. New Jersey: Prentice-Hall, Inc., 1988. [3] H Holma and A. Toskala, WCDMA for UMTS-Radio Access For Third Generation Mobile Communications. New York: John Wiley & Sons, Ltd., 2001. [4] 3GPP, “TS 25.213 V3.40 (2000-12): Spreading and Modulation (FDD),” Release 1999. Scrambling Code Generation for WCDMA on the StarCore™ SC140/SC1400 Cores, Rev. 1 Freescale Semiconductor 19 How to Reach Us: Home Page: www.freescale.com E-mail: [email protected] USA/Europe or Locations not listed: Freescale Semiconductor Technical Information Center, CH370 1300 N. Alma School Road Chandler, Arizona 85224 +1-800-521-6274 or +1-480-768-2130 [email protected] Europe, Middle East, and Africa: Freescale Halbleiter Deutschland GMBH Technical Information Center Schatzbogen 7 81829 München, Germany +44 1296 380 456 (English) +46 8 52200080 (English) +49 89 92103 559 (German) +33 1 69 35 48 48 (French) [email protected] Japan: Freescale Semiconductor Japan Ltd. Headquarters ARCO Tower 15F 1-8-1, Shimo-Meguro, Meguro-ku, Tokyo 153-0064, Japan 0120 191014 or +81 3 5437 9125 [email protected] Asia/Pacific: Freescale Semiconductor Hong Kong Ltd. Technical Information Center 2 Dai King Street Tai Po Industrial Estate Tai Po, N.T. Hong Kong +800 2666 8080 For Literature Requests Only: Freescale Semiconductor Literature Distribution Center P.O. Box 5405 Denver, Colorado 80217 1-800-441-2447 or 303-675-2140 Fax: 303-675-2150 [email protected] AN2254 Rev. 1 11/2004 Information in this document is provided solely to enable system and software implementers to use Freescale Semiconductor products. There are no express or implied copyright licenses granted hereunder to design or fabricate any integrated circuits or integrated circuits based on the information in this document. Freescale Semiconductor reserves the right to make changes without further notice to any products herein. Freescale Semiconductor makes no warranty, representation or guarantee regarding the suitability of its products for any particular purpose, nor does Freescale Semiconductor assume any liability arising out of the application or use of any product or circuit, and specifically disclaims any and all liability, including without limitation consequential or incidental damages. “Typical” parameters which may be provided in Freescale Semiconductor data sheets and/or specifications can and do vary in different applications and actual performance may vary over time. All operating parameters, including “Typicals” must be validated for each customer application by customer’s technical experts. Freescale Semiconductor does not convey any license under its patent rights nor the rights of others. Freescale Semiconductor products are not designed, intended, or authorized for use as components in systems intended for surgical implant into the body, or other applications intended to support or sustain life, or for any other application in which the failure of the Freescale Semiconductor product could create a situation where personal injury or death may occur. Should Buyer purchase or use Freescale Semiconductor products for any such unintended or unauthorized application, Buyer shall indemnify and hold Freescale Semiconductor and its officers, employees, subsidiaries, affiliates, and distributors harmless against all claims, costs, damages, and expenses, and reasonable attorney fees arising out of, directly or indirectly, any claim of personal injury or death associated with such unintended or unauthorized use, even if such claim alleges that Freescale Semiconductor was negligent regarding the design or manufacture of the part. Freescale™ and the Freescale logo are trademarks of Freescale Semiconductor, Inc. StarCore is a trademark of StarCore LLC. All other product or service names are the property of their respective owners. © Freescale Semiconductor, Inc. 2002, 2004.