WO2007102981A2 - Asymmetric control of high-speed bidirectional signaling - Google Patents

Publication number
WO2007102981A2
Authority
WO
WIPO (PCT)
Prior art keywords
data
slave device
master device
crc
information
Prior art date
Application number
PCT/US2007/004375
Other languages
French (fr)
Other versions
WO2007102981A3 (en
Inventor
Gerald R. Talbot
R. Stephen Polzin
Original Assignee
Advanced Micro Devices, Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Advanced Micro Devices, Inc. filed Critical Advanced Micro Devices, Inc.
Priority to DE112007000574T priority Critical patent/DE112007000574B4/en
Priority to GB0815491A priority patent/GB2448651B/en
Priority to CN2007800077529A priority patent/CN101416435B/en
Priority to JP2008558284A priority patent/JP2009529289A/en
Publication of WO2007102981A2 publication Critical patent/WO2007102981A2/en
Publication of WO2007102981A3 publication Critical patent/WO2007102981A3/en

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L1/00 Arrangements for detecting or preventing errors in the information received
    • H04L1/0001 Systems modifying transmission characteristics according to link quality, e.g. power backoff
    • H04L1/0023 Systems modifying transmission characteristics according to link quality, e.g. power backoff characterised by the signalling
    • H04L1/0026 Transmission of channel quality indication
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L1/00 Arrangements for detecting or preventing errors in the information received
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L1/00 Arrangements for detecting or preventing errors in the information received
    • H04L1/004 Arrangements for detecting or preventing errors in the information received by using forward error control
    • H04L1/0056 Systems characterized by the type of code used
    • H04L1/0061 Error detection codes
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L1/00 Arrangements for detecting or preventing errors in the information received
    • H04L1/20 Arrangements for detecting or preventing errors in the information received using signal quality detector
    • H04L1/205 Arrangements for detecting or preventing errors in the information received using signal quality detector jitter monitoring
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L1/00 Arrangements for detecting or preventing errors in the information received
    • H04L1/24 Testing correct operation
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L1/00 Arrangements for detecting or preventing errors in the information received
    • H04L1/24 Testing correct operation
    • H04L1/242 Testing correct operation by comparing a transmitted test signal with a locally generated replica
    • H04L1/243 Testing correct operation by comparing a transmitted test signal with a locally generated replica at the transmitter, using a loop-back
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L25/00 Baseband systems
    • H04L25/02 Details; arrangements for supplying electrical power along data transmission lines
    • H04L25/03 Shaping networks in transmitter or receiver, e.g. adaptive shaping networks
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L1/00 Arrangements for detecting or preventing errors in the information received
    • H04L2001/0092 Error control systems characterised by the topology of the transmission link
    • H04L2001/0094 Bus
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L25/00 Baseband systems
    • H04L25/02 Details; arrangements for supplying electrical power along data transmission lines
    • H04L25/03 Shaping networks in transmitter or receiver, e.g. adaptive shaping networks
    • H04L25/03006 Arrangements for removing intersymbol interference

Definitions

  • This invention relates to communication links and, more particularly, to communication over bidirectional links.
  • An example of such a system may be a memory system, where there may be a complex master device (e.g., memory controller) and simpler slave devices (e.g., memory devices).
  • the bidirectional data transfers would correspond to write data when transferring to the slave and read data when transferring from the slave.
  • a clock phase recovery function may be implemented in the receiver at each end of the bi-directional data bus.
  • the channel may be equalized to prevent data eye closure from the effect of inter-symbol interference (ISI).
  • ISI inter-symbol interference
  • links that have high data transfer rates may have a significant likelihood of bit errors occurring.
  • a means of error detection is typically implemented.
  • these functions may be conventionally implemented at both ends of the link. However, it may be desirable to simplify slave devices while maintaining control of the analog properties of the data waveforms that travel in both directions.
  • the system includes a slave device and a master device that is coupled to the slave device via a plurality of bidirectional data paths, for example.
  • the master device may control data transfer between the master device and the slave device. More particularly, the master device may adaptively modify transmit characteristics subsequent to adaptively modifying receiver characteristics based upon information received from the slave device via one or more unidirectional data paths.
  • the information may include CRC information that corresponds to data sent by the master device via the bidirectional data paths.
  • the master device may adaptively modify a phase alignment of a receiver sample clock of the master device dependent upon the CRC information.
  • the information may include data eye information that corresponds to an edge position of data signal transitions received by the slave device on the plurality of bidirectional data paths. The master device may adaptively modify a phase alignment of data transmitted by the master device via the plurality of bidirectional data paths depending upon the CRC information and the data eye information.
  • the master device may transmit a predetermined pattern to the slave device and adaptively modify data path equalization coefficients within the master device for each path of the plurality of bidirectional data paths based upon the corresponding CRC information and the data eye information.
  • FIG. 1 is a block diagram of one embodiment of a system including asymmetric control of bidirectional data transfer.
  • FIG. 2 is a diagram illustrating more detailed aspects of one embodiment of the slave device of FIG. 1.
  • FIG. 3 is a flow diagram describing the operation of the embodiments shown in FIG. 1 and FIG. 2.
  • FIG. 4 is a block diagram that illustrates additional details of the embodiment of the system of FIG. 1 including scrambling and de-scrambling logic.
  • FIG. 5 is a diagram illustrating an embodiment of an exemplary encoding across a byte group.
  • FIG. 6 is a diagram illustrating an embodiment of an encoding of a masked write operation.
  • FIG. 7 is a diagram of one embodiment of a single ended switching bi-directional signaling scheme.
  • System 10 includes a master controller 105 coupled to slave devices 110A through 110n via a plurality of signal paths and a connector 150.
  • the signal paths include bidirectional (bidir) data paths 114, command paths 116, and cyclic redundancy code (CRC) and eye statistic information paths 112.
  • slave device 110n is meant to illustrate that any number of slave devices may be used.
  • components that include a reference designator having a number and a letter may be referred to by the number only.
  • slave device 110A may be referred to as slave device 110 where appropriate.
  • master controller 105 includes a transmit unit 106, a receive unit 107, and a control unit 108 that is coupled to both transmit unit 106 and receive unit 107.
  • system 10 may be an example of a memory subsystem.
  • master controller 105 may be a memory controller and slave devices 110A-110n may be memory devices such as devices in the dynamic random access memory (DRAM) family of memory devices, for example.
  • connector 150 may be a connector such as may be found on a memory module that includes a plurality of memory devices such as slave devices 110.
  • command paths 116 may convey address and control information
  • the bidirectional data paths 114 may convey data in both directions.
  • CRC and eye stats paths 112 may convey CRC information from slave 110 to master controller 105. As described in greater detail below in conjunction with the description of FIG. 2, CRC and eye stats paths 112 may also convey data signal phase information generated by slave device 110. It is noted that in general, system 10 may be representative of any type of system that employs bidirectional data paths. In one embodiment, CRC and eye stats paths 112 may include two signal paths, although any number of signal paths may be used.
  • the bidirectional data paths 114 may include a number of eight-bit (byte-wide) data paths. For example, the full data path may be 64 bits wide, divided into byte-sized portions. However, the full data path may include any number of data bits and may be divided into different-sized portions.
  • master controller 105 may include control functionality that may dynamically and adaptively adjust the signal characteristics (e.g., phase and equalization, and voltage offset, etc.) of transmitted data to enable slave device 110 to correctly read the data based upon information received from slave device 110.
  • master controller 105 may adjust the internal receiver characteristics to enable master controller 105 to receive data sent by slave device 110.
  • master controller 105 may adjust the phase of its transmitted signal to align with the sample clock generated inside slave device 110, thus avoiding complex phase shifting circuits in the slave.
  • transmit unit 106 receives information from slave device 110 that may be used to adjust its transmit phase.
  • slave device 110 may include eye statistics and CRC generation logic 118 that may accumulate statistics about the edge position of data transitions using a phase detector (shown in FIG.
  • slave device 110 may send this information to master controller 105 for each received "bit lane" or data path.
  • a CRC code may be used to provide multiple bit error detection. More particularly, as shown in FIG. 2, to simplify the logic in the slave device and the reporting of errors to master controller 105, slave device 110 calculates a CRC based on either the data it is generating or the data that it is receiving. Accordingly, to transfer the CRC information back to master controller 105, one or more unidirectional CRC signal paths 112 may be used. As shown in FIG. 2, CRC generation unit 230 calculates the CRC based on its internal data and sends the CRC data back to master controller 105.
  • the CRC information may be calculated and sent in parallel with the data on a transfer from slave device 110 to master controller 105 so that the CRC may be available at the same time as the data block it is protecting when it reaches master controller 105.
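The text does not name the CRC polynomial or width, so the following is only a minimal sketch of the idea: the slave computes a CRC over the block it sent or received, and the master recomputes the CRC over the data it holds and compares. The 8-bit polynomial 0x07 and the helper names `crc8` and `master_check` are assumptions chosen for illustration.

```python
def crc8(data: bytes, poly: int = 0x07) -> int:
    """Bitwise CRC-8 (assumed polynomial x^8 + x^2 + x + 1, initial value 0)."""
    crc = 0
    for byte in data:
        crc ^= byte
        for _ in range(8):
            if crc & 0x80:
                crc = ((crc << 1) ^ poly) & 0xFF
            else:
                crc = (crc << 1) & 0xFF
    return crc


def master_check(block: bytes, crc_from_slave: int) -> bool:
    """Master side: recompute the CRC locally and compare with the slave's value."""
    return crc8(block) == crc_from_slave
```

Because only the slave generates the CRC and only the master compares, the slave needs no error-reporting logic beyond the generator itself.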
  • Slave device 110 includes logic 118 which includes a data input buffer 209 that is coupled to one signal path of bidir data paths 114 and to an input of a flip-flop (FF) 208.
  • FF flip-flops
  • the output of FF 208 is coupled to slave core logic 255.
  • a data out signal from slave core logic 255 is also coupled to an input of FF 206.
  • the output of FF 206 is coupled to a data output buffer which is coupled to the same signal path.
  • the input of FF 207 is also coupled to the signal path of bidir data paths 114.
  • the output of FF 207 is coupled to eye calculation unit 225.
  • the output of eye calculation unit 225 is coupled to one input of a multiplexer 250.
  • the data in and data out signals are also coupled to a CRC unit 230, the output of which is in turn coupled to the other input of multiplexer 250.
  • the output of multiplexer 250 is coupled to the input of FF 205.
  • the output of FF 205 is coupled to a buffer 211, which is coupled to one signal path of CRC and eye stats signal paths 112.
  • a slave sample clock signal is coupled to each clock input of FF 205-208.
  • FF 207 and FF 208 sample the analog voltage signal at the output of buffer 209 at the edge and center of the data signal, respectively. These two samples are the inputs to eye calculation unit 225, which may calculate various statistics about the received data eye.
  • eye calculation unit 225 may calculate various statistics about the received data eye.
  • every edge transition may be accumulated in a counter/register 226 within eye calculation unit 225.
  • early and late edges increment and decrement register 226, respectively, which may provide a scaled index for the phase of the transmit data.
  • additional statistics may also be accumulated to adjust transmitter equalization and input receiver offset adjustment of master controller 105.
  • the phase information accumulates for each data bit transition received. This phase information may be accumulated over a 16-bit burst on a per data path basis.
  • Register 226 may be a 4-bit counter that may be incremented or decremented depending on the transition being late or early.
  • Slave device 110 may send a single bit per data path back to master controller 105 per 16-bit burst to indicate whether the 4-bit counter value is positive or negative.
  • control unit 108 may further integrate this transition information to control the phase of the transmitted data. As such, transmit unit 106 may adjust or modify the phase of the transmitted data until slave device 110 is locked to the data.
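The edge/center sampling and the 4-bit counter described above can be sketched as follows. This is an illustrative model, not the circuit of FIG. 2: it assumes `edges[i]` is the edge sample taken between `centers[i-1]` and `centers[i]`, that an edge sample equal to the new bit means the data transition arrived early, and that the counter saturates at the signed 4-bit range. The function names are hypothetical.

```python
def accumulate_burst(centers, edges, lo=-8, hi=7):
    """Accumulate early/late votes over one burst for one data path.

    Early edges increment and late edges decrement a saturating counter
    (assumed signed 4-bit range).
    """
    counter = 0
    for i in range(1, len(centers)):
        prev_bit, cur_bit = centers[i - 1], centers[i]
        if prev_bit == cur_bit:
            continue  # no transition, so no phase information
        if edges[i] == cur_bit:
            counter = min(hi, counter + 1)  # transition seen before the edge sample: early
        else:
            counter = max(lo, counter - 1)  # edge sample still shows the old bit: late
    return counter


def sign_bit(counter):
    """Single bit reported to the master per burst: 1 if the counter is negative."""
    return 1 if counter < 0 else 0
```

The master would then integrate these per-burst sign bits over many bursts before moving its transmit phase, which keeps the slave-side logic to a counter and a comparator.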
  • receive unit 107 includes sample clock adjustment logic 103 that may include a bang-bang phase detector similar to the phase detector shown in FIG. 2. As such, when master controller 105 is receiving data from slave device 110, receive unit 107 may use the bang-bang phase detector to adjust its own local sample clock to more optimally receive data transmitted by slave device 110. This is described in more detail below in conjunction with the description of FIG. 3.
  • transmit unit 106 includes a phase and equalization unit 103, which employs a finite impulse response (FIR) filter having a plurality of coefficients to drive its output digital-to-analog converter stage.
  • FIR finite impulse response
  • the coefficients for the FIR filter may be trained or adaptively modified to the characteristics for each individual data path.
  • master controller 105 receives from slave device 110 error information that corresponds to the ISI that is seen at each input of slave device 110.
  • equalizer unit 103 within transmit unit 106 may also use the logic 118 within slave device 110 that samples data phase to determine how well it is equalizing the channel.
  • master controller 105 sends commands to slave device 110 that may cause slave device 110 to change its input receive threshold by some offset amount. This offset may be chosen such that on a correctly equalized channel, there may be approximately a 50% error rate, and deviation from this 50% error rate may indicate to master controller 105 an error in the equalization it is performing within transmit unit 106.
  • master controller 105 may provide a receive threshold value by supplying it directly to slave device 110 via another signal path in the interface.
  • master controller 105 may transmit blocks of 16 bits, and slave device 110 reports the value sampled for the 15th bit of this current 16-bit block. Master controller 105 may transmit random data for all of the data bits in this block except for the 15th bit, which may be a predetermined value chosen to match the offset value that slave device 110 is using within its receiver. Since master controller 105 knows what data values it is transmitting, it can use these data values on a per coefficient basis to adjust each coefficient's value depending on the sign of the error bit returned by slave device 110. The transmit coefficients are then trained using an algorithm such as the sign-sign LMS adaptation algorithm, for example. The number of coefficients to be trained may be a function of the length of the tail of the channel's pulse response.
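A single sign-sign LMS update can be sketched as below. The sketch assumes integer coefficient steps, that `data_bits[k]` is the transmitted bit aligned with tap `k` relative to the monitored 15th bit, and that `error_sign` is +1 or -1 derived from the error bit the slave returns. The sign convention and the name `sign_sign_lms_step` are assumptions, not details taken from the text.

```python
def sign_sign_lms_step(coeffs, data_bits, error_sign, mu=1):
    """One sign-sign LMS update: w[k] += mu * sign(error) * sign(x[k]).

    coeffs     -- current tap coefficients (integers in this sketch)
    data_bits  -- transmitted bits (0/1) aligned with each tap
    error_sign -- +1, -1, or 0 (no update), from the slave's error bit
    """
    updated = []
    for coeff, bit in zip(coeffs, data_bits):
        data_sign = 1 if bit else -1  # map logic 0/1 to -1/+1
        updated.append(coeff + mu * error_sign * data_sign)
    return updated
```

For example, `sign_sign_lms_step([0, 0, 0], [1, 0, 1], +1)` moves each tap one step in the direction of its data bit. Only signs are used, so the slave needs to return just one bit per burst and the master needs no multipliers.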
  • master controller 105 may use a four tap decision feedback equalizer (DFE) to correct a pulse response that includes nulls in the channel's transmission response that may include reflections due to the slave device stubs caused by the multi-drop data bus.
  • DFE tap decision feedback equalizer
  • FIG. 3 is a flow diagram describing the operation of the embodiments shown in FIG. 1 and FIG. 2. More particularly, as described above, the master controller may be configured to adaptively modify its transmit and receive characteristics such that it may transmit data that may be received correctly by the slave device, and it may correctly receive the data sent by the slave device.
  • the receive unit 107 of master controller 105 may be trained to receive at least the two data paths of the CRC paths 112. Accordingly, master controller 105 writes a known first training pattern to a predetermined location within slave device 110 (block 305). In response to the write, slave device 110 generates and sends a CRC based upon the received data via CRC paths 112. Master controller 105 receives the CRC and determines if the receive unit 107 is locked to the CRC data (block 310). If the master is not locked to the CRC data, control unit 108 may cause the phase and equalization for the receive unit 107 to be adjusted (block 311) and the write pattern to be resent (block 305).
  • master controller 105 determines that the receive unit 107 is locked to the CRC data (block 310)
  • master controller 105 attempts to train the transmit unit 106 to send data that the slave device 110 can correctly receive. More particularly, master controller 105 sends a second training pattern to slave device 110 (block 315).
  • the second training pattern may be a pattern that may not need the data paths to be equalized. For example, a pattern such as 11001100b may be used.
  • master controller 105 determines whether the slave device 110 is correctly locked to the write data (block 320). If the master controller 105 determines that the slave is not locked to the data, control unit 108 may cause the phase of the transmitted data to be adjusted, and the pattern to be resent (block 315).
  • master controller 105 may attempt to equalize the data paths of bidir data paths 114.
  • master controller 105 sends a third training pattern to slave device 110 (block 325) via bidir data paths 114.
  • the third pattern may be a more randomized pattern.
  • master controller 105 may determine if the coefficients of a multiple tap feed forward equalizer (FFE) are converged (block 330).
  • FFE tap feed forward equalizer
  • the phase alignment of the transmitted data may move during the adjustment of the coefficients, thus the phase may also be rechecked and adjusted using training pattern two (blocks 335 and 340).
  • master controller 105 may adapt the coefficients and track the phase iteratively until both the coefficients have converged, and the transmitted data phase is aligned (block 330).
  • master controller 105 transmits a fourth pattern to be stored to a storage location (not shown), for example, within the slave device 110 (block 345).
  • Master controller 105 performs a subsequent read of the fourth pattern from slave device 110. Since the pattern is known, control unit 108 may cause the phase and equalization of receive unit 107 to be adjusted to reliably receive the data sent from slave device 110 (block 350). If master controller 105 determines the receiver is not trained (block 355), master controller 105 may initiate further reads of the pattern data, and further adjustment may be made (block 350). Once the receive unit 107 has been trained, system 10 may begin normal operations, during which master controller 105 may perform reads and writes to slave device 110.
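The ordering of the four training stages of FIG. 3 can be sketched as a simple sequencer. Everything below is a hypothetical model: `FakeLink` stands in for real hardware and pretends each stage needs a fixed number of adjust-and-resend iterations, and the stage names are labels invented for this sketch.

```python
# Stage order follows the flow described above: lock the master receiver to the
# CRC paths, align transmit phase, equalize the transmit path, then train the
# master receiver on read data.
PHASES = ("rx_crc_lock", "tx_phase_lock", "tx_equalize", "rx_data_train")


class FakeLink:
    """Hypothetical link model: each stage needs N adjustments before it locks."""

    def __init__(self, adjustments_needed):
        self.remaining = dict(adjustments_needed)

    def attempt(self, phase):
        """Send the stage's pattern once; return True when the stage is locked."""
        if self.remaining[phase] > 0:
            self.remaining[phase] -= 1  # model one adjust-and-resend iteration
            return False
        return True


def train(link, limit=64):
    """Run the stages in order; return how many pattern sends each stage took."""
    attempts = {}
    for phase in PHASES:
        for n in range(1, limit + 1):
            if link.attempt(phase):
                attempts[phase] = n
                break
        else:
            raise RuntimeError(f"{phase} did not converge in {limit} attempts")
    return attempts
```

The sketch omits the inner phase recheck of blocks 335 and 340; in a fuller model the `tx_equalize` stage would alternate coefficient updates with a resend of the second training pattern.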
  • phase may drift.
  • the phase alignment may be continually checked by master controller 105 as long as there are reads and writes occurring and data being transferred on the data paths. However, large gaps in bus traffic may allow the phase to drift undetected. As such, if there is a gap in the reads and writes that exceeds a predetermined amount of time, master controller 105 may initiate dummy write and read cycles to keep the phase of the transmit unit 106 and receive unit 107 trained.
  • control unit 108 may measure the time elapsed between consecutive reads and between consecutive writes (block 362) using timers, for example. If the write timer expires (too much time elapses between two writes) (block 363), control unit 108 may cause one or more dummy write cycles to occur which includes writing the second training pattern to slave device 110. Based upon the CRC and eye statistics received, control unit 108 may cause the phase of transmit unit 106 to be adjusted (block 365).
  • control unit 108 may cause one or more dummy read cycles to occur which includes reading the fourth training pattern from slave device 110.
  • Control unit 108 may cause the phase of receive unit 107 to be adjusted (block 367). Once trained or checked, operation may proceed normally as in block 361.
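The idle-gap check of blocks 362 through 367 can be sketched as a pure function over timestamps. The time units, the limit values, and the returned action labels are assumptions for illustration; the text only says that timers may be used.

```python
def due_dummy_cycles(now, last_write, last_read, write_limit, read_limit):
    """Return the retraining actions that are due because of idle gaps.

    A dummy write resends the second training pattern (transmit phase check);
    a dummy read fetches the fourth training pattern (receive phase check).
    """
    due = []
    if now - last_write > write_limit:
        due.append("dummy_write_pattern2")
    if now - last_read > read_limit:
        due.append("dummy_read_pattern4")
    return due
```

Keeping separate write and read timers matters because a read-heavy workload keeps the receive phase tracked while the transmit phase can still drift, and vice versa.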
  • master controller 105 may initiate the dummy read and write cycles dependent upon factors other than elapsed timers. For example, master controller 105 may initiate retraining of the transmit unit 106 and receive unit 107 based upon error rates of received data or CRC bits.
  • master controller 105 may determine whether there is any bias in the error rate for a logic 1 versus a logic 0 as sampled in the 15th bit of a 16-bit burst. This bias may indicate an input offset in the receiver of slave device 110. Master controller 105 may compensate for the offset by shifting its transmitted waveform up or down. In one implementation, transmit unit 106 may correct up to +/-50 mV of a slave device receiver's input offset.
  • FIG. 4 shows a block diagram that illustrates one embodiment of the system of FIG. 1 including scrambling logic.
  • master controller 105 of FIG. 4 may include all the functionality of master controller 105 of FIG. 1.
  • slave device 110 of FIG. 4 may also include all the functionality of slave device 110 of FIG. 1.
  • that functionality is not duplicated here. Since the data to be transferred is not guaranteed to be random, there is a possibility that a pathologically bad data pattern may be transferred on a victim line and its significant aggressors. To avoid this causing an undesirable increase in the bit error rate of the link, the data on each path may be scrambled.
  • the data may be combined with an output of a pseudo random binary sequence (PRBS) using an exclusive-OR (XOR) function.
  • PRBS pseudo random binary sequence
  • XOR exclusive-OR
  • master controller 105 and slave device 110 may include identical PRBS generators which may be initialized during training. Depending on direction of data flow, one end of the link scrambles the data with the known PRBS and the other de-scrambles it with its local PRBS generator. Accordingly, master controller 105 includes a scrambler/descrambler 405 that includes a PRBS generator 410 that is coupled to an XOR block 415. Similarly, slave device 110 includes a scrambler/descrambler 430 that includes a PRBS generator 435 that is coupled to an XOR block 440.
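Because scrambling is an XOR with a sequence both ends can regenerate, the same routine scrambles and de-scrambles. The sketch below assumes a 7-bit LFSR with polynomial x^7 + x^6 + 1 (commonly called PRBS7); the text does not specify the generator, and the function names are illustrative.

```python
def prbs7(state):
    """Yield a PRBS7 bit stream (assumed polynomial x^7 + x^6 + 1).

    `state` is the non-zero 7-bit seed shared by both ends of the link.
    """
    state &= 0x7F
    while True:
        bit = ((state >> 6) ^ (state >> 5)) & 1
        state = ((state << 1) | bit) & 0x7F
        yield bit


def scramble(bits, seed):
    """XOR a list of data bits with the PRBS; applying it twice restores the data."""
    gen = prbs7(seed)
    return [b ^ next(gen) for b in bits]
```

Usage: the sender calls `scramble(data, seed)` and the receiver calls `scramble(received, seed)` with the same seed, so both generators must be initialized to the same state during training.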
  • each path may be scrambled with a different PRBS, thereby effectively smearing out, in the frequency domain, any pathologically bad crosstalk data patterns and allowing their effects to be evaluated statistically with normal distributions. Scrambling the data may also help to ensure that the data patterns that are generated are spectrally white in the frequency domain so that the LMS adaptation algorithm will not have any biased statistics for any of the tap coefficients that it may be adapting.
  • each of the PRBS generators 410 and 435 may be implemented from a master PRBS generator that includes multiple taps.
  • a different generator tap may be XORed with data from each data path to generate a non-overlapping PRBS per path.
  • the choice of taps may be made to maximize the difference between the sequences used on each path.
  • slave device 110 may be a memory device
  • data may be stored in the scrambled state, thus avoiding the cost of implementing scrambling in the memory devices.
  • a scrambling seed may be calculated from a hash of the storage address of the data, thus the data transferred over the bus may remain statistically random even for the case of frequently occurring sequential memory access patterns.
  • a hash of the address may be generated by pre-loading the scrambler PRBS with the address of the block to be accessed and then shifting the PRBS generator by some fixed number of stages, thereby randomizing the address to create a unique seed for each block of data transferred to and from the memory. It is noted that in a system that uses critical word order, additional care may be taken to ensure that the de-scrambling takes into account the wrapping that will occur when the first address access does not occur on a block boundary. In such cases, an appropriately wrapped scramble pattern may be computed for that block of data.
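The address-hash idea can be sketched by loading an LFSR with the block address and clocking it a fixed number of times. The 16-bit width, the polynomial x^16 + x^14 + x^13 + x^11 + 1, the shift count of 32, and the fallback value for an all-zero address are all assumptions made for this example.

```python
def seed_from_address(address, shifts=32):
    """Derive a scrambler seed by pre-loading an LFSR with the block address.

    Uses a 16-bit Fibonacci LFSR (assumed polynomial x^16 + x^14 + x^13 + x^11 + 1).
    An all-zero state would never change, so a hypothetical fallback constant
    replaces it.
    """
    state = (address & 0xFFFF) or 0xACE1
    for _ in range(shifts):
        bit = (state ^ (state >> 2) ^ (state >> 3) ^ (state >> 5)) & 1
        state = (state >> 1) | (bit << 15)
    return state
```

Each LFSR step is invertible on non-zero states, so distinct non-zero 16-bit addresses always map to distinct seeds, while sequential addresses end up with seeds that no longer look sequential.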
  • the data may be grouped in blocks over which the CRC is calculated.
  • the correctness of the data in the block cannot be established until all of the data block and the CRC have been received.
  • this increases the latency for the first portion of the block which might be a critical word for forward progress in the system.
  • the critical word may be additionally protected by including an additional in-line error code that is inserted between the critical word and the rest of the block.
  • the additional error detection information may be implemented by repeating the critical word (e.g., byte 0) at the beginning of the block.
  • master controller 105 may validate that each bit is identical between the two copies and substantially lower the error rate for the critical word, thus allowing the critical word to be treated as valid before the complete CRC for the block has been received.
  • slave device 110 may send the critical word during the first two beats or bit times of the read block.
  • two CRC signal paths protect 8 data paths using a burst of 16 bits.
  • byte 3 is output on the CRC paths during the first four beats of the read block. This also allows the read CRC to be calculated after the delivery of the critical word. On a write, the CRC calculation takes longer and so 12 bit times are allowed for this calculation and the overlap on a write-to-read turnaround occurs in the bus turnaround delay. It is noted that the CRC does not change direction between reads and writes. As shown, this encoding may provide some additional bandwidth that may be used to output write data eye statistics. It is also noted that when there is more than one slave device, it may be necessary to insert a gap (i.e., bubble) when master controller 105 changes the slave device to which it is reading or writing.
  • a gap i.e., bubble
  • the skew between data paths in a byte group can be controlled in the design of the slave device 110, master controller 105 and the PCB routing.
  • the maximum skew between bits within a byte group may be controlled to within 0.1 UI at 3.2 Gb/s, which allows the different data paths in the group to be treated the same for equalization.
  • the noise generated by simultaneously switching output (SSO) may be aligned accurately at the received data eye edge, thereby minimizing its impact on eye closure.
  • byte writes may be implemented using masked writes.
  • FIG. 6 shows a diagram illustrating an embodiment of a masked write operation. The additional bandwidth for the mask write is accomplished by preceding the write data payload with a short burst of eight bits that contains the mask information.
  • Since master controller 105 will not generally know the contents of the memory that is not to be modified, a bit error in the mask information could cause data in memory to be incorrectly modified. To overcome this problem, master controller 105 transmits the mask information for four consecutive bit times and slave device 110 uses the third bit time to sample the data. More particularly, in one embodiment, the third bit time is guaranteed to be stable by device specification. However, it is noted that in other embodiments, mask information may be transmitted for other numbers of consecutive bit times. As such, other bit times may be guaranteed to be stable and slave device 110 may use the stable bit time to sample the data.
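The repeated-mask scheme can be sketched from the slave's point of view. The sketch assumes a byte mask in which bit `i` set means "write payload byte `i`", and that the four mask bit times arrive as a list; both the mask polarity and the name `masked_write` are assumptions, since the text does not give the encoding.

```python
def masked_write(memory, offset, payload, mask_beats, sample_beat=2):
    """Apply a masked write using the mask sampled at the third of four bit times.

    memory      -- bytearray modelling the slave's storage
    payload     -- bytes to write starting at `offset`
    mask_beats  -- the mask as seen on each of the four consecutive bit times
    sample_beat -- index of the bit time assumed stable (2 = third bit time)
    """
    if len(mask_beats) != 4:
        raise ValueError("expected the mask repeated for four bit times")
    mask = mask_beats[sample_beat]
    for i, byte in enumerate(payload):
        if (mask >> i) & 1:  # assumed polarity: 1 = modify this byte
            memory[offset + i] = byte
    return memory
```

Sampling only the bit time that is guaranteed stable means a glitch on the first or last repetition of the mask cannot corrupt bytes that were meant to stay unmodified.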
  • FIG. 7 is a diagram of one embodiment of a system employing a single ended switching bi-directional signaling scheme.
  • single ended switching signaling schemes such as the system of FIG. 1
  • minimization of pad I/O capacitance and control of on-die VDD are critical design parameters.
  • a simple voltage mode transmitter and receiver terminator combined with an on-die shunt regulator may provide a good compromise between power consumption and a simplified I/O structure.
  • system 700 includes an integrated circuit (IC) device 701 coupled to an IC device 711 via a bus 714 including a plurality of bidirectional signal paths.
  • IC 701 includes a driver circuit coupled to the signal path.
  • exemplary driver circuits are shown as device I/O 601, 602, and 603, although only the detailed aspects of device I/O 601 are shown for simplicity.
  • device I/O 601 embodies a complementary metal oxide semiconductor (CMOS) driver circuit that includes transistors T1 and T2.
  • CMOS complementary metal oxide semiconductor
  • the tx[0] input is coupled to the input of the driver circuit.
  • the output is coupled to the signal path of bus 714 and to the receive signal path rx[0].
  • driver circuits 602 and 603 may include substantially the same circuit as device I/O 601.
  • device I/O 601 is a voltage mode driver having a voltage swing of approximately VDD/2 into a transmission line (e.g., signal path of 714) terminated by a substantially equivalent driver (e.g., device I/O 613), which drives the die-pad to ground, thereby terminating the receiver to ground.
  • a substantially equivalent driver (e.g., device I/O 613)
  • the supply voltages for the two IC devices (chips) may be different voltages such as 1.5 V and 1.0 V, for example.
  • IC 701 also includes a shunt regulator circuit, designated clamp 604 that detects transitions in the on-die VDD that go above the average DC voltage.
  • clamp 604 detects transitions in the on-die VDD that go above the average DC voltage.
  • a power delivery loop including inductors L1, L2 and voltage source V1 is shown. It is noted that the inductors L1 and L2 are representative of a model of intrinsic inductances that may be due to packaging, lead length, etc., and voltage source V1 is representative of a model of the VDD supply.
  • the amplifier A1 turns on transistor T3, which sustains the current flow through the inductive power delivery loop (e.g., L1, V1, L2). This effectively clamps the positive voltage overshoot.
  • This clamping in combination with transmitting scrambled data may reduce the on-die VDD noise by more than 50% in some embodiments.
  • the clamp transistor T3 may be turned on for a short period prior to enabling the transmitter switching. This may pull current from the on-die VDD decoupling capacitor C1, which may in turn start increasing the current flowing through the power delivery loop.
  • IC device 711 includes substantially identical circuits and therefore operates substantially the same as IC device 701. Accordingly, the operation of IC device 711 is not described further for brevity.
  • the average power of the switching I/O may not be significantly altered by this technique, whereas a driver that draws constant current from the supply for driving a 1 or 0 could consume approximately twice the average switching power.
  • IC 701 and IC 711 shown in FIG. 7 may be illustrative of any two devices that may employ a bi-directional signaling scheme between two chips.
  • IC 701 and IC 711 may be representative of the master controller 105 and slave device 110 depicted in the embodiments described above in conjunction with the descriptions of FIG. 1 through FIG. 6.
  • This invention may generally be applicable to microprocessors.

Abstract

A system (10) including asymmetric control of high-speed bidirectional signaling includes a slave device (110) and a master device (105) that is coupled to the slave device via a plurality of bidirectional data paths (114), for example. The master device may control data transfer between the master device and the slave device. More particularly, the master device may adaptively modify transmit characteristics subsequent to adaptively modifying receiver characteristics based upon information received from the slave device via one or more unidirectional data paths (112).

Description

ASYMMETRIC CONTROL OF HIGH-SPEED BIDIRECTIONAL SIGNALING
Technical Field
[0001] This invention relates to communication links and, more particularly, to communication over bidirectional links.
Background Art
[0002] Many systems employ conventional high-speed bidirectional signaling schemes in which the work of controlling amplitude and phase of the signals sent over a channel may be divided equally between each end of a communication link. In such systems, the control of the link may be symmetric such that the transmitter and the receiver at each end of the link may include very similar functionality.
[0003] An example of such a system may be a memory system, where there may be a complex master device (e.g., memory controller) and simpler slave devices (e.g., memory devices). The bidirectional data transfers would correspond to write data when transferring to the slave and read data when transferring from the slave.
[0004] To allow transfers to occur at high data rates, a clock phase recovery function may be implemented in the receiver at each end of the bi-directional data bus. For channels with significant high frequency loss or reflections, the channel may be equalized to prevent data eye closure from the effect of inter-symbol interference (ISI). In addition, links that have high data transfer rates may have a significant likelihood of bit errors occurring. Thus, a means of error detection is typically implemented. As mentioned above, these functions may be conventionally implemented at both ends of the link. However, it may be desirable to simplify slave devices while maintaining control of the analog properties of the data waveforms that travel in both directions.
Disclosure of Invention
[0005] Various embodiments of a system including asymmetric control of high-speed bidirectional signaling are disclosed. In one embodiment, the system includes a slave device and a master device that is coupled to the slave device via a plurality of bidirectional data paths, for example. The master device may control data transfer between the master device and the slave device. More particularly, the master device may adaptively modify transmit characteristics subsequent to adaptively modifying receiver characteristics based upon information received from the slave device via one or more unidirectional data paths.
[0006] In one implementation, the information may include CRC information that corresponds to data sent by the master device via the bidirectional data paths. In addition, the master device may adaptively modify a phase alignment of a receiver sample clock of the master device dependent upon the CRC information.
[0007] In another implementation, the information may include data eye information that corresponds to an edge position of data signal transitions received by the slave device on the plurality of bidirectional data paths. The master device may adaptively modify a phase alignment of data transmitted by the master device via the plurality of bidirectional data paths depending upon the CRC information and the data eye information.
[0008] In still other implementations, the master device may transmit a predetermined pattern to the slave device and adaptively modify data path equalization coefficients within the master device for each path of the plurality of bidirectional data paths based upon the corresponding CRC information and the data eye information.
Brief Description of the Drawings
[0009] FIG. 1 is a block diagram of one embodiment of a system including asymmetric control of bidirectional data transfer.
[0010] FIG. 2 is a diagram illustrating more detailed aspects of one embodiment of the slave device of FIG. 1.
[0011] FIG. 3 is a flow diagram describing the operation of the embodiments shown in FIG. 1 and FIG. 2.
[0012] FIG. 4 is a block diagram that illustrates additional details of the embodiment of the system of FIG. 1 including scrambling and de-scrambling logic.
[0013] FIG. 5 is a diagram illustrating an embodiment of an exemplary encoding across a byte group.
[0014] FIG. 6 is a diagram illustrating an embodiment of an encoding of a masked write operation.
[0015] FIG. 7 is a diagram of one embodiment of a single ended switching bi-directional signaling scheme.
[0016] While the invention is susceptible to various modifications and alternative forms, specific embodiments thereof are shown by way of example in the drawings and will herein be described in detail. It is noted that the word "may" is used throughout this application in a permissive sense (i.e., having the potential to, being able to), not a mandatory sense (i.e., must).
Mode(s) of Carrying Out the Invention
[0017] Turning now to FIG. 1, a block diagram of one embodiment of a system including asymmetric control of bidirectional data transfer is shown. System 10 includes a master controller 105 coupled to slave devices 110A through 110n via a plurality of signal paths and a connector 150. As shown, the signal paths include bidirectional (bidir) data paths 114, command paths 116, and cyclic redundancy code (CRC) and eye statistic information paths 112. It is noted that slave device 110n is meant to illustrate that any number of slave devices may be used. It is also noted that components that include a reference designator having a number and a letter may be referred to by the number only. For example, slave device 110A may be referred to as slave device 110 where appropriate.
[0018] In the illustrated embodiment, master controller 105 includes a transmit unit 106, a receive unit 107, and a control unit 108 that is coupled to both transmit unit 106 and receive unit 107. In one implementation, system 10 may be an example of a memory subsystem. As such, master controller 105 may be a memory controller and slave devices 110A-110n may be memory devices such as devices in the dynamic random access memory (DRAM) family of memory devices, for example. As such, connector 150 may be a connector such as may be found on a memory module that includes a plurality of memory devices such as slave devices 110. In addition, command paths 116 may convey address and control information, and the bidirectional data paths 114 may convey data in both directions. CRC and eye stats paths 112 may convey CRC information from slave 110 to master controller 105. As described in greater detail below in conjunction with the description of FIG. 2, CRC and eye stats paths 112 may also convey data signal phase information generated by slave device 110. It is noted that in general, system 10 may be representative of any type of system that employs bidirectional data paths.
[0019] In one embodiment, CRC and eye stats paths 112 may include two signal paths, although any number of signal paths may be used. The bidirectional data paths 114 may include a number of eight-bit (byte-wide) data paths. For example, the full data path may be 64 bits wide, but the data path may be divided into byte-sized portions. In other embodiments, the full data path may include any number of data bits and may be divided into different sized portions.
[0020] As mentioned above, many conventional systems control high-speed bidirectional communication by implementing control functions such as clock phase recovery, channel equalization, and error detection, for example, in both communicating devices. However, as described in greater detail below, slave device 110 may be simplified. As such, master controller 105 may include control functionality that may dynamically and adaptively adjust the signal characteristics (e.g., phase, equalization, voltage offset, etc.) of transmitted data to enable slave device 110 to correctly read the data based upon information received from slave device 110. In addition, master controller 105 may adjust the internal receiver characteristics to enable master controller 105 to receive data sent by slave device 110.
[0021] More particularly, at high data rates the uncertainties of delays in the transmission path for different signals in a bus require a per bit phase adjustment of a sample clock of the receiver. To avoid employing this circuitry in slave device 110, master controller 105 may adjust the phase of its transmitted signal to align with the sample clock generated inside the slave device, thus avoiding complex phase shifting circuits in the slave.
[0022] Thus, transmit unit 106 receives information from slave device 110 that may be used to adjust its transmit phase. As such, slave device 110 may include eye statistics and CRC generation logic 118 that may accumulate statistics about the edge position of data transitions using a phase detector (shown in FIG. 2) that is sometimes referred to as a "bang-bang phase detector." The early/late signals from this detector may be integrated and sent to master controller 105 via the uni-directional CRC signal paths 112. In one implementation, slave device 110 may send this information to master controller 105 for each received "bit lane" or data path.
[0023] In addition, at high data rates the probability of a slave device 110 or master controller 105 receiving a bit error is significant. Accordingly, it may be necessary to protect transfers with an error detection code that will robustly detect multiple bit errors within a protected block. When an error is detected on the link in either direction master controller 105 may correct the error by retrying the operation.
[0024] In one embodiment, a CRC code may be used to provide multiple bit error detection. More particularly, as shown in FIG. 2, to simplify the logic in the slave device and reporting of errors to master controller 105, slave device 110 calculates a CRC based on either the data it is generating or the data that it is receiving. Accordingly, to transfer the CRC information back to master controller 105 one or more uni-directional CRC signal paths 112 may be used. As shown in FIG. 2, CRC generation unit 230 calculates the CRC based on its internal data and sends the CRC data back to master controller 105.
[0025] In one embodiment, the CRC information may be calculated and sent in parallel with the data on a transfer from slave device 110 to master controller 105 so that the CRC may be available at the same time as the data block it is protecting when it reaches master controller 105.
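The CRC check on a write can be illustrated with a short sketch. The description does not name a CRC polynomial or width, so the 8-bit polynomial 0x07 used here is purely an assumption for illustration, as are the function names `crc8` and `write_needs_retry`. The sketch shows the master comparing a locally computed CRC against the CRC returned by the slave device and deciding whether to retry.

```python
def crc8(data, poly=0x07, init=0x00):
    """Bitwise CRC-8, MSB first. The polynomial is an assumed example,
    not one taken from the description."""
    crc = init
    for byte in data:
        crc ^= byte
        for _ in range(8):
            if crc & 0x80:
                crc = ((crc << 1) ^ poly) & 0xFF
            else:
                crc = (crc << 1) & 0xFF
    return crc


def write_needs_retry(sent_block, crc_from_slave):
    """Master-side check: the slave computes a CRC over what it received,
    and a mismatch with the master's own CRC means the write is retried."""
    return crc8(sent_block) != crc_from_slave


block = bytes([0xDE, 0xAD, 0xBE, 0xEF])
good = crc8(block)                      # what an error-free slave would return
corrupted = bytes([0xDE, 0xAD, 0xBE, 0xEE])
print(write_needs_retry(block, good))             # False
print(write_needs_retry(block, crc8(corrupted)))  # True
```

Because only the slave-computed CRC travels back, the slave needs no comparison or retry logic of its own; the decision stays in the master.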
[0026] It is noted that on a transfer from master controller 105 to slave device 110 there will be some inherent delay from slave device 110 receiving the data internally, calculating the CRC and sending it to the master controller 105. However, as described further below in conjunction with the description of FIG. 5, this delay may be overlapped with a gap that is inserted on the data bus on a write-to-read and read-to-write bus transition.
[0027] Referring to FIG. 2, a diagram illustrating more detailed aspects of one embodiment of the slave device of FIG. 1 is shown. Slave device 110 includes logic 118 which includes a data input buffer 209 that is coupled to one signal path of bidir data paths 114 and to an input of a flip-flop (FF) 208. The output of FF 208 is coupled to slave core logic 255. A data out signal from slave core logic 255 is also coupled to an input of FF 206. The output of FF 206 is coupled to a data output buffer which is coupled to the same signal path. The input of FF 207 is also coupled to the signal path of bidir data paths 114. The output of FF 207 is coupled to eye calculation unit 225. The output of eye calculation unit 225 is coupled to one input of a multiplexer 250. The data in and data out signals are also coupled to a CRC unit 230, the output of which is in turn coupled to the other input of multiplexer 250. The output of multiplexer 250 is coupled to the input of FF 205. The output of FF 205 is coupled to a buffer 211, which is coupled to one signal path of CRC and eye stats signal paths 112. A slave sample clock signal is coupled to each clock input of FF 205-208.
[0028] In the illustrated embodiment, FF 207 and FF 208 sample the analog voltage signal at the output of buffer 209 at the edge and center of the data signal, respectively. These two samples are the inputs to eye calculation unit 225, which may calculate various statistics about the received data eye. To adjust the phase of the data transmitted by master controller 105, every edge transition may be accumulated in a counter/register 226 within eye calculation unit 225. In one implementation, early and late edges increment and decrement register 226, respectively, which may provide a scaled index for the phase of the transmit data. As described further below, additional statistics may also be accumulated to adjust transmitter equalization and input receiver offset adjustment of master controller 105.
[0029] In one embodiment, the phase information accumulates for each data bit transition received. This phase information may be accumulated over a 16-bit burst on a per data path basis. Register 226 may be a 4-bit counter that may be incremented or decremented depending on the transition being late or early. Slave device 110 may send a single bit per data path back to master controller 105 per 16-bit burst to indicate whether the 4-bit counter value is positive or negative. After receiving the transition error information, control unit 108 may further integrate this transition information to control the phase of the transmitted data. As such, transmit unit 106 may adjust or modify the phase of the transmitted data until slave device 110 is locked to the data.
[0030] As mentioned above, master controller 105 may adjust receiver circuitry to correctly receive the data sent by slave device 110. In one embodiment, receive unit 107 includes sample clock adjustment logic 103 that may include a bang-bang phase detector similar to the phase detector shown in FIG. 2. As such, when master controller 105 is receiving data from slave device 110, receive unit 107 may use the bang-bang phase detector to adjust its own local sample clock to more optimally receive data transmitted by slave device 110. This is described in more detail below in conjunction with the description of FIG. 3.
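The per-burst phase accumulation of paragraph [0029] can be sketched as follows. Several details are assumptions made only for illustration: the counter saturates rather than wraps, a zero count is reported as positive, the master-side integrator uses a threshold of four sign bits per phase step, and a positive step is taken to mean "delay the transmit phase". The names `burst_sign_bit` and `TransmitPhaseIntegrator` are hypothetical.

```python
def burst_sign_bit(votes, width=4):
    """Slave side: accumulate early (+1) / late (-1) edge votes for one data
    path over one burst and return the single sign bit sent to the master.
    Saturation and the treatment of zero are assumptions."""
    lo = -(1 << (width - 1))        # -8 for a 4-bit counter
    hi = (1 << (width - 1)) - 1     # +7 for a 4-bit counter
    count = 0
    for v in votes:
        count = max(lo, min(hi, count + v))
    return 1 if count >= 0 else 0


class TransmitPhaseIntegrator:
    """Master side: further integrate the per-burst sign bits and move the
    transmit phase by one step once enough bursts agree (assumed policy)."""

    def __init__(self, threshold=4):
        self.threshold = threshold
        self.acc = 0
        self.phase_steps = 0

    def update(self, sign_bit):
        self.acc += 1 if sign_bit else -1
        if self.acc >= self.threshold:
            self.phase_steps += 1
            self.acc = 0
        elif self.acc <= -self.threshold:
            self.phase_steps -= 1
            self.acc = 0
        return self.phase_steps


integ = TransmitPhaseIntegrator()
for _ in range(4):
    integ.update(burst_sign_bit([+1, +1, -1, +1]))  # mostly early edges
print(integ.phase_steps)  # 1
```

Sending only one bit per burst keeps the return path narrow, while the second integration stage in the master filters out bursts whose sign is dominated by noise.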
[0031] In addition to phase adjustment of the transmitted data and adjustment of the receive sample clock within receive unit 107, it may be necessary to equalize data path channels. More particularly, due to the high frequency loss of the channel and/or reflections that may be caused by multiple slave devices 110 attached to the same channel (as shown in FIG. 1), data paths may need to be equalized. Equalization of the data paths may optimize the eye opening in the receiver logic of slave device 110. Accordingly, in one embodiment, transmit unit 106 includes a phase and equalization unit 103, which employs a finite impulse response (FIR) filter having a plurality of coefficients to drive its output digital-to-analog converter stage.
[0032] In one embodiment, the coefficients for the FIR filter may be trained or adaptively modified to the characteristics for each individual data path. To allow the transmit unit 106 to adjust its equalization, master controller 105 receives from slave device 110 error information that corresponds to the ISI that is seen at each input of slave device 110.
[0033] To simplify or reduce the logic needed within slave device 110, equalizer unit 103 within transmit unit 106 may also use the logic 118 within slave device 110 that samples data phase to determine how well it is equalizing the channel. To enable slave device 110 to generate an error value of the received eye amplitude, master controller 105 sends commands to slave device 110 that may cause slave device 110 to change its input receive threshold by some offset amount. This offset may be chosen such that on a correctly equalized channel, there may be approximately a 50% error rate, and deviation from this 50% error rate may indicate to master controller 105 an error in the equalization it is performing within transmit unit 106. Alternatively as shown in FIG. 2, master controller 105 may provide a receive threshold value by supplying it directly to slave device 110 via another signal path in the interface.
[0034] In one implementation, master controller 105 may transmit blocks of 16 bits, and slave device 110 reports the value sampled for the 15th bit of this current 16-bit block. Master controller 105 may transmit random data for all of the data bits in this block except for the 15th bit, which may be a predetermined value chosen to match the offset value that slave device 110 is using within its receiver. Since master controller 105 knows what data values it is transmitting, it can use these data values on a per coefficient basis to adjust each coefficient's value depending on the sign of the error bit returned by slave device 110. The transmit coefficients are then trained using an algorithm such as the sign-sign LMS adaptation algorithm, for example. The number of coefficients to be trained may be a function of the length of the tail of the channel's pulse response.
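One coefficient update of the sign-sign LMS adaptation mentioned in paragraph [0034] can be sketched as below. The sketch assumes that transmitted bits are mapped to symbols of +1 and -1, that the error bit returned by the slave device has already been converted to a sign of +1 or -1, and that the error is defined as target minus received so that each coefficient moves by `mu * error_sign * sign(symbol)`. The step size `mu`, the integer coefficient units, and the function names are hypothetical.

```python
def sign(x):
    """Return -1, 0, or +1 according to the sign of x."""
    return (x > 0) - (x < 0)


def sign_sign_lms_step(coeffs, symbols, error_sign, mu=1):
    """One sign-sign LMS update.

    coeffs[k]  : current value of tap k (arbitrary integer DAC units, assumed)
    symbols[k] : transmitted symbol (+1/-1) that tap k multiplied when the
                 observed 15th bit of the burst was sent
    error_sign : +1 or -1, derived from the error bit reported by the slave
    """
    return [c + mu * error_sign * sign(s) for c, s in zip(coeffs, symbols)]


taps = [0, 0, 0]
taps = sign_sign_lms_step(taps, symbols=[+1, -1, +1], error_sign=+1)
print(taps)  # [1, -1, 1]
taps = sign_sign_lms_step(taps, symbols=[+1, +1, -1], error_sign=-1)
print(taps)  # [0, -2, 2]
```

Only the signs of the error and of the data are used, which is why a single returned bit per burst is enough for the master to train every tap.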
[0035] In one implementation, master controller 105 may use a four tap decision feedback equalizer (DFE) to correct a pulse response that includes nulls in the channel's transmission response that may include reflections due to the slave device stubs caused by the multi-drop data bus. However, it is contemplated that in other embodiments other numbers of taps may be used to cover, for example, any reflections that may be caused by a round trip of the channel.
[0036] FIG. 3 is a flow diagram describing the operation of the embodiments shown in FIG. 1 and FIG. 2. More particularly, as described above, the master controller may be configured to adaptively modify its transmit and receive characteristics such that it may transmit data that may be received correctly by the slave device, and it may correctly receive the data sent by the slave device.
[0037] Referring collectively to FIG. 1 through FIG. 3 and beginning in block 300 of FIG. 3, after a reset or power-on condition, the receive unit 107 of master controller 105 may be trained to receive at least the two data paths of the CRC paths 112. Accordingly, master controller 105 writes a known first training pattern to a predetermined location within slave device 110 (block 305). In response to the write, slave device 110 generates and sends a CRC based upon the received data via CRC paths 112. Master controller 105 receives the CRC and determines if the receive unit 107 is locked to the CRC data (block 310). If the master is not locked to the CRC data, control unit 108 may cause the phase and equalization for the receive unit 103 to be adjusted (block 311) and the write pattern to be resent (block 305).
[0038] When master controller 105 determines that the receive unit 107 is locked to the CRC data (block 310), master controller 105 attempts to train the transmit unit 106 to send data that the slave device 110 can correctly receive. More particularly, master controller 105 sends a second training pattern to slave device 110 (block 315). In one implementation, the second training pattern may be a pattern that may not need the data paths to be equalized. For example, a pattern such as 11001100b may be used. Based upon the CRC data and edge statistics received from slave device 110, master controller 105 determines whether the slave device 110 is correctly locked to the write data (block 320). If the master controller 105 determines that the slave is not locked to the data, control unit 108 may cause the phase of the transmitted data to be adjusted, and the pattern to be resent (block 315).
[0039] If however, the CRC and eye statistic information indicates that the slave device 110 is locked to the transmitted data (block 320), master controller 105 may attempt to equalize the data paths of bidir data paths 114. In one embodiment, master controller 105 sends a third training pattern to slave device 110 (block 325) via bidir data paths 114. In one implementation, the third pattern may be a more randomized pattern. Once again, based upon received CRC data, master controller 105 may determine if the coefficients of a multiple tap feed forward equalizer (FFE) are converged (block 330). In addition, the phase alignment of the transmitted data may move during the adjustment of the coefficients, thus the phase may also be rechecked and adjusted using training pattern two (blocks 335 and 340). Thus as shown, master controller 105 may adapt the coefficients and track the phase iteratively until both the coefficients have converged, and the transmitted data phase is aligned (block 330).
[0040] Once the transmit data phase has been aligned and the transmit data paths have been equalized, the characteristics of receive unit 107 may be adjusted and trained to reliably receive data on the bidir data paths 114. Accordingly, master controller 105 transmits a fourth pattern to be stored to a storage location (not shown), for example, within the slave device 110 (block 345). Master controller 105 performs a subsequent read of the fourth pattern from slave device 110. Since the pattern is known, control unit 108 may cause the phase and equalization of receive unit 106 to be adjusted to reliably receive the data sent from slave device 110 (block 350). If master controller 105 determines the receiver is not trained (block 355), master controller 105 may initiate further reads of the pattern data, and further adjustment may be made (block 350). Once the receive unit 106 has been trained, system 10 may begin normal operations, during which master controller 105 may perform reads and writes to slave device 110.
[0041] Proceeding to block 361, during normal operation of system 10, the phase may drift. The phase alignment may be continually checked by master controller 105 as long as there are reads and writes occurring and data being transferred on the data paths. However, large gaps in bus traffic may allow the phase to drift undetected. As such, if there is a gap in the reads and writes that exceeds a predetermined amount of time, master controller 105 may initiate dummy write and read cycles to keep the phase of the transmit unit 106 and receive units 107 trained.
[0042] More particularly, in one embodiment, control unit 108 may measure the time elapsed between consecutive reads and between consecutive writes (block 362) using timers, for example. If the write timer expires (too much time elapses between two writes) (block 363), control unit 108 may cause one or more dummy write cycles to occur which includes writing the second training pattern to slave device 110. Based upon the CRC and eye statistics received, control unit 108 may cause the phase of transmit unit 106 to be adjusted (block 365).
[0043] Similarly, if the read timer expires (too much time elapses between two reads) (block 366), control unit 108 may cause one or more dummy read cycles to occur which includes reading the fourth training pattern from slave device 110. Control unit 108 may cause the phase of receive unit 107 to be adjusted (block 367). Once trained or checked, operation may proceed normally as in block 361. It is noted that in other embodiments, master controller 105 may initiate the dummy read and write cycles dependent upon factors other than elapsed timers. For example, master controller 105 may initiate retraining of the transmit unit 106 and receive unit 107 based upon error rates of received data or CRC bits.
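The idle-timer behavior of blocks 362 through 367 can be sketched as a small bookkeeping class. This is only an illustration: the class name `IdleRetrainTimers`, the single shared `max_idle` limit, the time units, and the returned action strings are all hypothetical, and the actual phase adjustment that follows each dummy cycle is not modeled.

```python
class IdleRetrainTimers:
    """Track the time since the last read and the last write and report
    which dummy cycles are due (hypothetical sketch)."""

    def __init__(self, max_idle):
        self.max_idle = max_idle
        self.last = {"write": 0.0, "read": 0.0}

    def record(self, kind, now):
        """Note that a real or dummy 'read' or 'write' happened at time now."""
        self.last[kind] = now

    def due(self, now):
        """Return the kinds of traffic whose idle time exceeds the limit."""
        return [k for k in ("write", "read") if now - self.last[k] > self.max_idle]

    def service(self, now):
        """Issue dummy cycles for every expired timer and restart it."""
        actions = []
        for kind in self.due(now):
            pattern = "second" if kind == "write" else "fourth"
            actions.append(f"dummy {kind} using {pattern} training pattern")
            self.record(kind, now)
        return actions


timers = IdleRetrainTimers(max_idle=10)
timers.record("write", now=5)
print(timers.service(now=12))  # ['dummy read using fourth training pattern']
print(timers.service(now=16))  # ['dummy write using second training pattern']
```

Keeping separate timers for reads and writes mirrors the description, since write traffic keeps the transmit phase checked while read traffic keeps the receive sample clock checked.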
Slave Device Input Offset Correction
[0044] By changing the polarity of the offset used in the receiver of slave device 110, master controller 105 may determine whether there is any bias in the error rate for a logic 1 versus a logic 0 as sampled in the 15th bit of a 16-bit burst. This bias may indicate an input offset in the receiver of slave device 110. Master controller 105 may compensate for the offset by shifting its transmitted waveform up or down. In one implementation, transmit unit 106 may correct up to +/-50 mV of a slave device receiver's input offset.
Data Scrambling
[0045] When transferring data at high data rates via a lossy channel, some data patterns have a higher probability of generating bit errors than others. In addition, most channels of interest consist of multiple parallel wires which can exhibit significant crosstalk between the data paths in the bus. This crosstalk can be generated by coupling between parallel conductors on a printed circuit board, coupling in via arrays, connectors, packages, sockets, and the like. Similarly, in the case of a single ended switching system, crosstalk may be a result of multiple outputs switching simultaneously.
[0046] Referring to FIG. 4, a block diagram that illustrates one embodiment of the system of FIG. 1 including scrambling logic is shown. It is noted that in addition to the functionality described below, master controller 105 of FIG. 4 may include all the functionality of master controller 105 of FIG. 1. Likewise, slave device 110 of FIG. 4 may also include all the functionality of slave device 110 of FIG. 1. However, for simplicity, that functionality is not duplicated here. Since the data to be transferred is not guaranteed to be random there is a possibility that a pathologically bad data pattern may be transferred on a victim line and its significant aggressors. To avoid this causing an undesirable increase in the bit error rate of the link, the data on each path may be scrambled. In one implementation, the data may be combined with an output of a pseudo random binary sequence (PRBS) using an exclusive-OR (XOR) function.
[0047] In one embodiment, master controller 105 and slave device 110 may include identical PRBS generators which may be initialized during training. Depending on direction of data flow, one end of the link scrambles the data with the known PRBS and the other de-scrambles it with its local PRBS generator. Accordingly, master controller 105 includes a scrambler/descrambler 405 that includes a PRBS generator 410 that is coupled to an XOR block 415. Similarly, slave device 110 includes a scrambler/descrambler 430 that includes a PRBS generator 435 that is coupled to an XOR block 440.
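The XOR scrambling performed by the PRBS generator and XOR block at each end of the link can be sketched as follows. The description does not specify the PRBS polynomial, so the 7-bit sequence x^7 + x^6 + 1 used here is an assumption chosen only because it is a commonly used test sequence; the function names and the seed value are likewise hypothetical.

```python
def prbs7_bits(seed, n):
    """Return n bits of a PRBS7 sequence (x^7 + x^6 + 1) starting from a
    nonzero 7-bit seed. The polynomial is an assumed example."""
    state = seed & 0x7F
    if state == 0:
        raise ValueError("PRBS seed must be nonzero")
    out = []
    for _ in range(n):
        new_bit = ((state >> 6) ^ (state >> 5)) & 1
        out.append(new_bit)
        state = ((state << 1) | new_bit) & 0x7F
    return out


def scramble(bits, seed):
    """XOR a list of data bits with the PRBS. Applying the same function
    with the same seed de-scrambles, since XOR is its own inverse."""
    return [b ^ p for b, p in zip(bits, prbs7_bits(seed, len(bits)))]


data = [0] * 16                     # a pathologically static pattern
on_wire = scramble(data, seed=0x5A)
recovered = scramble(on_wire, seed=0x5A)
print(recovered == data)            # True
print(on_wire != data)              # True
```

Both ends only need to agree on the generator and its seed, which is consistent with the generators being initialized during training.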
[0048] To avoid correlation between data paths, each path may be scrambled with a different PRBS, thereby effectively smearing out, in the frequency domain, any pathologically bad crosstalk data patterns and allowing their effects to be evaluated statistically with normal distributions. Scrambling the data may also help to ensure that the data patterns that are generated are spectrally white in the frequency domain so that the LMS adaptation algorithm will not have any biased statistics for any of the tap coefficients that it may be adapting.
[0049] In one embodiment, to minimize the implementation cost, each of the PRBS generators 410 and 435 may be implemented from a master PRBS generator that includes multiple taps. A different generator tap may be XORed with data from each data path to generate a non-overlapping PRBS per path. The choice of taps may be made to maximize the difference between the sequences used on each path.
[0050] In embodiments in which slave device 110 may be a memory device, data may be stored in the scrambled state, thus avoiding the cost of implementing scrambling in the memory devices. To ensure that master controller 105 can decode the scrambled data, a scrambling seed may be calculated from a hash of the storage address of the data, thus the data transferred over the bus may remain statistically random even for the case of frequently occurring sequential memory access patterns.
[0051] In one implementation, a hash of the address may be generated by pre-loading the scrambler PRBS with the address of the block to be accessed and then shifting the PRBS generator by some fixed number of stages, thereby randomizing the address to create a unique seed for each block of data transferred to and from the memory. It is noted that in a system that uses critical word order, additional care may be taken to ensure that the de-scrambling takes into account the wrapping that will occur when the first address access does not occur on a block boundary. In such cases, an appropriately wrapped scramble pattern may be computed for that block of data.
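The address-derived seed can be sketched by pre-loading a shift register with the block address and clocking it a fixed number of times. The 16-bit register width, the feedback taps (x^16 + x^14 + x^13 + x^11 + 1), the shift count of 16, and the forcing of the top bit to keep the register nonzero are all assumptions made for illustration; the names `lfsr16_step` and `seed_from_address` are hypothetical.

```python
def lfsr16_step(state):
    """Advance a 16-bit Fibonacci LFSR by one stage (assumed taps)."""
    bit = (state ^ (state >> 2) ^ (state >> 3) ^ (state >> 5)) & 1
    return (state >> 1) | (bit << 15)


def seed_from_address(block_address, shifts=16):
    """Pre-load the register with the block address, then shift it a fixed
    number of stages to obtain the scrambling seed for that block.
    The top bit is forced to 1 so that address 0 does not produce the
    all-zero lock-up state (an assumption of this sketch, which therefore
    only distinguishes 15 bits of block address)."""
    state = (block_address & 0x7FFF) | 0x8000
    for _ in range(shifts):
        state = lfsr16_step(state)
    return state


seeds = [seed_from_address(a) for a in range(4)]
print(len(set(seeds)) == 4)                           # True
print(seed_from_address(7) == seed_from_address(7))   # True
```

Because each shift of the register is reversible, distinct pre-loaded addresses always yield distinct seeds, while sequential addresses no longer produce sequential-looking seeds.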
[0052] It is also noted that when storing scrambled data in memory devices, access to memory device control registers would not naturally be scrambled. As such, it may be desirable to ensure that the control information transfers also behave as random data by appropriately defining the sense of the memory device control register bit values to minimize the likelihood of the pathologically bad data patterns. For example, the memory device control register bits may be changed such that a logic one may mean one thing in one bit location while it might mean the opposite in another location. Accordingly, the bits having the same logic value may be distributed to appear more random.
Slave to Master Latency Reduction
[0053| To get adequate error coverage from the CRC while minimizing impact on bus efficiency, the data may be grouped in blocks over which the CRC is calculated. On a transfer from slave device 110 to master controller 105, the correctness of the data in the block cannot be established until all of the data block and the CRC have been received. However, this increases the latency for the first portion of the block which might be a critical word for forward progress in the system.
[0054] Referring to FIG. 5, a diagram illustrating an exemplary encoding across a byte group is shown. In one embodiment, the critical word may be additionally protected by including an additional in-line error code that is inserted between the critical word and the rest of the block. For example, as shown in FIG. 5, the additional error detection information may be implemented by repeating the critical word (e.g., byte 0) at the beginning of the block. By sending the critical word twice, master controller 105 may validate that each bit is identical between the two copies and substantially lower the error rate for the critical word, thus allowing the critical word to be treated as valid before the complete CRC for the block has been received. Said in another way, during a read operation, slave device 110 may send the critical word during the first two beats or bit times of the read block.
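The early validation of the repeated critical word can be sketched as follows. The sketch assumes the two copies arrive as the first two entries of the received block; the function names and the list-of-bytes representation are illustrative assumptions rather than part of the described embodiment.

```python
def validate_critical_word(first_copy: int, second_copy: int):
    """Return the critical word if the two copies agree bit for bit, else None."""
    if first_copy ^ second_copy:   # any differing bit marks the early copy as unusable
        return None
    return first_copy


def read_block(beats: list[int]):
    """Split a received block into the early critical word and the remaining data.

    beats[0] and beats[1] are assumed to carry the two copies of the critical word.
    The remaining data is still subject to the block CRC check once the CRC arrives.
    """
    early_word = validate_critical_word(beats[0], beats[1])
    return early_word, beats[2:]
```

If bit errors on the two copies are independent with per-bit probability p, the same bit must be corrupted in both copies for an error to pass this comparison, which happens with probability of roughly p squared per bit. A mismatch does not identify which copy is wrong, so in this sketch the master would simply wait for the block CRC.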
[0055] In the embodiment shown in FIG. 5, two CRC signal paths protect 8 data paths using a burst of 16 bits. To allow room for two copies of the critical word, byte 3 is output on the CRC paths during the first four beats of the read block. This also allows the read CRC to be calculated after the delivery of the critical word. On a write, the CRC calculation takes longer and so 12 bit times are allowed for this calculation, and the overlap on a write-to-read turnaround occurs in the bus turnaround delay. It is noted that the CRC does not change direction between reads and writes. As shown, this encoding may provide some additional bandwidth that may be used to output write data eye statistics. It is also noted that when there is more than one slave device, it may be necessary to insert a gap (i.e., bubble) when master controller 105 changes the slave device to which it is reading or writing.
Skew Management Across A Byte Group
[0056] To reduce the complexity in slave device 110 and master controller 105, the skew between data paths in a byte group can be controlled in the design of the slave device 110, master controller 105 and the PCB routing. In one embodiment, the maximum skew between bits within a byte group may be controlled to within 0.1 UI at 3.2 Gb/s, which allows the different data paths in the group to be treated the same for equalization.
[0057] By controlling the skew in this manner, the noise generated by simultaneously switching output (SSO) may be aligned accurately at the received data eye edge, thereby minimizing its impact on eye closure. However, this may not provide any relief for SSO effects on the master controller receive CRC data while it is transmitting, and so additional burden may be placed on the master to correctly receive the resulting smaller data eye. However, this may be mitigated by the master controller having an external reference voltage for its input receivers that is internally terminated and referenced to signal ground of the CRC receive bits. By doing so, the SSO noise may be converted to common mode noise and thus rejected by the input receiver for the CRC bits.
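The skew limit of 0.1 UI at 3.2 Gb/s stated in paragraph [0056] can be converted to an absolute time as sketched below. The helper names are assumptions made for the example; the arithmetic itself follows directly from the definition of a unit interval as one bit time.

```python
def unit_interval_ps(rate_gbps: float) -> float:
    """One unit interval (bit time) in picoseconds for a given per-pin data rate."""
    return 1000.0 / rate_gbps


def skew_budget_ps(rate_gbps: float, fraction_ui: float = 0.1) -> float:
    """Maximum bit-to-bit skew within a byte group, in picoseconds."""
    return fraction_ui * unit_interval_ps(rate_gbps)


def within_budget(arrival_times_ps: list[float], rate_gbps: float) -> bool:
    """Check that the spread of arrival times in a byte group stays within the budget."""
    spread = max(arrival_times_ps) - min(arrival_times_ps)
    return spread <= skew_budget_ps(rate_gbps)
```

At 3.2 Gb/s one unit interval is 312.5 ps, so the 0.1 UI limit corresponds to about 31.25 ps of skew across the design of both devices and the PCB routing combined.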
Byte Mode Writes to Memory
[0058] In some memory applications, it may be desirable to support a byte write feature between a memory controller and a memory device to avoid a read/modify/write operation on a burst-oriented transaction. As such, in embodiments in which master controller 105 is a memory controller and slave device 110 is a memory device, byte writes may be implemented using masked writes. Referring to FIG. 6, a diagram illustrating an embodiment of a masked write operation is shown. The additional bandwidth for the masked write is obtained by preceding the write data payload with a short burst of eight bits that contains the mask information.
[0059] Since master controller 105 will not generally know the contents of the memory that is not to be modified, a bit error in the mask information could cause data in memory to be incorrectly modified. To overcome this problem, master controller 105 transmits the mask information for four consecutive bit times and slave device 110 uses the third bit time to sample the data. More particularly, in one embodiment, the third bit time is guaranteed to be stable by device specification. However, it is noted that in other embodiments, mask information may be transmitted for other numbers of consecutive bit times. As such, other bit times may be guaranteed to be stable and slave device 110 may use the stable bit time to sample the data.
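The mask transfer of paragraph [0059] can be sketched as follows, with the mask held for four bit times and sampled at the third. The mask polarity (a set bit enables the write of the corresponding byte), the one-mask-bit-per-byte mapping, and all names are assumptions made for illustration; FIG. 6 is not reproduced here.

```python
MASK_BIT_TIMES = 4   # mask is driven for four consecutive bit times
SAMPLE_INDEX = 2     # zero-based index of the third bit time


def drive_mask(mask: int) -> list[int]:
    """Master side: hold the same 8-bit mask value on the bus for four bit times."""
    return [mask & 0xFF] * MASK_BIT_TIMES


def sample_mask(bit_times: list[int]) -> int:
    """Slave side: use only the third bit time, which is taken to be stable."""
    return bit_times[SAMPLE_INDEX]


def apply_masked_write(memory: list[int], payload: list[int], mask: int) -> list[int]:
    """Write payload byte i only where mask bit i is set (assumed polarity)."""
    return [
        new if (mask >> i) & 1 else old
        for i, (old, new) in enumerate(zip(memory, payload))
    ]
```

Because only the third bit time is sampled, disturbances confined to the first or last bit time of the mask burst do not alter the mask seen by the slave device in this sketch.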
Single Ended Switching I/O Topology
[0060] FIG. 7 is a diagram of one embodiment of a system employing a single ended switching bi-directional signaling scheme. In the case of single ended switching signaling schemes such as the system of FIG. 1, minimization of pad I/O capacitance and control of on-die VDD are critical design parameters. A simple voltage mode transmitter and receiver terminator combined with an on-die shunt regulator may provide a good compromise between power consumption and a simplified I/O structure.
[0061] Accordingly, system 700 includes an integrated circuit (IC) device 701 coupled to an IC device 711 via a bus 714 including a plurality of bidirectional signal paths. For each of the bidirectional signal paths, IC 701 includes a driver circuit coupled to the signal path. As shown in FIG. 7, exemplary driver circuits are shown as device I/O 601, 602, and 603, although only the detailed aspects of device I/O 601 are shown for simplicity. As such, device I/O 601 embodies a complementary metal oxide semiconductor (CMOS) driver circuit that includes transistors T1 and T2. The tx[0] input is coupled to the input of the driver circuit. The output is coupled to the signal path of bus 714 and to the receive signal path rx[0]. It is noted that driver circuits 602 and 603 may include substantially the same circuit as device I/O 601.
[0062] In the illustrated embodiment, device I/O 601 is a voltage mode driver having a voltage swing of approximately VDD/2 into a transmission line (e.g., signal path of 714) terminated by a substantially equivalent driver (e.g., device I/O 613), which drives the die-pad to ground, thereby terminating the receiver to ground. It is noted that in the illustrated embodiment, the supply voltages for the two IC devices (chips) may be different voltages such as 1.5 V and 1.0 V, for example.
[0063] IC 701 also includes a shunt regulator circuit, designated clamp 604, that detects transitions in the on-die VDD that go above the average DC voltage. In addition, a power delivery loop including inductors L1, L2 and voltage source V1 is shown. It is noted that the inductors L1 and L2 are representative of a model of intrinsic inductances that may be due to packaging, lead length, etc., and voltage source V1 is representative of a model of the VDD supply.
[0064] When the die VDD transitions go above the average DC voltage, the amplifier A1 turns on transistor T3, which sustains the current flow through the inductive power delivery loop (e.g., L1, V1, L2). This effectively clamps the positive voltage overshoot. This clamping, in combination with transmitting scrambled data (as described above), may reduce the on-die VDD noise by more than 50% in some embodiments. To minimize noise during bus turnaround (e.g., write-to-read and read-to-write transitions), the clamp transistor T3 may be turned on for a short period prior to enabling the transmitter switching. This may pull current from the on-die VDD decoupling capacitor C1, which may in turn start increasing the current flowing through the power delivery loop. The clamp transistor T3 may be turned off prior to the first bit being transmitted, which may cause an inductive ring of the on-die VDD, which may cause the shunt regulator 604 to clamp, thereby ensuring a negligible on-die VDD drop as the transmitter starts up.
[0065] As shown, IC device 711 includes substantially identical circuits and therefore operates substantially the same as IC device 701. Accordingly, the operation of IC device 711 is not described further for brevity.
[0066] It is noted that the average power of the switching I/O may not be significantly altered by this technique, whereas a driver that draws constant current from the supply for driving a 1 or 0 could consume approximately twice the average switching power.
[0067] It is noted that the IC 701 and IC 711 shown in FIG. 7 may be illustrative of any two devices that may employ a bi-directional signaling scheme between two chips. However in one embodiment, IC 701 and IC 711 may be representative of the master controller 105 and slave device 110 depicted in the embodiments described above in conjunction with the descriptions of FIG. 1 through FIG. 6.
[0068] Although the embodiments above have been described in considerable detail, numerous variations and modifications will become apparent to those skilled in the art once the above disclosure is fully appreciated. It is intended that the following claims be interpreted to embrace all such variations and modifications.
Industrial Applicability
[0069] This invention may generally be applicable to microprocessors.

Claims

WHAT IS CLAIMED IS:
1. A system (10) comprising: a slave device (110); and a master device (105) coupled to the slave device and configured to control data transfer between the master device and the slave device; wherein the master device is configured to adaptively modify transmit characteristics subsequent to adaptively modifying receiver characteristics based upon information received from the slave device.
2. The system as recited in claim 1, wherein the master device is configured to adaptively modify a phase alignment of a receiver sample clock of the master device dependent upon the information received from the slave device, wherein the information received from the slave device includes cyclic redundancy code (CRC) information sent via one or more unidirectional CRC data paths (112), and wherein the CRC information corresponds to data sent by the master device via a plurality of bidirectional data paths (114).
3. The system as recited in claim 2, wherein the information received from the slave device includes data eye information sent via the one or more unidirectional CRC data paths, wherein the data eye information corresponds to an edge position of data signal transitions received by the slave device on the plurality of bidirectional data paths.
4. The system as recited in claim 2, wherein the master device is configured to adaptively modify a phase alignment of data transmitted by the master device via the plurality of bidirectional data paths depending upon the CRC information and the data eye information.
5. The system as recited in claim 3, wherein the master device is configured to transmit a predetermined pattern to the slave device and to adaptively modify data path equalization coefficients within the master device for each path of the plurality of bidirectional data paths based upon the corresponding CRC information and the data eye information.
6. A method comprising: a master device (105) controlling data transfer between the master device and a slave device (110); and the master device adaptively modifying transmit characteristics subsequent to adaptively modifying receiver characteristics based upon information received from the slave device.
7. The method as recited in claim 6, further comprising the master device adaptively modifying a phase alignment of a receiver sample clock of the master device dependent upon the information received from the slave device, wherein the information received from the slave device includes cyclic redundancy code (CRC) information sent via one or more unidirectional CRC data paths (112), wherein the CRC information corresponds to data sent by the master device via a plurality of bidirectional data paths (114).
8. The method as recited in claim 17, wherein the information received from the slave device includes data eye information sent via the one or more unidirectional CRC data paths, wherein the data eye information corresponds to an edge position of data signal transitions received by the slave device on the plurality of bidirectional data paths.
9. The method as recited in claim 19, further comprising the master device adaptively modifying a phase alignment of data transmitted by the master device via the plurality of bidirectional data paths dependent upon the CRC information and the data eye information.
10. The method as recited in claim 20, further comprising the master device transmitting a predetermined pattern to the slave device and adaptively modifying data path equalization coefficients within the master device for each path of the plurality of bidirectional data paths based upon the corresponding CRC information and the data eye information.
PCT/US2007/004375 2006-03-06 2007-02-20 Asymmetric control of high-speed bidirectional signaling WO2007102981A2 (en)

Priority Applications (4)

Application Number Priority Date Filing Date Title
DE112007000574T DE112007000574B4 (en) 2006-03-06 2007-02-20 System and method for asymmetric control of bidirectional high speed signal routing
GB0815491A GB2448651B (en) 2006-03-06 2007-02-20 Asymmetric control of high-speed bidirectional signalling
CN2007800077529A CN101416435B (en) 2006-03-06 2007-02-20 Asymmetric control of high-speed bidirectional signaling
JP2008558284A JP2009529289A (en) 2006-03-06 2007-02-20 Asymmetric control of high-speed bidirectional signal transmission

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US11/368,785 US7729465B2 (en) 2006-03-06 2006-03-06 Asymmetric control of high-speed bidirectional signaling
US11/368,785 2006-03-06

Publications (2)

Publication Number Publication Date
WO2007102981A2 true WO2007102981A2 (en) 2007-09-13
WO2007102981A3 WO2007102981A3 (en) 2007-11-29

Family

ID=38330441

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2007/004375 WO2007102981A2 (en) 2006-03-06 2007-02-20 Asymmetric control of high-speed bidirectional signaling

Country Status (8)

Country Link
US (1) US7729465B2 (en)
JP (1) JP2009529289A (en)
KR (1) KR20080100843A (en)
CN (1) CN101416435B (en)
DE (1) DE112007000574B4 (en)
GB (1) GB2448651B (en)
TW (1) TWI421699B (en)
WO (1) WO2007102981A2 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9910112B2 (en) 2013-07-12 2018-03-06 Toshiba Medical Systems Corporation Noise suppression for MRI signals directly sampled and digitized in imaging room

Families Citing this family (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7783954B2 (en) * 2006-09-11 2010-08-24 Globalfoundries Inc. System for controlling high-speed bidirectional communication
US7721160B2 (en) * 2006-09-11 2010-05-18 Advanced Micro Devices, Inc. System for protecting data during high-speed bidirectional communication between a master device and a slave device
US8131903B2 (en) * 2007-04-30 2012-03-06 Hewlett-Packard Development Company, L.P. Multi-channel memory connection system and method
US20090097401A1 (en) * 2007-10-12 2009-04-16 Wael William Diab Method and system for configurable data rate thresholds for energy efficient ethernet
US7991098B2 (en) 2007-10-31 2011-08-02 Micron Technology, Inc. Method and apparatus for training the reference voltage level and data sample timing in a receiver
US20090259786A1 (en) * 2008-04-10 2009-10-15 Chu-Ming Lin Data transfer system and method for host-slave interface with automatic status report
US8713330B1 (en) * 2008-10-30 2014-04-29 Apple Inc. Data scrambling in memory devices
WO2010093529A2 (en) * 2009-02-12 2010-08-19 Rambus Inc. Memory interface with reduced read-write turnaround delay
JP2011130008A (en) * 2009-12-15 2011-06-30 Hitachi-Lg Data Storage Inc Data input/output device
TWI435596B (en) * 2010-07-06 2014-04-21 Realtek Semiconductor Corp Master/slave decision device and master/slave decision method applied to network device
CN102868567B (en) * 2011-07-05 2015-05-20 瑞昱半导体股份有限公司 Primary and secondary judgment device and primary and secondary judgment method applied in network device
US8495440B2 (en) 2011-08-30 2013-07-23 Advanced Micro Devices, Inc. Fully programmable parallel PRBS generator
US10122392B2 (en) * 2016-08-18 2018-11-06 Advanced Micro Devices, Inc. Active equalizing negative resistance amplifier for bi-directional bandwidth extension
US10896723B2 (en) * 2019-04-30 2021-01-19 Ambient Scientific Inc. Signal communication circuit implementing receiver and transmitter circuits

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20010038674A1 (en) * 1997-07-31 2001-11-08 Francois Trans Means and method for a synchronous network communications system
US6975585B1 (en) * 2000-07-27 2005-12-13 Conexant Systems, Inc. Slotted synchronous frequency division multiplexing for multi-drop networks
US20060034358A1 (en) * 2004-08-16 2006-02-16 Hitoshi Okamura Methods and transmitters for loop-back adaptive pre-emphasis data transmission

Family Cites Families (26)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4763249A (en) * 1983-09-22 1988-08-09 Digital Equipment Corporation Bus device for use in a computer system having a synchronous bus
JP2570845B2 (en) * 1988-05-27 1997-01-16 セイコーエプソン株式会社 Information processing device
EP0619547A1 (en) * 1993-04-05 1994-10-12 Motorola, Inc. A method of requesting data and apparatus therefor
JP2738340B2 (en) * 1995-05-11 1998-04-08 日本電気株式会社 Multi-access communication method
US6029250A (en) * 1998-09-09 2000-02-22 Micron Technology, Inc. Method and apparatus for adaptively adjusting the timing offset between a clock signal and digital signals transmitted coincident with that clock signal, and memory device and system using same
US6434650B1 (en) * 1998-10-21 2002-08-13 Intel Corporation Apparatus and method for multiplexing bi-directional data onto a low pin count bus between a host CPU and co-processor
US6374360B1 (en) 1998-12-11 2002-04-16 Micron Technology, Inc. Method and apparatus for bit-to-bit timing correction of a high speed memory bus
US6467012B1 (en) * 1999-07-08 2002-10-15 International Business Machines Corporation Method and apparatus using a distributed system structure to support bus-based cache-coherence protocols for symmetric multiprocessors
US6839393B1 (en) * 1999-07-14 2005-01-04 Rambus Inc. Apparatus and method for controlling a master/slave system via master device synchronization
WO2001006696A1 (en) 1999-07-16 2001-01-25 Conexant Systems, Inc. Apparatus and method for servo-controlled self-centering phase detector
US6321282B1 (en) 1999-10-19 2001-11-20 Rambus Inc. Apparatus and method for topography dependent signaling
US6643787B1 (en) 1999-10-19 2003-11-04 Rambus Inc. Bus system optimization
US6898726B1 (en) * 2000-11-15 2005-05-24 Micron Technology, Inc. Memory system that sets a predetermined phase relationship between read and write clock signals at a bus midpoint for a plurality of spaced device locations
US6516282B2 (en) 2001-04-19 2003-02-04 Ge Medical Systems Global Technology Company Predictive thermal control used with a vacuum enclosed coil assembly of a magnetic resonance imaging device
JP2003050738A (en) 2001-08-03 2003-02-21 Elpida Memory Inc Calibration method and memory system
US6877103B2 (en) 2001-10-05 2005-04-05 Via Technologies, Inc. Bus interface timing adjustment device, method and application chip
US7362800B1 (en) 2002-07-12 2008-04-22 Rambus Inc. Auto-configured equalizer
JP2004213438A (en) * 2003-01-07 2004-07-29 Matsushita Electric Ind Co Ltd Data transfer circuit
US20040268190A1 (en) 2003-05-19 2004-12-30 International Business Machines Corporation Adjusting parameters of a serial link
US7165153B2 (en) * 2003-06-04 2007-01-16 Intel Corporation Memory channel with unidirectional links
US7234070B2 (en) 2003-10-27 2007-06-19 Micron Technology, Inc. System and method for using a learning sequence to establish communications on a high-speed nonsynchronous interface in the absence of clock forwarding
US7516029B2 (en) 2004-06-09 2009-04-07 Rambus, Inc. Communication channel calibration using feedback
US7305574B2 (en) 2004-10-29 2007-12-04 International Business Machines Corporation System, method and storage medium for bus calibration in a memory subsystem
US7346795B2 (en) 2004-12-31 2008-03-18 Intel Corporation Delaying lanes in order to align all lanes crossing between two clock domains
US7590173B2 (en) 2005-06-30 2009-09-15 Intel Corporation System and method for performing adaptive phase equalization
US7370247B2 (en) 2005-09-28 2008-05-06 Intel Corporation Dynamic offset compensation based on false transitions

Also Published As

Publication number Publication date
DE112007000574B4 (en) 2013-11-28
GB2448651A (en) 2008-10-22
KR20080100843A (en) 2008-11-19
US20070208819A1 (en) 2007-09-06
JP2009529289A (en) 2009-08-13
WO2007102981A3 (en) 2007-11-29
GB2448651B (en) 2010-11-17
GB0815491D0 (en) 2008-10-01
DE112007000574T5 (en) 2009-01-22
TW200801961A (en) 2008-01-01
CN101416435B (en) 2012-04-04
TWI421699B (en) 2014-01-01
US7729465B2 (en) 2010-06-01
CN101416435A (en) 2009-04-22

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application
DPE1 Request for preliminary examination filed after expiration of 19th month from priority date (pct application filed from 20040101)
WWE Wipo information: entry into national phase. Ref document number: 815491; Country of ref document: GB. Ref document number: 0815491; Country of ref document: GB. Ref document number: 0815491.6; Country of ref document: GB
WWE Wipo information: entry into national phase. Ref document number: 200780007752.9; Country of ref document: CN
WWE Wipo information: entry into national phase. Ref document number: 2008558284; Country of ref document: JP
WWE Wipo information: entry into national phase. Ref document number: 1020087024453; Country of ref document: KR
RET De translation (de og part 6b). Ref document number: 112007000574; Country of ref document: DE; Date of ref document: 20090122; Kind code of ref document: P
122 Ep: pct application non-entry in european phase. Ref document number: 07751155; Country of ref document: EP; Kind code of ref document: A2