MITEL PDSP16488AMAACBR

PDSP16488A MA
PDSP16488A MA
Single Chip 2D Convolver with Integral Line Delays
Supersedes January 1997 version, DS3742 - 3.1
DS3742 - 4.0 January 2000
The PDSP16488A is a fully integrated, application specific, image processing device. It performs a two dimensional
convolution between the pixels within a video window and a
set of stored coefficients. An internal multiplier accumulator
array can be multi-cycled at double or quadruple the pixel
clock rate. This then gives the window size options listed in
Table 1.
An internal 32k bit RAM can be configured to provide
either four or eight line delays. The length of each delay can
be programmed to the users requirement, up to a maximum of
1024 pixels per line. The line delays are arranged in two
groups,which may be internally connected in series or may be
configured to accept separate pixel inputs. This allows interlaced video or frame to frame operations to be supported.
The 8 bit coefficients are also stored internally and can
be downloaded from a host computer or from an EPROM. No
additional logic is required to support the EPROM and a single
device can support up to 16 convolvers.
The PDSP16488A contains an expansion adder and
delay network which allows several devices to be cascaded.
Convolvers with larger windows can then be fabricated as
shown in Table 2.
Intermediate 32 bit precision is provided to avoid any
danger of overflow, but the final result will not normally occupy
all bits. The PDSP16488A thus provides a multiplier in the
output path, which allows the user to align the result to the
most significant end of the 32 bit word.
FEATURES
■
The PDSP16488A is a fully compatible replacement
for the PDSP16488
■
8 or 16 bit pixels with rates up to 40 MHz
■
Window sizes up to 8 x 8 with a single device
■
Eight internal line delays
■
Supports interlace and frame to frame operations
■
Coefficients supplied from an EPROM or remote host
■
■
Expandable in both X and Y for larger windows
Gain control and pixel output manipulation
■
132 pin QFP
Rev
A
Date
B
C
MAR 1993 JUL 1996
D
JAN1997
NOTE
Polyimide is used as an inter-layer dielectric and as
glassivation.
Polymeric material is also used for die attach which according
to the requirement in paragraph 1.2.1.b. (2) precludes
catagorising this device as fully compliant. In every other
respect this device has been manufactured and screened in full
accordance with the requirements of Mil-Std 883 (latest revision).
CHANGE NOTIFICATION
Data
Size
8
8
8
16
16
Window Size
Width X Depth
4
8
8
4
8
Max Pixel
Rate
Line
Delays
40MHz
20MHz
10MHz
20MHz
10MHz
4x1024
4x1024
8x512
4x512
4x512
4
4
8
4
4
The change notification requirements of MIL-PRF-38535 will
be implemented on this device type. Known customers will be
notified of any changes since the last buy when ordering further
parts if significant changes have been made.
PIXEL
CLOCK
GENERATOR
EPROM
ADDR
POWER ON
RESET
DATA
Table 1 Single Device Configurations
Max Pixel
Pixel
CLK
SYNC
SYNC
EXTRACT
Window size
Rate
Size
3x3
5x5
7x7
10MHz
8
1
1
1
4
4
4
9
10MHz
16
1
2
2
-
-
-
-
20MHz
8
1
2
2
6
6
8
-
20MHz
16
1
4
4
-
-
-
-
40MHz
8
1
4*
4 *
-
-
-
-
40MHz
16
2
-
-
-
-
-
-
RES
DELAYED
SYNC
BYPASS
9x9 11x11 15x15 23x23
DATA
IN
A/D
CONVERTER
COMPOSITE
OPTIONAL
FIELD
STORE
PDSP
16488A
CONVOLVER
OUTPUT
DATA
AUX
DATA
* Maximum rate is limited to 30 MHz by line store expansion delays
Table 2 Devices needed to implement typical window sizes
Fig. 1 Typical , Stand Alone, Real Time System
1
PDSP16488A MA
CE DS R/W PC0 PC1 RE
S CS3:0
MULTI PURPOSE
DATA BUS
X15:0
PROG
MASTER
CONTROL
X
DELAY
SINGLE
CONTROL
REGISTERS
DELOP
Y
DELAY
IP7:0
1
LINE
DELAY
BY
PASS
MUX
4
LINE
DLYS
8X8
ARRAY OF
MAC'S
BIN
OVER
FLOW
SCALER
Y
DELAY
L7:0
COMPARATOR
ADDER
3
LINE
DLYS
COEFFICIENT
STORE (64)
D15:0
DATA
OUT
OEN
CLOCK
Fig. 2 Functional Block Diagram
PIN NO
FUNCTION
AC PACKAGE
A1
B1
C2
C1
D2
D1
E2
E1
F2
G2
G1
H2
J1
J2
K1
K2
L1
L2
M1
N1
N2
L0
F1
L1
L2
L3
SPARE
L4
L5
L6
L7
IP7
SPARE
IP6
IP5
IP4
SPARE
IP3
IP2
IP1
IP0
BYPASS
PIN NO
FUNCTION
AC PACKAGE
X15
M3
X14
N3
X13
M4
SPARE
N4
SINGLE
M5
X12
N5
X11
M6
MASTER
M7
X10
N7
X9
M8
X8
N9
X7
M9
X6
N10
X5
M10
X4
N11
X3
M11
X2
N12
X1
N13
X0
M13
DELOP
L12
PC0
L13
PIN NO
AC PACKAGE
K12
K13
J12
J13
H12
G12
G13
F12
E13
E12
D13
D12
C13
C12
B13
A13
A12
B11
A11
B10
A10
FUNCTION
Pin out Table (84 pin PGA - AC84)
2
RES
CS0
CS1
CS2
CS3
PROG
DS
CE
R/W
HRES
OV
PC1
BIN
OEN
D0
D1
D2
D3
D4
D5
D6
PIN NO
FUNCTION
AC PACKAGE
B9
D7
A9
D8
B8
CLK
B7
SPARE
A7
D9
B6
D10
A5
D11
B5
SPARE
A4
D12
B4
D13
A3
D14
B3
D15
A2
F0
F1
VDD
N6
VDD
F13
VDD
A6
VDD
H1
GND
N8
GND
H13
GND
A8
GND
PDSP16488A MA
NAME
TYPE
DESCRIPTION
IP7:0
INPUT
Pixel data input to the first line delay. [most significant byte in 16 bit mode]
L7:0
I/O
Pixel data input to the second group of line delays. [least significant byte in 16bit mode]. Alternatively
an output from the last line delay when the appropriate mode bit is set.
BYPASS
INPUT
The first line delay in the first group is bypassed when this input is active. (High). No internal pull up.
HRES
INPUT
Resets the line delay address pointers when high. Normally the composite sync signal in real time
applications. In non real time systems it defines a frame store update period, when low.
X15:0
DUAL
FUNCTION
Address/data connections from a MASTER or SINGLE device to the external coefficient source, with
X15 defining EPROM or Host support. Otherwise they provide the expansion data input.
D15:0
OUTPUT
Signed 16 bit scaled data or multiplexed 32 bit intermediate data. During intermediate transfers the
most significant half is valid when the clock is low, and the least significant half when clock is high.
PC1
OUTPUT
During programming a MASTER device outputs a timing strobe on this pin. This is passed down the
chain in a multiple device system, using the PC0 input on the next device.
PC0
INPUT
This pin is used in conjunction with PC1 in multiple device systems. It terminates the write strobe from
a MASTER device which is EPROM supported.
DELOP
OUTPUT
This output provides a version of the HRES input which has been delayed by an amount defined by
the user.
DS
I/O
The data strobe from a host computer. Active low. This pin will be an output from an EPROM supported
MASTER device which provides strobes to the remaining devices.
CE
INPUT
An active low enable which is internally gated with R/ W and DS to perform reads or writes to the
internal registers. In a SINGLE or MASTER device, which is supported from an EPROM, the bottom
72 addresses are always used and CE is not needed. CE can then be used to initiate a new register
load sequence after the power on load sequence.
R/ W
INPUT
Read / not write line from the host CPU. When an EPROM is used this pin should be tied low.
PROG
I/O
This pin is normally an input which signifies that registers are to be changed or examined. It is,
however, an output from an EPROM supported SINGLE or MASTER device indicating to the rest of
the system that registers are being updated.
CLK
INPUT
Clock. All events are triggered on the rising edge of the clock, except the latching of least significant
expansion inputs . Internally the clock can be multiplied by two or four in order to increase the effective
number of multipliers.
BIN
OUTPUT
This output indicates the result from the internal comparison. A high value indicates that the pixel
was greater than the internal threshold. The output is only valid from the last device in a chain.
OV
OUTPUT
When high this output indicates that there has been a gain control overflow.
RES
INPUT
Active low power on reset signal.
SINGLE
INPUT
Tied to ground to indicate a SINGLE device system. Internal pull up resistor.
MASTER
INPUT
Tied to ground to indicate the MASTER device in a multiple device system. Must be left open circuit
in a SINGLE device system. Internal pull up.
OEN
INPUT
Output enable signal. Active low.
CS3:0
OUTPUTS
Four address bits from a MASTER specifying one of sixteen devices in a multiple device system. Must
be externally decoded to provide chip enables for the additional devices.
F1:0
OUTPUTS
These bits indicate the field selection given by the auto select logic. The same coding as that used
for Control Register bits C5:4 is used.
VCC / GND
SUPPLY
Four Power and ground pairs. All must be connected.
3
PDSP16488A MA
BASIC OPERATION
MULTIPLIER ARRAY
The PDSP16488A convolver performs a weighted
sum of all the pixels within an N x N two dimensional window.
Each pixel value is multiplied by a signed coefficient, or weight,
and the products are summed together. In practice positive
weights would be used to produce averaging effects, with
various distribution laws, and negative weights would be used
for edge enhancement. The window is moved continuously
over the video frame, and for real time operation a new result
must be obtained for every pixel clock. In most applications
odd sized windows will be used, resulting in a centre pixel
whose value is modified by the surrounding pixels.
OUTPUT ACCURACY
With 8 bit pixels, and an 8 x 8 window, it is possible for
the accumulated sum to grow to 22 bits within a single device.
With 16 bit pixels, and an 8 x 4 window ( the maximum
possible ), the sum can grow to 29 bits. The PDSP16488A
actually allows for word growth up to 32 bits, and thus allows
several devices to be cascaded without any danger of overflow. Since coefficients can be negative, the final result is a 32
bit signed two's complement number.
In a particular application the desired output will lie
somewhere within these 32 bits, the actual position being
dependent on the coefficient values used. This causes problems in physically choosing which output pins to connect to the
rest of the system. To overcome this problem the
PDSP16488A contains an output multiplier, or gain control,
which allows the final result to be aligned to the most significant end of the 32 bit internal result.The provision of a
multiplier, rather than a simple shifter, allows the gain to be
defined more accurately.
The sixteen most significant bits of the adjusted result are
available on output pins, and contain a sign bit.
OUTPUT SATURATION
If the output from the convolver is driving a display,
negative pixels will give erroneous results. An option is thus
provided which forces all negative results to zero, which are
then interpreted as black by the display. At the same time
positive results, which overflow the gain control, are forced to
saturate at the most positive number ie peak white. In this
mode the output sign bit is always zero,and should not be
connected to an A/D converter.
A separate option forces both negative and positive
overflows to saturate at their respective maximum values, but
in scale negative results remain valid. A gain control overflow
warning flag is also available, which can be used in a host CPU
supported system to change the gain parameters if overflows
are not acceptable.
BINARY OUTPUT
The PDSP16488A contains a 16 bit arithmetic comparator which allows the output from the gain control to be
compared with a previously programmed value. An output
flag allows the user to detemine if the result was above or
below a value contained within an internal register.
4
The PDSP16488A contains sixteen 8x8 multipliers
each producing a 16 bit result. Internally the pixel clock
supplied by the user can be multiplied by two or four, which
together with the proprietary architecture, allows each multiplier to be used several times within a pixel clock period. This
increases the effective number of multipliers, which are available to the user, from 16 to 32 or 64 respectively. This
architecture produces a very efficient utilization of chip area,
and allows the line delays to be accommodated on the same
device.
The sixteen multipliers are arranged in a 4 deep by 4
wide array, resulting in effective arrays of 4 by 8 or 8 by 8 with
the multi-cycling options. The multiplier array can also be
configured to handle 16 bit signed pixels; the effective number
of available multipliers is then halved.
LINE DELAY OPERATION
Internal RAM is arranged in two separate groups, and
can be configured to provide line delays to match the chosen
size of the convolver. When a four deep arrangement is used,
with 8 bit pixels, four line delays are available, and each can
be programmed to contain up to 1024 pixels. In an eight deep
array, or if16 bit pixels are needed, each line can contain up
to 512 pixels. Figure 4 illustrates the options available.
The first line delay in one of the groups can optionally
be switched in or out under the control of an input pin. It is used
to delay the pixel input when data is obtained from another
convolver in a multiple device system, or it is used to support
interlaced video.
Signals L7:0 may be used as pixel inputs or outputs.
They are configured as inputs at power-on to avoid possible
bus conflicts, but by setting a mode control bit can become
outputs. They can then be used to drive another device when
multiple PDSP16488A's are required.
INTERLACED VIDEO
When using real time interlaced video, a picture or
frame is composed from two fields, with odd lines in one field
and even lines in the other. An external field delay is thus
required to gather information from adjacent lines, and the
convolver needs two input busses. The bus providing the
delayed pixels has an extra internal line delay. This is only
used in the field containing the upper line in any pair of lines,
and must be bypassed in the other field. It ensures that data
from the previous field always corresponds to the line above
the present active line, and avoids the need to change the
position of the coefficients from one field to the next.
Figure 3 shows the translation from physical to internal
line positions, for single device interlaced systems. Line N is
the line presently being convolved, which is either one or two
lines previous to the line presently being produced.
When windows requiring four or more lines are to be
implemented, the first line delay, in the group supplied from
the L7:0 pins, must always be by-passed. This by-pass option
is controlled by Register B, bit 7 and is not effected by the
BYPASS input pin.. The coefficients must be loaded into the
locations shown, which match the translated line positions,
with unused coefficients, shown shaded, loaded with zero's.
PDSP16488A MA
IP7:0
1024
FIELD
DELAY
3 X 3 WINDOW
C4
LINE N-1
C5
C6
C8
C9
C10
C0
C1
C2
LINE N
ODD
FIELD
1024
N-1
L7:0
VIDEO
LINE N+2
N+1
1024
N
LINE N+1
4X4
OR
8X4
ARRAY
Output is shifted
by 1 line in
every field
1024
5 X 5 WINDOW
LINE N-2
C48
C49
C50
C51
IP7:0
512
C52
ODD
FIELD
LINE N-1
C8
C9
C10
C11
C12
LINE N
C40
C41
C42
C43
C44
C0
C1
C2
C3
C4
C32
C33
C34
C35
C36
LINE N+1
LINE N+2
512
FIELD
DELAY
512
L7:0
VIDEO
LINE N+2
512
*Delay is By-Passed
512
[REG B,BIT 7 IS SET]
8 X 8 WINDOW
LINE N-2
C24
C56
LINE N-1
C16
LINE N
C48
C25
C57
C17
C49
C26
C58
C18
C50
C27
C59
C28
C60
LINE N+1
C8
C9
LINE N+2
C40
C41
LINE N+3
C0
C1
C32
C33
C10
C61
C30
C62
C31
512
ODD
FIELD
C63
512
FIELD
DELAY
C20
C21
C22
C23
C51
C52
C53
C54
C55
512
512
C11
C12
C13
C14
C15
C42
C43
C44
C45
C46
C47
C2
C3
C5
C6
C7
C34
C35
C36
N+2
8X8
ARRAY
Output is shifted
by 1 line in
every field
N
N-2
IP7:0
C29
C19
C4
*
512
L7:0
LINE N+4
N-1
512
512
LINE N-3
N+1
C37
C38
VIDEO
LINE N+4
*Delay is By-Passed
[REG B,BIT 7 IS SET]
512
N+3
N+1
N-1
N-3
*
N+4
512
N+2
512
N
512
N-2
8X8
ARRAY
Output is shifted
by 2 lines in
every field
C39
Figure 3. Line Delay Allocations in Single Device Interlaced Systems
5
PDSP16488A MA
IP7:0
IP7:0
BYPASS
512
BYPASS
512
512
512
512
512
512
8X8
ARRAY
512
8x8
ARRAY
L7:0
512
512
512
512
512
512
512
512
L7:0
IP7:0
BYPASS
IP7:0
1024
1024
BYPASS
1024
1024
4X4
OR
8X4
ARRAY
1024
L7:0
1024
1024
4X4
OR
8X4
ARRAY
1024
L7:0
L7:0
IP7:0
BYPASS
512
512
512
512 16
512
512 16
512
512 16
16
4X4
OR
8X4
Fig. 4. Line Delay Configurations
DEFINING THE LENGTH OF THE LINE DELAY
Figure 4 defines the maximum line lengths available in
each of the window size options. The actual line lengths can
be defined in one of three ways, to support both real time
applications, taking pixels directly from a camera, and also
use in systems supported by a frame store. In the former case
the line delays must be referenced to video synchronization
pulses. In the latter case the line lengths are well defined, and
the horizontal flyback 'dead times' will have been removed.
To support real time applications an option is provided
in which the length of the line delay is defined by the number
of clocks obtained whilst an input pin ( HRES ) is in-active.
HRES would normally be composite sync when the convolver
is directly attached to an NTSC or PAL video camera.
Conceptually, the line delay is achieved by reading the
previous contents of a RAM based line store, and then writing
new information to the same address. When HRES is active
write operations are inhibited, and the address counter is
reset. During an active line the counter is incremented by the
pixel clock. If the maximum count is reached before the end of
a line, then write operations are terminated and wrap-around
effects avoided.
The active going edge of HRES, marking the end of a
line, is normally asynchronous to the pixel clock, and it is
possible for an additional pixel to be stored on some lines. This
has no effect on the convolver operation, and will not cause a
cumulative shift in the pixel position from line to line.
6
An alternative means of defining the line length is,
however, provided when an exact number of pixels is needed.
HRES going in-active then starts the delay operation for every
line, but it ceases when the 10 bit value contained in two
registers is reached. This method can avoid the need to store
blank pixels at the end of a line before sync goes active. With
this method the line must contain an even number of pixels,
but the value loaded into the control registers defining the line
length, must be one less than the even number needed.
In an image processing system, the pixel clock is often
re-synchronized, or even inhibited, during blanking or sync.
The next line is then started with a precise time interval from
the end of sync to the first pixel clock edge. This avoids any
visible pixel jitter at the beginning of the line, which would
otherwise be present since pixel clock is asynchronous with
respect to video sync pulses.
When using the PDSP16488A the pixel clock should
not be inhibited, or re-synchronized, until the delayed version
of the HRES input goes active. This is present on the DELOP
output pin. This will ensure that no pixels on the right hand
edge are lost due to the internal pipeline delay.
If the pixel clock is a continuous signal, the user must
ensure that the HRES in-active transition meets the timing
requirements defined in Figure 10. The active going edge at
the end of a line need not be synchronized.
When pixels are read/written to a frame store, an
alternative line delay configuration is needed. Within the
frame store lines would be stored in contiguous locations,
with no gaps caused by the flyback period between the lines.
This method of use makes the HRES defined line delay
operation difficult to use, and an alternative mode of operation
is provided. The HRES input is then driven by a system
provided signal, which defines a complete frame store update
period. It is not a line defining signal. The high to low transition
of this signal will initiate the line store update sequence and
allow the internal address pointers to increment. These pointers will be synchronously reset at the end of a line, when they
reach the pre-programmed value. They will then immediately
start a new operation using address zero. The actual line delay
must be pre-loaded into two control registers as described
previously.
Write operations back to the frame store must allow for
the total pipeline delay. This can be achieved by inhibiting
write operations until the delayed version of HRES goes low
at the DELOP output pin. Write operations then continue until
it goes back high. The PDSP16488A assumes that data is
valid when a clock signal is applied, and that it also meets the
set up and hold requirements given in Figure 10. If data is not
valid, due for example to a frame store DRAM refresh cycle,
then the user must externally inhibit the clock. The clock
supplied to the convolver will in this mode be a signal which
defines a frame store cycle time.
The use of the convolver in a line scan system is similar
to its use with a frame store. These systems have no flyback
period, and the address counter must be synchronously reset
at the end of the line and then allowed to continue.
GAIN CONTROL
The gain control is provided as an aid to locating the
bits of interest in the 32 bit internal result. The magnitude of the
largest convolved output will depend on the size of the
PDSP16488A MA
window, and the coefficient values used. The function of the
gain control is then to produce an output, which is accurate to
16 bits, and which is aligned to the most significant end of this
32 bit word. The sixteen most significant bits of the word are
available on output pins, and the largest number need only
have one sign bit if the gain control is correctly adjusted.
Fiigure 5 indicates the mechanism employed with the
required function implemented in two steps. Two mode control
bits allow one of four 20 bit fields to be selected from the final
32 bit value. These four fields are positioned with the first at
the most significant end, and then at four bit displacements
down to the least significant end.
By setting an enabling bit, the field selection can
optionally be done automatically. This feature should only be
used in the real time operating mode, when HRES defines
video lines. Internal logic examines the most significant 13, 9,
or 5 bits from the 32 bit result, and makes a field selection
dependent on which group does not contain identical sign bits.
If less than five sign bits are obtained, the logic will select the
field containing the most significant 20 bits.
The automatic selection is particularly useful when a
fixed scene is being processed. The selection is reset when
any internal register is updated ( ie PROG has been active )
and is then held in-active for ten further occurances of the
HRES input. This allows the internal multiplier/ accumulator
array to be completely flushed before a field selection is made.
As convolver outputs of greater magnitude are produced the
field selection logic will respond by selecting a more significant
field. The most significant field found necessary remains
selected until PROG again goes active. Even if the automatic
field selection is not enabled, two outputs, F1:0, will still
indicate which field would have been selected. These are
coded in the same way as Register C, bits 5:4.
Having chosen a field, either manually or automatically, it is then multiplied by a 4 bit unsigned integer. This is
contained within a user programmed register, and the multiplication will produce a 24 bit result . The middle 16 bits of this
result contain the required output bits. The gain control multiplier can overflow in to the unused most significant four bits if
the parameters are chosen wrongly. This condition is indicated by an overflow flag .
By setting appropriate mode control bits, further manipulation of the gain control output is possible. One option
allows all negative outputs to be forced to zero, and at the
same time positive gain control overflows will saturate at the
maximum positive number. A different option will saturate
positive and negative overflows at their respective maximum
values, but otherwise leaves them unchanged. Occasional
FROM EXPANSION ADDER
32 BITS
MSB
D15:0
20 12 4 20 8
8 20 4
12 20
LSB
MUX
20
4
GAIN
REGISTER
4
X
24
16
SATURATE
LOGIC
overflows can be tolerated in some systems, and this option
prevents any gross errors.
EXPANSION
Multiple devices can be connected in cascade in order
to fabricate window sizes larger than those provided by a
single device. This requires an additional adder in each device
which is fed from expansion data inputs. This adder is not
used by a single device or the first device in a cascaded
system, and can be disabled by a mode control bit.
The first device in the cascaded system must be
designated as a MASTER device by tying an input pin low. Its
expansion input bus is then used as the source of data for the
coefficient and control registers in all devices in the system.
In order to reduce the pin count required for 32 bit
busses, both expansion in and data out are time multiplexed
with the phases of the pixel clock. When the clock is high the
least significant half will be valid, and when the clock is low the
most significant half will be valid.
In practice this multiplexing is only possible with pixel
clocks up to 20MHz. Above these frequencies the multiplexing
must be inhibited by setting a Mode Control bit ( Register A,
Bit 7 ). The intermediate data accuracy will then be reduced,
since only the lower 16 bits of the internal 32 bit intermediate
sum are available on the output pins. In such systems the
coefficients must be scaled down in order to keep the
intermediate and final results down to 16 bits. The final device
should not use the gain control, and instead should simply
output the non-multiplexed 16 bit result. The overflow flag and
pixel saturation options will not be available.
PIXEL INPUT AND OUTPUT DELAYS
In a real time system, when line delays are referenced
to video sync pulses present on the HRES input, the first pixel
from the last line delay does not appear on the L7:0 pins until
the fifth active pixel clock edge after HRES has gone low. This
is illustrated in Figure 7. In a vertically expanded system, this
output provides the input to the first line delays in the vertically
displaced devices. The internal logic is thus designed to
always expect this five clock delay. Compensation must thus
be applied to the devices which are directly connected to the
video source, such that the first pixel is not valid until the fifth
clock edge.
For this reason the PDSP16488A contains an optional
four clock pipeline delay on each of the pixel data inputs.
When the delay is used the first pixel in a video line must be
available on the input pins after the first pixel clock edge. This
would be so if the device were connected to an A/D converter,
since that would introduce a one pixel pipeline delay. If the
system introduces any further external pipeline delays, then
the internal delay should be bypassed, and the user should
ensure that the first pixel is valid after the fifth clock edge.
The use of this four clock delay is controlled by Bit 3,
in Control Register B. This delay is in addition to the delays
which are provided to support expansion in both the X and Y
directions, and are controlled by Register D, Bits 3:2. Both
delays are in fact simply added together in the device, but are
provided for conceptually different reasons.
4
Fig. 5. Gain Control Operation
7
PDSP16488A MA
INPUT
4
delays
0
delays
B3 = 1
4
delays
B3 = 1
D3:2 = 00
0
delays
D3:2 = 00
N th DEVICE IN THE ROW
WIDTH = S
WIDTH = S
line
delays
line
delays
4 clock
delay
4 clock
delay
ZERO
4 clock
delay
0
delays
D
delays
0/4
delays
0
delays
D0 = 0
D0 = 0 or 1
0
delays
B3 = 0
B3 = 0
D
delays
D = 4+S(N-1) Defined by D3:2
N th DEVICE IN THE ROW
D = 4+S(N-1) Defined by D3:2
WIDTH = S
WIDTH = S
line
delays
line
delays
0
delays
4 clock
delay
0 IF S = 4, 4 IF S = 8
4 clock
delay
4 clock
delay
0/4
delays
D0 = 0
0
delays
0
delays
B3 = 0
D
delays
D = 4+S(N-1) Defined by D3:2
0 IF S = 4, 4 IF S = 8
D0 = 0 OR 1
B3 = 0
D
delays
WIDTH = S
line
delays
N th DEVICE IN THE ROW
D = 4+S(N-1) Defined by D3:2
WIDTH = S
line
delays
4 clock
delay
4 clock
delay
0
delays
0/4
delays
D0 = 0
D0 = 0 OR 1
0 IF S = 4,4 IF S = 8
OUTPUT
Fig. 6. Multi-Device Delay Paths
DELAY COMPENSATION FOR LARGE WINDOWS
A large window is composed of several partial windows
each of which is implemented in an individual device. If
necessary the partial window must be padded with zero
coefficients to become one of the standard sizes. When
constructing a large window it is necessary to delay the
expansion data inputs in order to compensate for growth in the
horizontal direction. Delays in the partial sums are also
necessary to compensate for the total pipeline delay needed
to produce the previous complete horizontal stripe.
Within each device in a horizontal stripe, apart from
the first, the expansion input must be delayed by the width of
the partial window, before it is added to the internal sum. Since
partial windows can only be 4 or 8 pixels wide,a delay of 4 or
8 pixel clocks is needed. There is, however, an in-built delay
8
of 4 pixels in the inter device connection, and the
PDSP16488A thus only needs an option to delay the
expansion input by an additional four pixels.
The data from the last device in a horizontal row of
convolvers feeds the expansion input of the first device in the
next row. This is shown in Figure 6. With this arrangement, the
position of the partial window as illustrated, is the inverse of
its vertical position on a normal TV screen. Thus the top, left
hand, device corresponds to the bottom, left hand, portion of
the complete window.
The output from the last device in the row is delayed
with respect to the original data input by an amount given by
the formula;
DELAY = 4 + [N-1].S where N is the number of devices in
a row and S is the partial window width, ie 4 or 8.
PDSP16488A MA
Function
The internal convolver sums, in each of the devices in
the next row, must be delayed by this amount before they are
added to results from the previous row. This is more conveniently achieved by delaying data going into the line stores. The
required cumulative delay with respect to the first horizontal
stripe is then automatically obtained when more than two rows
of devices are needed.
Two bits in Control Register D are used to define one
of four delay options. These delays have been selected to
support systems needing from two to eight devices and are
described in the applications section.
Mode Reg A
Mode Reg B
Mode Reg C
Mode Reg D
Comparator LSB
Comparator MSB
Scale Value
Pixels / Line LSB
Pixels / Line MSB
C0 - C15
C16 - C31
C32 - C47
C48 - C63
Unused
COEFFICIENTS
Sixty-four coefficients are stored internally and must
be initially loaded from an external source. Table 3 gives the
coefficient addresses within a device, with coefficent C0
specified by the least significant address and C63 by the most
significant address. Table 5 shows the physical window position within the device which is allocated to each coefficient in
the various modes of operation. Horizontally the coefficient
positions correspond to the convolution process as if it were
conceptually observed on a viewing screen, ie the left hand
pixel is multiplied with C0. In the vertical direction the lines of
coefficients are inverted with respect to a visual screen, ie the
line starting with C0 is actually at the bottom of the visualized
window.
The coefficients may be provided from a Host CPU
using conventional addressing, a read/write line, data strobe,
and a chip enable. Alternatively, in stand alone systems, an
EPROM may be used. A single EPROM can support up to 16
devices with no additional hardware.
When windows are to be fabricated which are smaller
than the maximum size that the device will provide in the
required configuration, then the areas which are not to be used
must contain zero coefficients. The pipeline delay will then be
that of a completely filled window.
TOTAL PIPELINE DELAY
The total pipeline delay is dependent on the device
configuration and the number of devices in the system. Table
4 gives the delays obtained with the various single device
Hex. Addr
00
01
02
03
04
05
06
07
08
40 - 4F
50 - 5F
60 - 6F
70 - 7F
09 - 3F
Table 3 Internal Register Addressing
Data
size
8
8
8
16
16
Window
Size
4x4
8x4
8x8
4x4
8x4
Pipeline
Delay
34
30
26
28
26
Table 4 Pipe line dalays
configurations when the gain control is used. These delays
are the the internal processing delays and do not include the
delays needed to move a given size window completely into
a field of interest. When multiple devices are needed, additional delays are produced which must be calculated for the
particular application. These delays are discussed in the
applications section.
The PDSP16488A contains facilities for outputing a
delayed version of HRES to match any processing delay.
Control register bits allow this delay to be selected from any
value between 29 and 92 pixel clocks.
ASYNCHRONOUS BACK EDGE
ACTIVE LINE PERIOD
Set Up
Time
HRES
[SYNC]
2
3
4
5
6
7
8
1
2
6
7
CLOCK
First
pixel
valid
[B3 set]
First
pixel
from
line
store
valid
last 2
pixels
internally
stored
LINE STORE
WRITES INHIBITED
Fig.7 Pixel Input Delays
9
PDSP16488A MA
IP7:0
512
C0
C1
C2
C3
C4
C5
C6
C7
C8
C9
C10
C11
C12
C13
C14
C15
C16
C17
C18
C19
C20
C21
C22
C23
C24
C25
C26
C27
C28
C29
C30
C31
C32
C33
C34
C35
C36
C37
C38
C39
C40
C41
C42
C43
C44
C45
C46
C47
C48
C49
C50
C51
C52
C53
C54
C55
C56
C57
C58
C59
C60
C61
C62
C63
512
512
512
512
512
512
512
L7:0
IP7:0
L7:0
512
512
MSB
8X8, 8 Bit Data
LSB
512
512
512
512
512
512
16
CO
C32
C1
C33
C2
C34
C3
C35
C4
C36
C5
C37
C6
C38
C7
C39
16
C8
C40
C9
C41
C10
C42
C11
C43
C12
C44
C13
C45
C14
C46
C15
C47
16
C16
C48
C17
C49
C18
C50
C19
C51
C20
C52
C21
C53
C22
C54
C23
C55
16
C24
C56
C25
C57
C26
C58
C27
C59
C28
C60
C29
C61
C30
C62
C31
C63
8X4, 16 Bit Data
IP7:0
1024
C0
C1
C2
C3
C4
C5
C6
C7
C8
C9
C10
C11
C12
C13
C14
C15
C16
C17
C18
C19
C20
C21
C22
C23
C24
C25
C26
C27
C28
C29
C30
C31
1024
1024
1024
L7:0
8X4, 8 Bit Data
IP7:0
1024
C0
C1
C2
C3
1024
C4
C5
C6
C9
C10
L7:0
C13
C14
512
512
MSB
LSB
512
512
512
512
512
512
C11
1024
C12
L7:0
C7
1024
C8
IP7:0
C15
C0
C1
C2
C3
16
C16
C17
C18
C19
C4
C5
C6
C7
16
C20
C21
C22
C23
16
16
C8
C9
C10
C11
C24
C25
C26
C27
C12
C13
C14
C15
C28
C29
C30
C31
4X4, 8 Bit Data
4X4, 16 Bit Data
NOTE
Two coefficients occuring in the same box have identical values
Table 5 Physical Coefficient Position
10
PDSP16488A MA
LOADING REGISTERS FROM A HOST CPU
The expansion data inputs [X14:0] on a single or
master device are connected to the host bus to provide
address and data for the internal registers. In a multiple device
system the remaining devices receive addresses and data
which have been passed through the expansion connection
between earlier devices in the cascade chain. Each device
needs an individual chip enable plus a global data strobe,
read/write line, and PROG signal from the host.
Registers are individually addressed and can be
loaded in any sequence once the global PROG signal has
been produced by the host. The latter would normally be
produced from an address decode encompassing all the
necessary device addresses.
If a self timed system is to be implemented, a timing
strobe must be passed down the expansion chain through the
PC1/PC0 connections. The PC0 output from the final device
is used as a host REPLY signal, and indicates that the last
device has received data after the propogation delay of
previous devices. The timing strobe is produced in the
MASTER device from the host data strobe, and will appear on
the PC0 output. This feature allows the user to cascade any
number of devices without knowing the propogation delay
through each device. The timing information for this mode of
operation is given in Figure 8.
The host can also read the data contained in the
internal registers. The required device is selected using chip
enable with the R/W line indicating a read operation. Single
device systems output the data read on X7:0, but in multiple
device systems data is read from the D7:0 outputs on the final
device in the chain. These must be connected back to the host
data bus through three-state drivers. When earlier devices in
the chain are addressed, the register contents are transferred
through the expansion connections down to the final device.
In the self timed configuration the data will be valid when the
REPLY goes active, as shown in Figure 8.
If the REPLY signal is not to be used , the PC0/PC1
connections are not necessary, and the host data strobe for a
write operation must be wide enough to allow for the worst
case propogation delay through all the devices ( TDEL ). If the
data or address from the host does not meet the set up time
given in Fugure 8, the width of the data strobe can be simply
extended to compensate for the additional delay. When reading data the access time required is: TACC + ( N - 1 ).TDEL
using the maximum times obtained from Figure 8.
HOST CONTROL LINES
X7:0
8 bit data bus. In a single device system this bus is
bi-directional; in other configurations it is an input.
Only a SINGLE or MASTER device is connected
directly to the host. Other devices receive data from
the output of the previous device in the chain.
X14:8
7 bit address bus which is used to identify one of
the 73 internal registers. Connected in the same
manner as X7:0.
X15
X15 must be open circuit on the MASTER device
PC0
An input from the previous PC1 output in a multiple
device chain. Not needed on a SINGLE device or
if the self timed feature is not used.
PC1
Reply to the host from a SINGLE device or from the
last device in a cascade chain. It indicates that the
write strobe can be terminated. Connected to PC0
input of the next device at intermediate points in the
chain if the self timed feature is used.
R/W
Read/Not Write line from the host CPU which is
connected to all devices in the system.
CE
An active low enable which is normally produced
from a global address decode for the particular
device. This must encompass all internal register
addresses.
An active low host data strobe which is connected
to all devices. in the system.
DS
PROG
An active low global signal, produced by the host,
which is connected to all devices in the system.
Together with a unique chip enable for every device, it allows the internal registers to be updated
or examined by the host. PROG and CE should be
tied together in a single device system.
LOADING REGISTERS FROM AN EPROM
In the EPROM supported mode, one device has to
assume the role of a host computer. If more than one device
is present, this must be the first component in the chain,
which must have its MASTER pin tied low.
The MASTER device contains internal address counters which allow the registers in up to 16 cascaded devices to
be specified. It also generates the PROG signal and a data
strobe on the pins which were previously inputs. These
outputs must be connected to the other devices in the system,
which still use them as inputs. The R/W input should be tied
low on all devices.
The width of the data strobe is determined by the
feedback connection from the PC1 output on the last device
to the PC0 input on the MASTER. The PC0/PC1 connections
must be made between devices in a multiple device system;
in a single device system the connection is made internally.
The available EPROM access time is determined by
an internal oscillator and does not require the pixel clock to be
present during the programming sequence. Any pixel clock resynchronization in a real time system will thus not effect the
coefficient load operation. The relevent EPROM timing information is shown in figure 9.
The load procedure will commence after reset has
gone from active to in-active, and will be indicated by the
PROG output going active. The data from 73 EPROM locations will be loaded into the internal registers using addresses
corresponding to those in Table 3. Within a particular page of
128 EPROM locations, the first nine locations supply control
register information, and the top 64 supply coefficients. The
middle 55 locations are not used. If the window size is 8 x 4,
the top 32 locations will also contain redundant data, and if
the size is 4 x 4 the top 48 will be redundant.
11
PDSP16488A MA
In a multiple device system the load sequence will be
repeated for every device, and four additional address bits will
be generated on the CS3:0 pins. These address bits provide
the EPROM with a page address, with one page allocated to
each device in the system. Within each page only 73 locations
provide data for a convolver, the remainder are redundant as
in the single device system. The CS3:0 outputs must also be
decoded in order to provide individual chip enables for each
device. These can readily be derived by using an AS138 TTL
decoder. Bits in an internal control register determine the
number of times that the sequence is repeated.
If changes to the convolver operation are to be made
after power-on, activating the CE input on the MASTER or
SINGLE device will instigate the load procedure. Additional
EPROM address bits supplied from the system will allow
different filter coefficients to be used.
register load sequence is occuring, either after
power on, or as the result of CE as explained above.
It remains active until register 73 in the final device
has been loaded. Four bits in a control register
define the number of cascaded devices.
SYSTEM CONFIGURATION
The device is configured using a combination of the state of
the SINGLE and MASTER pins, and the contents of the four
Mode Control registers. In a MASTER or SINGLE device the
state of the X15 pin is used to define whether the system is
EPROM or host supported.
MODE CONTROL REGISTERS
EPROM CONTROL LINES
REGISTER A Bit Allocation
X7:0
8 bit data from the EPROM to the MASTER or
SINGLE device. Otherwise data is received from
the previous device in the chain.
X14:8
Lower 7 address bits to the EPROM from a MASTER or SINGLE device. Otherwise an input from
the data outs of the previous device.
BIT
CODE
FUNCTION
3:0
XXXX
Number of extra devices from1-15
X15
Tied to ground on a MASTER device to indicate the
EPROM mode.
6:4
000
R/W
Tied low on all devices.
6:4
001
DS
An output from a MASTER or SINGLE device
which provides a data strobe for the other devices.
6:4
010
6:4
011
6:4
101
7
7
0
1
8 bit, 8x8 window,
8x512 line delays.
16 bit, 8x4 window,
4x512 line delays.
16 bit, 4x4 window,
4x512 line delays.
8 bit, 8x4 window,
4x1024 line delays.
8 bit, 4x4 window,
4x1024 line delays
Multiplexed exp. data
Non-mux. exp. data
CS3: 0
.
PC0
Four additional address bits for the EPROM which
are provided by the MASTER device. They allow
16 additional devices to be used and must be
externally decoded to provide chip enables.
An input on the MASTER device which is driven
from the PC1 output of the last device in the chain.
Used internally to terminate the write strobe. Connected to previous PC1 outputs at intermediate
points in the chain. Not needed for a SINGLE
device.
PC1
An output connected to the PC0 input of the next
device in the chain. The last device feeds back to
the MASTER. Not needed for a SINGLE device.
CE
An enable which is produced by decoding CS3:0
from the MASTER. It is not needed for a MASTER
or SINGLE device which will always use the
bottom block of addresses with internally generated write strobes. It can however be used on
these devices to initiate a new load procedure
after the initial power on sequence.
PROG
12
An active low going signal produced by an
EPROM supported MASTER or SINGLE device.
An input to all other devices. It indicates that a
10MHz max,
10MHz max,
20MHz max,
20MHz max,
40MHz max,
BITS 3:0 These bits are 'don't care' when using a host
computer but to a MASTER device, in an EPROM
supported system, they define the number of interconnected chips. The EPROM must contain contiguous 128 byte blocks for each of the devices in
the system and a 4 bit counter in the MASTER
device will sequence through up to 16 block reads.
An internal comparator in the MASTER causes the
loading of the internal registers to cease when the
value in the counter equals that contained in these
bits. The bits are redundant in a SINGLE device
which only uses one 128 byte block.
BITS 6:4 These bits define one of the five basic configurations. The line delays will automatically be configured to match the chosen window size and pixel
accuracy. The maximum clock rate that is available to the user reflects the internal mutiplication
factor.
PDSP16488A MA
BIT 7
This bit must be set if the pixel clock is greater than
20MHz. It disables the output and input time
multiplexing, and instead outputs the least significant half of the 32 bit intermediate sum for the
complete clock cycle. When the gain control is
used, the output multiplexing will automatically be
disabled.
REGISTER B Bit Allocation
BIT
CODE
FUNCTION
0
0
Second line delay group fed from the
first group
Second line delay group fed from L7:0
which become inputs
Store pixels to end of line
Store pixels till count is reached
Frame store operation
Not Used
No delays on pixel inputs
4 delays on both pixel inputs
Use expansion adder
Expansion adder disabled
Not used
Use first delay in second group
Bypass first delay in second group
0
1
2:1
2:1
2:1
2:1
3
3
4
4
6:5
7
7
00
01
10
11
0
1
0
1
0
1
BIT 0
This bit defines the input for the second group of
line delays. It must be set in the 16 bit pixel modes,
and is set by power on reset.
BIT 2:1
These bits control the mode of operation of the line
stores. In real time systems pixels can be stored
either until HRES [ SYNC ] goes active , or until a
pre-determined count is reached. In the frame
store mode line store operations are continuous,
with a pre-determined line length.
BIT 3
When this bit is set four pipeline delays are added
to the pixel inputs to compensate for the internal/
external delays between line stores. The extra
delay is only necessary when a device supplied
with system video in which the first pixel in a line
is valid in the period following the first active clock
edge. See Fig 7. The delay is not necessary if the
device is fed from the output of another convolver.
When set this bit will add four additional delays to
those defined by Register D, bits 4: 2.
BIT 4
When this bit is set the expansion adder will not be
used. It is automatically set in a MASTER or SINGLE device.
BIT 7
This bit controls the bypass option on the first line
delay on the L7:0 inputs. It is only effective when
an 8 bit pixel mode is selected, which also needs
more than four line delays. When L7:0 are used as
outputs it should always be reset. In the 16 bit
modes the bypass function is only controlled by the
BYPASS pin, and the bit is redundant.
REGISTER C Bit Allocation
BIT
CODE
FUNCTION
0
0
Field selection defined by C5:4
0
1
Automatic field selection
3:1
000
DELOP = 29 + 0 clks
3:1
001
DELOP = 29 + 8 clks
3:1
010
DELOP = 29 + 16 clks
3:1
011
DELOP = 29 + 24 clks
3:1
100
DELOP = 29 + 32 clks
3:1
101
DELOP = 29 + 40 clks
3:1
110
DELOP = 29 + 48 clks
3:1
111
DELOP = 29 + 56 clks
5:4
00
Select upper 20 bits
5:4
01
Select next 20 bits
5:4
10
Select next 20 bits
5:4
11
Select bottom 20 bits
7:6
00
By-pass the gain control
7:6
01
Normal gain control O/P
7:6
10
Saturate at max + and -ve values.
7:6
11
Force -ve to zero.Sat.+ve values.
BIT 0
If this bit is set, the 20 bit field selected from the 32
bit result, is defined automatically by internal logic.
BITS 3:1 These bits are in conjunction with Register D, bits
7:5 to define the pixel delay from the HRES input
to the DELOP pin. They are used to match the
appropriate processing delay in a particular system. The minimum delay is 29 pixel clocks.
BITS 5:4 These bits define which of the four 20 bit fields out
of the 32 bit final result is selected as the input to
the gain control. They are redundant when the gain
control is not used, or if Register C, bit0, is set.
BITS 7:6 These bits define the use of the gain control as
given in the table. Intermediate devices in a multiple device system MUST by-pass the gain control, otherwise the additional pipeline delays will
effect the result. Disabling the scaler will reduce
the device pipeline by 13 PCLK cycles from the
delays shown in Table 4.
13
PDSP16488A MA
REGISTER D Bit Allocation
BIT 0
If this bit is set the expansion data input is delayed
by four pixel clocks before it is added to the present
convolver output. It is used in multiple device
systems when the partial window width is 8 pixels.
BIT 1
When this bit is set the internal sum is shifted to the
left by 8 places before being added to the expansion input. It is used when two devices are used,
each in an 8 bit pixel mode, to fabricate a 16 bit
pixel mode.
BIT
CODE
FUNCTION
0
0
X15:0 Not delayed
0
1
X15:0 Delayed
1
0
Internal sum not shifted
1
1
Internal sum multiplied by 256
3:2
00
I/P to line stores not delayed
3:2
01
I/P to line stores delayed by 4
3:2
10
I/P to line stores delayed by 8
3:2
11
I/P to line stores delayed by 12
BITS 3::2 These bits define the delays on both sets of pixel
inputs before entering the line stores. The delays
are always identical on both sets.
4
0
Un-signed pixel data input
BIT 4
4
1
2's complement pixel data input
When this bit is set the convolver interprets 8 or 16
bit pixels as 2's complement signed numbers
7:5
XXX
Add 0 to 7 clock delays to DELOP
BIT 7:5
These bits add 0 to 7 additional clock delays to
those selected by Register C, bits 3:1.
output.
ABSOLUTE MAXIMUM RATINGS [See Notes]
Waveform - measurement level
Test
Supply voltage Vcc
-0.5V to 7.0V
Input voltage VIN
-0.5V to Vcc + 0.5V
Output voltage VOUT
-0.5V to Vcc + 0.5V
Clamp diode current per pin IK (see note 2)
18mA
Static discharge voltage (HMB)
500V
-65°C to 150°C
Storage temperature TS
Max. junction temperature Military
150°C
Package power dissipation
3000mW
Thermal resistances, junction to case øJC
5°C/W
Delay from output
high to output
high impedance
VH
0.5V
Delay from output
low to output
high impedance
0.5V
VL
Delay from output
high impedance to
output low
NOTES ON MAXIMUM RATINGS
1. Exceeding these ratings may cause permanent damage.
Functional operation under these conditions is not implied.
2. Maximum dissipation or 1 second should not be exceeded,
only one output to be tested at any one time.
3. Exposure to absolute maximum ratings for extended
periods may affect device reliablity.
4. Current is defined as negative into the device.
Delay from output
high impedance to
output high
1.5V
0.5V
0.5V
1.5V
VH - Voltage reached when output driven high
VL - Voltage reached when output driven low
STATIC ELECTRICAL CHARACTERISTICS
Operating Conditions (unless otherwise stated)
Tamb =-55°C to +125°C. Vcc = 5.0v ± 10%
NOTE: Signal pins PC0, X15, MASTER, SINGLE, BYPASS and 0V have pull-up resistors in the range 15kΩ to 200kΩ.
Signal pins PROG and DS require external pull-up resistors in EPROM mode.
Characteristic
Symbol
Min.
*
*
*
*
*
†
*
†
Output high voltage
Output low voltage
Input high voltage
Input low voltage
Input leakage current
Input capacitance
Output leakage current
Output S/C current
14
VOH
VOL
VIH
VIL
IIN
CIN
IOZ
ISC
Value
Typ.
0.4
0.8
+10
2.4
2.0
-10
10
-50
10
Units
Subgroup
V
V
V
V
µA
pF
µA
mA
1,2,3
1,2,3
1,2,3
1,2,3
1,2,3
IOH = 4mA
IOL = -4mA
Except CLK, RES = 4V
1,2,3
GND < VOUT < VCC.No internal pull up
VCC = Max
Conditions
Max.
+50
300
GND < VIN < VCC.No internal pull up
PDSP16488A MA
Characteristic
Min.
TDSH
THSU
20
0
Read Set UpTime to prevent Write
T
RA
Host Signal Hold Time
Expansion in to Data Out in PROG mode
Delay from strobe to PC1
[Equivalent to PC0 to PC1 delay ]
Chip Enable Set Up Time
PROG Set Up Time
PROG Hold Time
THH
TDEL
TEXP
5
5
†
†
DS Hold Time after REPLY active
Host Address/data Set Up Time
†
†
†
†
†
†
†
†
†
Symbol
Chip Enable Hold Time
PC1 In-active Delay after DS in-active
TCSU
TPSU
Coefficient Read Time
†
Coefficients valid Time before REPLY
ns
Only applicable for read ops & if REPLY is used.
Only applicable if REPLY is used. Otherwise
time is referenced to risng edge of strobe
when set up must be N xTDEL, for Ndevices
ns
ns
30
ns
ns
Must always be guaranteed.
No clocks are needed in PROG mode
50
ns
Greater than TDEL under all conditions
ns
ns
TPCH
ACC
T
RSU
Notes
Units
0
0
0
0
T PH
T CH
T
†
Value
Max.
50
ns
Defines Data Strobe in-active time
50
ns
From MASTER or SINGLE device
5
ns
All parameters marked * are tested during production.
Parameters marked † are guaranteed by design and characterisation
T Wait
DATUM
T
> T
PCH
Wait
Host Data Strobe
T
T
CH
csu
Chip Enable
T
PSU
T
PH
PROG
T
ACC
Coefficient
Output
T
T
RSU
EXP
T PCH
PC1 from
MASTER or SINGLE device
T DSH
PC1 from last
device (REPLY)
T
RA
R/W from the Host
T
HSU
Address/data
from the Host
HH
VALID
T
Host Data O/P
from First Device
T
DEL
VALID
FIG. 8. Host Timing
15
PDSP16488A MA
Characteristic
Symbol
Min.
Value
Max.
Units
†
Delay from Data Strobe to MASTER PC1
T
†
Delay from PC0 Input to Write in-active
T
†
PC1 In-Active Delay
T
†
Write from MASTER In-Active
T
†
Write In-Active to new Address
T AD
†
EPROM Data Set Up Time
T DS
20
ns
†
Data Strobe from MASTER
T
10
ns
†
Chip Enable Set Up Time
TCSU
0
ns
†
Chip Enable Hold Time
T CH
0
ns
†
Availible EPROM Access Time
T
200
ns
†
Expansion In to Data Out
T
DEL
30
ns
PC0 to PC1 Delay
T
EXP
50
ns
†
ns
50
PCD
ns
5
WH
ns
50
PCH
ns
250
WW
30
RW
DA
Notes
ns
Single device
Greater than T DEL at at
emps
t
all all
temps
TRW
TWH
DS from
MASTER
TWW
TPCD
PC1 from
MASTER
T
EXP
PC1 from next Device
TPCH
PC1 from Last Device
[ PC0 to MASTER ]
EPROM
ADDRESS
VALID
VALID
T AD
TDS
EPROM
DATA
VALID
TDA
T
CSU
CE
T CH
Data O/P from
First Device
TDEL
VALID
TDEL
Data O/P from
Second Device
16
Fig. 9. EPROM Timing
VALID
Value
Characteristic
Symbol
Min.
Units
Sub group
ns
9,10,11
ns
9,10,11
*
Pixel Clock Low Time
T CL
25 (a)
10 (b)
*
Pixel Clock High Time
T CH
25 (a)
10 (b)
*
Data in Set Up Time
T DSU
10
ns
9,10,11
*
Data in Hold Time
T DH
0
ns
9,10,11
*
CLK rising to Output delay
T RD
21
ns
9,10,11
*
Line Store Output Delay
T LD
20
ns
9,10,11
†
HRES In-active Set Up Time
T
ns
9,10,11
†
Output Enable Time
†
Output Disable Time
RSU
Notes
Max.
10
T
DLZ
15
ns
9,10,11
T DHZ
15
ns
9,10,11
(a)
(b)
32 Bit Muxed Output
16 Bit Output
(a)
(b)
32 Bit Muxed Output
16 Bit Output
Increase to 25ns for
DELOP output
Measured with a 15kΩ
series resistor and
30pF load capacitance
PDSP16488A MA
APPLICATIONS INFORMATION
DEVICE REQUIREMENTS
The number of devices required to implement a given
convolver window depends on the size of the window, the
required pixel rate, and whether the pixel accuracy is to be 8
or 16 bits. In practice the PDSP16488A supports windows
requiring one, two, four, six, or eight devices without additional logic. Table 2 gives typical window sizes which may be
obtained with the above number of devices.
Figures 11 through 18 show system interconnections
for these arrangements. Other configurations are possible but
may need the support of additional pixel/line delays and/or
expansion adders. Although not necessarily shown, all configurations can be supported by either an EPROM or a Host
Computer . Interlaced or non-interlaced video may also be
used, unless explicitly stated otherwise in the text.
Expansion with 8 bit pixels is a straightforward process
and the number of devices needed is easily deduced from the
window sizes available in a single device. At pixel rates above
20MHz it may not be practical to use more than four devices.
since the full 32 bit intermediate precision is not available. The
lack of expansion multiplexing reduces the intermediate precision to 16 bits. The partial sum outputs must thus not
overflow these 16 bits; this will require the coefficients to be
scaled down appropriately with a resulting loss in accuracy.
Expansion with 16 bit pixels can be achieved in several
ways. The simplest way is to use two devices, each working
with 8 bit pixels. One device handles the least significant part
of the data, and its output feeds the expansion input of a
second device. This performs the most significant half of the
calculation. The least significant half is then added to the most
significant sum, after the latter has been multiplied by 256 ie
shifted by eight places. This shift is done internally and
controlled by Register D, bit 1. The internal 32 bit accuracy
prevents any loss in precision due the shift and add operation.
The window size with this arrangement is restricted to
that available in a single device, at the required pixel rate but
with 8 bit pixels. Thus two devices can be used , for example,
to provide an 8 x 8 window with 16 bit pixels and 10 MHz rates.
If a larger extended precision window is needed, it is
possible to use four devices. Each device is then programmed
to be in a 16 bit data mode, but should be restricted to rates
below 20 MHz, if the 32 bit intermediate precision is to be
maintained. In the 16 bit modes, however, the output from the
last line delay is not available due to pin limitations. This is not
a problem in a four device interlaced system, since half of the
devices will be fed from an external field delay. In non
interlaced systems additional external line delays would be
needed. An alternative approach would be to configure all the
devices in the appropriate 8 bit mode, do separate least
significant and most significant calculations, and then combine the results in an external adder after a wired in shift.
SINGLE DEVICE SYSTEMS
Figures 11 illustrates both EPROM and Host supported single device systems, with or without interlaced video.
In both cases the SINGLE and X15 pins must be tied tied low,
and the PC0, PC1, and DS pins are redundant. The PROG pin
18
becomes an output and indicates that a register load sequence is occuring. The first line delay must always be
bypassed in a non interlaced system, however, since an
internal pull up is not provided, the BYPASS pin should be tied
to VCC for the correct operation. With interlaced video the
BYPASS input is used to distinguish between the odd and
even fields.
The CE input may be left open circuit if coefficients are
to be simply loaded after a power on reset signal; the latter
being applied to the RES input. Alternatively the CE input may
be used to change the coefficients at any time after power on
reset; the EPROM would then need additional address bits for
the extra sets of coefficients that are to be stored.
In an interlaced system the pixels from the previous
field must use the IP7:0 inputs, and the live pixels must use the
L7:0 inputs. Interlaced sysytems requiring extended precision
pixels are non supported with a single device, since the L7:0
inputs are then use for the least significant 8 bits, and the IP7:0
inputs for any more significant bits.
If the X15 pin is left open circuit, an internal pull up will
configure the device in the host supported mode. The host
must then supply a data strobe and a R/W control line. The
X7:0 pins must be connected to the host data bus, and are
used to both load and read back register values. The PROG
and CE pins may be connected together, and then driven by
a host address decode. The output on PC1, which provides a
REPLY to the host, need not be used if the width of the data
strobe is greater than the maximum TEXP value given in
Figure 7.
The configuration bits 6:4 in REGISTER A define the
window size, maximum pixel rate, and pixel resolution. Window sizes smaller than the maximum in any configuration are
implemented by filling in the window with `zero' coefficients.
Bits 3:0 are irrelevent in the SINGLE mode, as is bit 7 if the gain
contol is used.
The result would be expected to lie in either the bottom 20
bits of the 32 bit result , or possibly in the next 20 bit field
displaced by four bits. Register C, bits 5:4, must thus select
one of these fields for subsequent use by the gain control. The
gain is then adjusted such that the 16 outputs available on
pins are in fact the 16 most significant bits of the result. The
gain needed is application specific, but if too much gain is used
the OV pin will indicate an overflow.
Register B, bits 2:1, must be set to select the required
method of defining the length of the line delays, and the use
of bit 3 is dependent on any external pixel delays before the
convolver input. No additional delays are needed on the pixel
inputs in a single device system, and REGISTER D, bits 4:2,
should be reset. The pipeline delay in the DELOP output path
should match one of those in Table 4, and is window size
dependent.
DUAL DEVICE CONFIGURATIONS
Two devices, each configured with 8 bit pixels and 8W
x 4D windows, can be used to provide an 8 x 8 window at up
to 20 MHz pixel rates. Figure 12 shows both the non interlaced
and interlaced arrangements.
Video lines containing up to 1024 pixels are possible
in both configurations, since each device only needs four line
delays. One device is configured as the MASTER by grounding the MASTER pin; the other then receives control signals in
PDSP16488A MA
those in a single device. This compensates for the twelve
delays added to the convolver sums in the second row, plus
an additional eight delays to compensate for the partial width
of the first device in the secind row.
Four devices can also be used to give an 8x8 window,
but with a 30 MHz pixel clock. Each device is configured to
provide a 4x4 partial window, but the maximum pixel rate is
reduced from 40 to 30 MHz because of the response of the line
delay expansion circuitry. Intermediate precision is restricted
to 16 bits, since time multiplexed data outputs cannot be used
above 20 MHz.
This configuration requires no additional delay in the
expansion inputs, and the inputs to the line stores in both
devices in the second row must be delayed by 8 clock cycles
[ Register D bits 3:2 = 10 ]. The DELOP output needs twelve
additional clock delays to match the processing delay.
Figures 14 and 15 show non-interlaced and interlaced
versions of the above 8 x 8 and 4 x 4 arrangements
Figure 16 shows how four devices can also be used to
provide an 8x8 window, with 16 bit pixels and 20MHz clock
rates. The expansion data from a previous device needs no
additional delay since the partial window size in each device
is only 4x4. The internal convolver sums from each device in
the second row must be delayed by 8 Clks and the DELOP
output must have 12 additional delays. If this arrangement is
to be used in a non-interlaced application, the field store must
be replaced by four line delays.
the normal way and has its MASTER and SINGLE pins left
open circuit.
The internal convolver sum, in the device producing
the final result, must be delayed by 4 pixels to match the
inherent delay in the expansion output from the other device.
This is actually achieved by delaying the pixel inputs to the line
stores [ Register D bits 3:2 = 01 ]. No additional delay in the
expansion input is needed, but the pipeline delay used to
produce DELOP must be four clocks greater than that given
in Table 4 for a single device. The DELOP output is redundant
in one of the two devices.
Two devices can also be used to support systems
requiring 16 bit pixels. With this approach the 16 x 8 multiplication is mechanized as two 8 x 8 operations, with the results
added together after the most significant half has been shifted
by 8 places to the most significant end. This shift operation is
controlled by Register D, Bit 1. Both convolvers are programmed to contain the same coefficients. The convolved
output can theoretically grow to 30 bits, and the appropriate
field must be selected before using the gain control.
Examples of this operating mode are shown in Figure
13. Each device must be configured in the same 8 bit pixel
operating mode, but the device producing the final result must
use the 8 place shift option on its internal sum.
The least significant 8 bits of the pixel are connected to
the MASTER device and the most significant 8 bits are
connected to the device producing the final result.. The
internal sum in this device must be delayed by four pixels to
match the delay in the expansion output from the first device.
This is actually achieved by delaying the pixel inputs to the line
stores( Register D, bits 4:2, = 001 ]. The expansion input
needs no additional delay [ Register D bits 1:0 = 10 ].
The actual pixel precision can be any number of pixels
between 8 and 16, and may be a signed or unsigned number.
Any unused, more significant bits, must respectively be either
sign extended or be tied low.
DELOP must have four additional pipeline delays in order
to match the total processing delay. This output can be
obtained from either device.
As shown in figure 17, six devices, each in an 8Wx4D
mode using 8 bit pixels, can provide a 16W x 12D window at
20MHz clock rates. Expansion inputs from previous devices
in a row [but not the first device in each row] need an extra 4
Clks of delay since the partial window is eight pixels wide.
Internal convolver sums need a differential delay of 12 Clk
cycles from row to row [ Register D bits 3:2 = 11 ].
The DELOP output must have 32 additional delays to
match the total processing delay.
FOUR DEVICE SYSTEMS
EIGHT DEVICE SYSTEMS
Four devices, each in the 8x8 mode, can be used to provide
a 16 x 16 window, with 8 bit pixel resolution and 10 MHz clock
rates. The partial sum from the first device in each row must
be delayed by eight pixel clocks before it is added to the result
from the next device. This provides the eight pixel displacement to match the width of the window. The delay is actually
provided by four additional delays in the expansion input to the
next device, plus the inherent four clock delays in outputing
results from the first device. Register D, Bit 0 controls the
additional delay.
The internal convolver sums, in the two devices in the
second row, must be delayed by 12 clocks before they are
added to the result from the first row. This twelve clock delay
is necessary because of the combination of the eight pixel
horizontal displacement delay , and the four clock delay in
outputing the result from the last device in the top row. It is
actually achieved by delaying the pixel inputs to the line stores.
(Register D, bits 3:2 = 11 ].
The DELOP output must have 20 delays additional to
Two additional chips will extend the above six device
configuration to a 16 x 16 window. Internal convolver sums
must have differential delays of 12 clock cycles between rows,
as in the six device system. The DELOP output needs 44
additional clock delays.
SIX DEVICE SYSTEMS
NINE DEVICE SYSTEMS
Nine devices each in the 8 x 8 mode will provide a 24
x 24 window with 8 bit data and 10 MHz pixel clocks. This is
shown in Figure 18. Expansion data inputs from previous
devices in a row [ but not the first device in each row ] need an
extra 4 Clks of delay. The internal convolver sums need
differential delays of 20 Clk cycles between rows. Sixteen of
the latter delays can be provided internally by setting Register
B, bit3, and also Register D, bits 3:2. The four extra delays
must be provided externally.
The DELOP output needs 56 clock delays in addition
to the 29 required for the 8 x 8 single device configuration.
19
PDSP16488A MA
ADDR
ADDR
DATA
DATA
EPROM
EPROM
BIN
DATA OUT
DELAYED
SYNC
OUTPUT
ENABLE
GND
PIXEL
DATA
BIN
X7:0
CE
PDSP
PDSP
16488
16488A
BYPASS
D15:0
R/W
DELOP
L7:0
OEN
OV
FIELD
DELAY
CLK
SNG
MSTR
OEN
HRES
CHANGE
COEFFICIENTS
PROG
DELOP
RES
IP7:0
RESET
MSTR
R/W
L7:0
X15
BIN
D15:0
OV
LEAST SIG
BYTE OF 16
BIT PIXEL
BYPASS
CLK
GND
CE
PDSP
16488A
16488
PROG
VCC
O/C
RES
SNG
HRES
SYNC
X7:0
X15
IP7:0
X14:8
SYNC
X14:8
PIXEL
DATA
GND
BIN
GND
CLOCK
PROG
DS
DATA
ADDR
R/W
REPLY
DATA
ADDR
ADDRESS
DECODE
HOST CPU
DS
REPLY
R/W
CLOCK
ADDRESS
DECODE
HOST CPU
BIN
GND
O/C
OEN
OUTPUT
ENABLE
OVERFLOW
PIXEL
DATA
DS
D15:0
L7:0
DELOP
ODD
FIELD
Figure 11 Single Device Systems
BIN
PC1
X7:0
X14:8
PDSP
PDSP
16488
16488A
R/W
CLOCK
20
BYPASS
GND
O/C
OEN
OV
DELAYED
SYNC
CE
CLK
DELOP
FIELD
DELAY
RES
HRES
PROG
DATA OUT
IP7:0
MSTR
D15:0
OV
CLK
PROG
L7:0
SNG
LEAST SIG
BYTE OF 16
BIT PIXEL
MSTR
R/W
PC0
SYNC
CE
X15
RES
HRES
RESET
SNG
DS
IP7:0
PDSP
16488
16488A
SYNC
BIN
PC1
X7:0
X14:8
X15
PC0
PIXEL
DATA
BYPASS
BIN
O/C
O/C
VCC
O/C
DELAYED
SYNC
OUTPUT
ENABLE
OVERFLOW
OVERFLOW
PROG
DATA OUT
ODD
FIELD
GND O/C
GND O/C
RESET
CHANGE
COEFFICIENTS
RESET
DATA OUT
DELAYED
SYNC
OUTPUT
ENABLE
OVERFLOW
CLOCK
PDSP16488A MA
EPROM
MSB
VCC
O/C
BYPASS
GND
R/W
CS1
PC1
CS0
RES
CE
PDSP
16488A
16488
8X4
8X4
WINDOW
SNG
DS
DELAYED
SYNC
DELOP
OEN
PROG
L7:0
D15:0
MST
HRES
X7:0
IP7:0
SYNC
X15
8 BIT
PIXEL
DATA
X14:8
PC0
GND
CLOCK
GND
O/C
CLOCK
GND
BIN
HRES
PROG
DS
OV
BIN
PC1
O/C
O/C
HOST CPU
DATA
X14:8
X7:0
ADDRESS
DECODE
STROBE
ADDR
R/W
O/P ENABLE
OEN
CLOCK
L7:0
R/W
DATA OUT
D15:0
MST
O/C
GND
CE
PDSP
PDSP
16488
16488A
8X4
8X4
WINDOW
WINDOW
BYPASS
RESET
RES
SNG
GND
X7:0
X14:8
X15
IP7:0
PC0
OVERFLOW
REPLY
IP7:0
HRES
DS
CE
PDSP
PDSP
16488A
16488
8X4
8X4
WINDOW
WINDOW
L7:0
OEN
PROG
R/W
D15:0
DELAYED
SYNC
DELOP
CLOCK
BYPASS
MST
ODD
FIELD
RES
SNG
SYNC
PC1
X15
O/C
GND
O/C
FIELD
DELAY
CLOCK
GND
BIN
R/W
O/C
DS
OV
BIN
PC1
X7:0
X14:8
PDSP
PDSP
16488A
16488
8X4
8X4
WINDOW
WINDOW
RESET
CE
D15:0
OEN
DATA OUT
O/P ENABLE
CLOCK
L7:0
RES
MST
O/C
BYPASS
SNG
HRES
VCC
O/C
X15
IP7:0
PROG
8 BIT
PIXEL
DATA
PC0
OVERFLOW
READ
REG
O/C
Figure 12. 8 Bit Dual Device Systems
21
PDSP16488A MA
GC
SIG
GC
SIG
GC
SIG
GC
SIG
1
N/C
34
N/C
67
N/C
100
N/C
2
D0
35
X2
68
IP1
101
VDD
3
OEN
36
X3
69
GND
102
F0
4
BIN
37
X4
70
IP2
103
D15
5
PC1
38
N/C
71
N/C
104
N/C
72
VDD
105
D14
6
VDD
39
X5
7
GND
40
GND
73
IP3
106
D13
8
OVER
41
X6
74
VDD
107
GND
9
N/C
42
X7
75
IP4
108
D12
10
HRES
43
N/C
76
GND
109
GND
11
R/W
44
X8
77
IP5
110
VDD
78
GND
111
VDD
12
CE
45
X9
13
N/C
46
VDD
79
IP6
112
D11
14
N/C
47
VDD
80
VDD
113
D10
15
GND
48
VDD
81
IP7
114
D9
16
N/C
49
X10
82
VDD
115
GND
17
DS
50
MASTER
83
N/C
116
CLK
84
L7
117
CLK
18
GND
51
N/C
19
VDD
52
X11
85
GND
118
CLK
20
PROG
53
X12
86
L6
119
GND
21
GND
54
SINGLE
87
GND
120
GND
22
CS3
55
GND
88
L5
121
D8
23
CS2
56
GND
89
VDD
122
VDD
90
L4
123
D7
24
CS1
57
N/C
25
CS0
58
X13
91
VDD
124
D6
26
VDD
59
X14
92
L3
125
D5
27
RES
60
N/C
93
VDD
126
D4
28
PC0
61
X15
94
L2
127
GND
29
N/C
62
VDD
95
GND
128
D3
96
L1
129
N/C
30
DELOP
63
BYPASS
31
X0
64
IP0
97
F1
130
D2
32
X1
65
VDD
98
L0
131
D1
33
N/C
66
N/C
99
N/C
132
N/C
GC132 Pin out table
22
PDSP16488A MA
EPROM
MSB
IP7:0
HRES
BYPASS
L7:0
DS
PROG
R/W
GND
PC1
CS0
CS1
CE
D15:0
DELAYED
SYNC
DELOP
OEN
MST
VCC
O/C
RES
PDSP
16488
16488A
8X4
WINDOW
SNG
SYNC
X7:0
X15
LSB
X14:8
PC0
GND
16 BIT
PIXEL
DATA
CLOCK
GND
O/C
CLOCK
GND
BIN
IP7:0
HRES
O/C
L7:0
OV
BIN
PC1
SNG
DS
PROG
R/W
GND
CE
PDSP
16488A
16488
8X4
WINDOW
O/C
HOST CPU
DATA
X7:0
ADDRESS
DECODE
STROBE
ADDR
O/P ENABLE
OEN
O/C
R/W
X14:8
DATA OUT
D15:0
CLOCK
BYPASS
RESET
RES
MST
O/C
X7:0
X14:8
MSB
X15
PC0
OVERFLOW
REPLY
IP7:0
HRES
DS
PC1
CE
PDSP
16488
16488A
8X4
WINDOW
BYPASS
LSB
L7:0
OEN
PROG
R/W
D15:0
DELAYED
SYNC
DELOP
CLOCK
ODD
FIELD
RES
MST
SYNC
SNG
LSB
X15
PC0
O/C
GND
O/C
CLOCK
GND
BIN
MSB
BYPASS
L7:0
R/W
O/C
D7;0
DS
OV
BIN
PC1
X7:0
X14:8
PDSP
16488A
16488
8X4
WINDOW
RESET
CE
D15:0
OEN
DATA OUT
O/P ENABLE
CLOCK
16 BIT
PIXEL
DATA
O/C
VCC
RES
MST
FIELD
DELAY
SNG
HRES
PROG
IP7:0
X15
MSB
PC0
OVERFLOW
READ
REG
O/C
Figure 13. Dual Device 16 Bit Systems.
23
REPLY
PDSP16488A MA
ADDR
DATA
X7:0
R/W
X14:8
PROG
DECODE
HOST
CPU
DS
PIXEL
SYNC DATA
RES
PC1
CLK
MST
OV
BIN
PC1
X14:8
X15
PC0
RES
PC1
X7:0
X14:8
BIN
OVERFLOW
CE
PDSP
PDSP
16488A
16488
BYPASS
D15:0
HRES
GND
CE
PDSP
PDSP
16488A
16488
BYPASS
RESET
D15:0
DATA
OUT
OEN
O/P
ENABLE
O/C
O/C
Figure 14. Four Device Non Interlaced System.
CLK
PROG
MST
O/C
GND
DS
SNG
O/C
CLK
MST
SNG
PROG
L7:0
OEN
R/W
X15
PC0
GND
O/C
O/C
L7:0
24
DELAYED
SYNC
OEN
SNG
GND
R/W
CLK
MST
SNG
DELOP
DS
IP7:0
HRES
R/W
X7:0
X14:8
L7:0
IP7:0
DS
D15:0
O/C
VCC
O/C GND
GND
CE
RES
D15:0
OEN
PROG
R/W
PDSP
PDSP
16488A
16488
BYPASS
[MASTER]
DS
X15
PC0
HRES
CE
X7:0
BYPASS
L7:0
RES
PDSP
PDSP
16488A
16488
HRES
O/C
VCC
IP7:0
PROG
IP7:0
PC1
X15
O/C
PDSP16488A MA
EPROM
ADDR
DATA
X14:8
X7:0
UPPER
ADDR
BITS
ALS
138
PIXEL
SYNC DATA
RES
PC1
X7:0
X15
PC0
CS1
RES
CS0
IP7:0
HRES
HRES
VCC
O/C
X14:8
IP7:0
PC1
X15
PC0
GND
PDSP
16488A
16488
BYPASS
D15:0
O/C
VCC
CE
PDSP
16488
16488A
BYPASS
D15:0
[MASTER]
O/C
O/C
CLK
MST
GND
O/C GND
OEN
PROG
R/W
CLK
MST
DS
GND
SNG
GND
OEN
SNG
R/W
PROG
DS
DELAYED
SYNC
DELOP
GND
O/C
RES
OV
BIN
PC1
X7:0
DS
GND
O/C
O/C
CE
RESET
D15:0
DATA
OUT
OEN
O/P
ENABLE
CLK
GND
PROG
CLK
PDSP
16488
16488A
BYPASS
MST
O/C
MST
OEN
SNG
PROG
R/W
D15:0
R/W
PDSP
16488
16488A
HRES
SNG
CE
BYPASS
GND
BIN
OVERFLOW
HRES
DS
X14:8
IP7:0
IP7:0
ODD
FIELD
X15
PC0
RES
PC1
X7:0
X14:8
X15
PC0
FIELD
DELAY
Figure 15. Four Device Interlaced System.
25
REPLY
PDSP16488A MA
DATA
PROG
DECODE
ADDR
R/W
X7:0
16 BIT
PIXEL
SYNC DATA
X14:8
HOST
CPU
DS
MSB
RES
PC1
X7:0
X15
HRES
CE
CE
PDSP
16488
16488A
BYPASS
LSB
L7:0
D15:0
VCC
O/C
BYPASS
PDSP
16488
16488A
D15:0
L7:0
DELAYED
SYNC
DELOP
[MASTER]
DS
OEN
O/C
O/C
CLK
MST
O/C GND
SNG
PROG
GND
R/W
CLK
OEN
MST
R/W
PROG
DS
SNG
O/C
PC0
IP7:0
HRES
VCC
X14:8
IP7:0
RES
MSB
PC1
X15
O/C
GND
LSB
FIELD
DELAY
CE
PDSP
PDSP
16488
16488A
HRES
D15:0
BYPASS
OEN
DS
L7:0
LSB
RES
OV
PC1
X7:0
BIN
CE
PDSP
16488
16488A
DATA
OUT
O/C
O/C
OEN
O/P
ENABLE
CLK
MST
SNG
PROG
CLK
MST
SNG
O/C
GND
O/C
Figure 16. Four Device System with 16 Bit Pixels
26
RESET
D15:0
L7:0
PROG
R/W
DS
R/W
LSB
X14:8
OVERFLOW
HRES
ODD
FIELD
BIN
IP7:0
IP7:0
BYPASS
X15
PC0
RES
X7:0
PC1
X14:8
X15
MSB
PC0
MSB
PDSP16488A MA
SYNC
DATA
ADDR
EPROM
ALS
138
CHIP
ENABLES
UPPER
ADDR
DATA
IN
RES
PC1
X7:0
X15
X14:8
PCO
RES
X7:0
PC1
GND
X14:8
PCO
DELAYED
SYNC
OEN
CLOCK
PROG
R/W
DS
GND
RES
PC1
DELOP
L7:0
IP7:0
CE
GND
BYPASS
DS
GND
OEN
L7:0
OEN
CLOCK
DS
R/W
PROG
L7:0
D15:0
CLOCK
GND
BYPASS
CE
PDSP
16488
16488A
HRES
D15:0
R/W
PDSP
PDSP
16488
16488A
HRES
PROG
IP7:0
GND
D15:0
BYPASS
GND
X7:0
PCO
X15
X14:8
GND
VCC
O/C
OEN
CLOCK
R/W
DS
MSTR
GND
CE
PDSP
16488
16488A
HRES
D15:0
BYPASS [MASTER]
L7:0
IP7:0
O/C
X15
CE
PDSP
PDSP
16488A
16488
PROG
O/C
VCC
GND
GND
OV
RES
BIN
PC1
X7:0
X14:8
PCO
X15
RES
PC1
X7:0
X14:8
X15
OVERFLOW
RESET
IP7:0
GND
GND
BYPASS
O/C
L7:0
D15:0
CLOCK
OEN
R/W
OEN
L7:0
GND
CE
PDSP
PDSP
1648
16488A
8
HRES
D15:0
CLOCK
O/C
PROG
BYPASS
R/W
GND
CE
PDSP
PDSP
1648
16488
16488A
8
HRES
PROG
IP7:0
DS
PCO
BIN
DS
HRES
CS1
RES
IP7:0
HRES
CS2
CS0
PC1
X7:0
PCO
X15
X14:8
GND
DATA
OUT
O/P
ENABLE
GND
Figure 17. Six Device Non Interlaced System.
27
PDSP16488A MA
CE1
ALS
138
EPROM
DATA IN
DATA
DATA
ADDR
ADDR
EPROM
CE7
UPPER
UPPER
ADDR
ADDRESS
CE8
HRES
D15:0
VCC
O/C
BYPASS
GND
GND
OV
RES
PC1
BIN
X7:0
PC0
GND
BYPASS
CE
PDSP
PDSP
16488
16488A
D15:0
OEN
L7:0
DS
CLOCK
PROG
GND
R/W
MST
HRES
O/C
VCC
OEN
DS
CLOCK
OEN
R/W
PROG
L7:0
IP7:0
CE1
D15:0
RES
X15
X14:8
OV
PC1
BIN
X7:0
RES
CE
PDSP
PDSP
16488
16488A
GND
DS
BYPASS
IP7:0
O/C
CLOCK
VCC
O/C
CE
PDSP
PDSP
16488
16488A
PROG
HRES
R/W
SYNC
PC0
X15
CS3
IP7:0
X14:8
CS1
CS2
PC1
CS0
X7:0
X15
X14:8
PC0
GND
GND
GND
CE
HRES
D15:0
GND
GND
BYPASS
OV
PC1
BIN
RES
CE
PDSP
PDSP
16488
16488A
D15:0
GND
DS
DS
CLOCK
PROG
OEN
R/W
OEN
CLOCK
BYPASS
PROG
GND
GND
PDSP
PDSP
16488
16488A
R/W
R/W
D15:0
OEN
PROG
L7:0
HRES
DS
BYPASS
IP7:0
CE
PDSP
PDSP
16488
16488A
CLOCK
HRES
X7:0
PC0
X15
X14:8
OV
RES
PC1
BIN
IP7:0
IP7:0
GND
X7:0
PC0
X15
X14:8
OV
RES
BIN
PC1
X7:0
X14:8
PC0
X15
4 CLK
DELAYS
GND
GND
4 CLK
DELAYS
BIN
OVERFLOW
BYPASS
O/C
L7:0
DS
DELAYED SYNC
GND
GND
Figure 18. Nine Device Non Interlaced System.
28
OV
RES
PC1
BIN
X7:0
PDSP
PDSP
16488
16488A
CE
CE8
D15:0
DATA
OUT
OEN
O/P
ENABLE
DS
GND
GND
DELOP
X14:8
X15
PC0
OV
RES
PC1
BIN
CLOCK
R/W
L7:0
PROG
OEN
O/C
DS
BYPASS
HRES
D15:0
CLOCK
GND
IP7:0
CE7
CE
PROG
D15:0
PDSP
PDSP
16488
16488A
R/W
HRES
OEN
CLOCK
L7:0
CE
GND
BYPASS
R/W
O/C
PDSP
PDSP
16488
16488A
PROG
GND
X7:0
IP7:0
IP7:0
HRES
X14:8
PC0
X15
OV
RES
PC1
BIN
X7:0
X14:8
X15
PC0
RESET
PDSP16488A MA
Part No:
PDSP16488 Single Chip 2D Convolver with Integral Line Delays
Package Type:
GC132
Pin No.
GC
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
Volts.
N/C
N/C
V1
N/C
GND
V1
GND
N/C
N/C
V1
GND
V1
N/C
N/C
V1
N/C
V1
GND
V1
GND
GND
N/C
N/C
N/C
N/C
V1
GND
N/C
N/C
N/C
N/C
N/C
N/C
Pin No.
GC
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
Volts.
N/C
N/C
N/C
N/C
N/C
N/C
GND
N/C
N/C
N/C
N/C
N/C
V1
V1
V1
N/C
V1
N/C
N/C
N/C
V1
GND
GND
N/C
N/C
N/C
N/C
N/C
V1
GND
GND
V1
N/C
Pin No.
GC
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
Volts
N/C
GND
GND
GND
N/C
V1
GND
V1
V1
GND
V1
GND
V1
V1
V1
V1
N/C
GND
GND
GND
GND
GND
V1
GND
V1
GND
V1
GND
GND
GND
N/C
GND
N/C
Pin No.
GC
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
Volts.
N/C
V1
N/C
N/C
N/C
N/C
N/C
GND
N/C
GND
V1
V1
N/C
N/C
N/C
GND
N/C
N/C
V1
GND
GND
N/C
V1
N/C
N/C
N/C
N/C
GND
N/C
N/C
N/C
N/C
N/C
VDD max = +5.0V = V1
N/C = not connected
Figure 19. Life Test/Burn-in connections
NOTE: PDA is 5% and based on groups 1 and 7
ORDERING INFORMATION
PDSP16488A MA GCPR (QFP Package - Non-compliant to MIL-STD-883 Class B)
PDSP16488A MA ACBR (PGA Package - Non-compliant to MIL-STD-883 Class B)
29
http://www.mitelsemi.com
World Headquarters - Canada
Tel: +1 (613) 592 2122
Fax: +1 (613) 592 6909
North America
Tel: +1 (770) 486 0194
Fax: +1 (770) 631 8213
Asia/Pacific
Tel: +65 333 6193
Fax: +65 333 6192
Europe, Middle East,
and Africa (EMEA)
Tel: +44 (0) 1793 518528
Fax: +44 (0) 1793 518581
Information relating to products and services furnished herein by Mitel Corporation or its subsidiaries (collectively “Mitel”) is believed to be reliable. However, Mitel assumes no
liability for errors that may appear in this publication, or for liability otherwise arising from the application or use of any such information, product or service or for any infringement of
patents or other intellectual property rights owned by third parties which may result from such application or use. Neither the supply of such information or purchase of product or
service conveys any license, either express or implied, under patents or other intellectual property rights owned by Mitel or licensed from third parties by Mitel, whatsoever.
Purchasers of products are also hereby notified that the use of product in certain ways or in combination with Mitel, or non-Mitel furnished goods or services may infringe patents or
other intellectual property rights owned by Mitel.
This publication is issued to provide information only and (unless agreed by Mitel in writing) may not be used, applied or reproduced for any purpose nor form part of any order or
contract nor to be regarded as a representation relating to the products or services concerned. The products, their specifications, services and other information appearing in this
publication are subject to change by Mitel without notice. No warranty or guarantee express or implied is made regarding the capability, performance or suitability of any product or
service. Information concerning possible methods of use is provided as a guide only and does not constitute any guarantee that such methods of use will be satisfactory in a specific
piece of equipment. It is the user’s responsibility to fully determine the performance and suitability of any equipment using such information and to ensure that any publication or
data used is up to date and has not been superseded. Manufacturing does not necessarily include testing of all functions or parameters. These products are not suitable for use in
any medical products whose failure to perform may result in significant injury or death to the user. All products and materials are sold and services provided subject to Mitel’s
conditions of sale which are available on request.
M Mitel (design) and ST-BUS are registered trademarks of MITEL Corporation
Mitel Semiconductor is an ISO 9001 Registered Company
Copyright 1999 MITEL Corporation
All Rights Reserved
Printed in CANADA
TECHNICAL DOCUMENTATION - NOT FOR RESALE