PSAM; NEC PCIe SSD Appliance for Microsoft® SQL Server® (Reference Architecture)

“PSAM”, NEC PCIe SSD Appliance
for Microsoft® SQL Server®
(Reference Architecture)
September 11th, 2014
NEC Corporation
1. Overview of
NEC PCIe SSD Appliance for
Microsoft® SQL Server®
Page 2
© NEC Corporation 2014
NEC PCIe SSD Appliance for Microsoft SQL Server
- Customer’s Pain : Background of the Solution ▌System bottleneck from
performance gap between in-box
memory / storage and external
disks
l Performance tuning is getting more
complicated, needs a lot of external
storage and Host Bus Adapters to
provide enough IO bandwidth for a
high-performance system
▌Balanced CPU-IO configuration
restricted by the latest QPI
architecture-based processors
l The architecture requires extra CPU
sockets, even if customer just wants
to have more IO slots (or memory).
The extra CPU sockets contribute to
increased power consumption and
system cost
1PB
Huge Performance gap
QPI
100TB
10TB
PCIe
SSD
<1us
<150us
~2ms
~8ms
~20ms
DIMM
DIMM
DIMM
DIMM
DIMM
DIMM
DIMM
DIMM
DIMM
DIMM
DIMM
DIMM
DIMM
DIMM
DIMM
DIMM
IO Controller
CPU Cache
<ns
Memory
Controller
SATA
Disk
Array
(Access latency)
PCIe slot
DRAM
QPI
QPI
PCIe slot
PCIe slot
1TB
SAS
SSD
SAS
Disk
Array
CPU
Memory
Controller
(Storage Capacity)
QPI based CPU
* QPI = Quick Path Interconnect
(Interconnect between CPUs)
“PCIe SSD (server-mounted flash storage)” and
“Rich IO slots (per CPU)” are the keys to solve those issues
Page 3
© NEC Corporation 2014
NEC PCIe SSD Appliance for Microsoft SQL Server
- Overview Proven performance reference architecture for TCO optimization
- Best fit for High-Performance & Scalable DWH/BI/OLTP system NEC Solution Platforms
l High-performance and high availability appliance
の露出用 system (reference
記載文言は検討中
architecture) with no external storage
ü NEC Express5800 Scalable Enterprise Server-based “all in a box” solution combined
with high-performance HGST FlashMAX II PCIe server-mounted flash storage
l Best practice configuration fully utilizing NEC’s knowledge of SQL Server
ü Support the latest Microsoft SQL Server “SQL Server 2014”
ü Balanced CPU-IO performance depending on number of Flash MAX II
(i.e., depending on data size )
ü No data lost by utilizing mirrored storage pool feature of Windows OS
Single Node model
AlwaysOn AG two node cluster model
Express5800 Scalable
Enterprise Server
• Max. 4 Intel® Xeon® Processor
E7 v2 family CPU
• Max. 4TB memory
• Max. 16 slots of PCI-Express 3.0
(Class No.1 rich IO slots)
Page 4
© NEC Corporation 2014
10G Ether
High band width
Low Latency
2. Reference configuration of
NEC PCIe SSD Appliance for
Microsoft® SQL Server®
Page 5
© NEC Corporation 2014
NEC PCIe SSD Appliance for Microsoft SQL Server
- Reference configuration per node Ø Optimized performance combining fast clock CPU (E7-8893 v2:6C, 3.4GHz) and
high-performance HGST FlashMAX II PCIe server-mounted flash storage (FM2)
Ø No data loss by utilizing software-mirrored storage pool feature of Windows OS
Ø Balanced CPU-IO performance depending on number of FM2 (i.e., data size )
• 1CPU (3.4GHz, 6C)
• 128GB RAM
• 2x FM2 in mirrored
storage pool
• 4CPU (3.4GHz, 6C)
• 512GB RAM
• 8x FM2 in mirrored
storage pool
Express5800 Scalable Enterprise Server
FlashMAX II
SW Mirroring
FlashMAX II
128GB
RAM
128GB
RAM
E78893
v2
E78893
v2
FlashMAX II
SW Mirroring
FlashMAX II
QPI
Scale
Up
Scale
Up
• 2CPU (3.4GHz, 6C)
• 256GB RAM
• 4x FM2 in mirrored
storage pool
• 3CPU (3.4GHz, 6C)
• 384GB RAM
• 6x FM2 in mirrored
storage pool
Scale Up
Page 6
© NEC Corporation 2014
FlashMAX II
SW Mirroring
FlashMAX II
Internal Drive
E78893
v2
E78893
v2
128GB
RAM
128GB
RAM
FlashMAX II
SW Mirroring
FlashMAX II
Reference configuration as single node Data Warehouse system
CPU and Memory
Express5800 Scalable Enterprise Server
Built-in DVD-ROM Drive
Additional CPU (E7-8893 v2, 6 core)
Memory Riser
16GB additional memory (8GB DIMM x2)
10KRPM 600GB HDD
RAID SAs-Controller (1GB, RAID 0/1/5/6)
1000BASE-T Network Adopter (4 port)
1.1TB FlashMAX II
# of card
Maximum Physical Storage
Capacity per node *
•Max capacity mode
•Mirrored Storage Pool
1CPU
128GB
RAM
1
1
1
2
8
3
1
1
2CPU
256GB
RAM
1
1
2
4
16
5
1
2
3CPU
384GB
RAM
1
1
3
6
24
5
1
3
4CPU
512GB
RAM
1
1
4
8
32
7
1
4
2
4
6
8
1.1TB
2.2TB
3.3TB
4.4TB
* Data capacity after compression. About 1/5 of logical user data capacity before compression
Page 7
© NEC Corporation 2014
Reference configuration as single node OLTP system
CPU and Memory
Express5800 Scalable Enterprise Server
Built-in DVD-ROM drive
Additional CPU (E7-8893 v2, 6 core)
Memory Riser
16GB additional memory (8GB DIMMx2)
10krpm 600GB SAS-HDD
RAID SAS-Controller (1GB, RAID 0/1/5/6)
1000BASE-T Network Adopter (4 port)
# of card
Maximum Physical Storage
1.1TB
Capacity per node
Flash MAXII
• Max Performance mode
•Mirrored Storage Pool
Page 8
© NEC Corporation 2014
1CPU
128GB
RAM
1
2CPU
256GB
RAM
1
3CPU
384GB
RAM
1
4CPU
512GB
RAM
1
1
1
2
8
3
1
1
2
1
2
4
16
5
1
2
4
1
3
6
24
5
1
3
6
1
4
8
32
7
1
4
8
0.9TB
1.7TB
2.6TB
3.4TB
Reference configuration as two node OLTP system
(configuration per node)
CPU and Memory
Express5800 Scalable Enterprise Server
Built-in DVD-ROM drive
Additional CPU (E7-8893 v2, 6 core)
Memory Riser
16GB additional memory (8GB DIMMx2)
10krpm 600GB SAS-HDD
RAID SAS-Controller (1GB, RAID 0/1/5/6)
1000BASE-T Network Adopter (4 port)
# of card
Maximum Physical Storage
1.1TB
Capacity per node
FlashMAXII
• Max Performance mode
•Mirrored Storage Pool
10GBASE Network Adopter (SFP+/2 port)
Page 9
© NEC Corporation 2014
2CPU
256GB
RAM
1
3CPU
384GB
RAM
1
4CPU
512GB
RAM
1
1
2
4
16
5
1
2
4
1
3
6
24
5
1
3
6
1
4
8
32
7
1
4
8
1.7TB
2.6TB
3.4TB
1
1
1
3. Performance of
NEC PCIe SSD Appliance for
Microsoft® SQL Server®
as Dara Warehouse system
Page 10
© NEC Corporation 2014
Single Node DWH Performance (1/2)
DWH performance comparison with Express5800/A1080a based FTRA DWH
using Microsoft Fast-track Data Warehouse Benchmark tool.
(FTRA DWH =Microsoft SQL Server Fast Track Data Warehouse)
Configuration for
DWH performance comparison
CPU
NEC PCIe SSD Appliance
for Microsoft® SQL Server®
Express5800/A1080a-S
FTRA DWH
E7-8893 v2 (6C, 3.4GHz)
E7-8870 (10C, 2.4GHz)
CPU socket count
1
2
3
4
2
Total core count
6
12
18
24
20
Physical memory capacity
128GB
256GB
384GB
512GB
128GB
Storage configuration
1.1TB FlashMAX II (Max Capacity Mode)
Mirrored storage pool configuration
1set
Physical Storage Capacity
1.1TB
User data*
*
Physical Data capacity after compression ~0.5TB
About 1/5 of logical user data
Capacity
capacity before compression.
per
Transaction log
~0.2TB
Usage
tempdb
~0.3TB
Page 11
© NEC Corporation 2014
2 sets
3sets
4 sets
iStorage M100 x3
(72 450GB 10krpm SAS-HDD)
RAID-10 18 pair
2.2TB
3.3TB
4.4TB
16.2TB
~1.0TB
~1.5TB
~2.0TB
8.0TB
~0.4TB
~0.6TB
~0.8TB
3.5TB
0.6TB
~0.9TB
~1.2TB
4.5TB
Single Node DWH Performance (2/2)
Performance comparison using Microsoft Fast-track Data Warehouse benchmark tool
8,234
3.50
8,000
7,000
3.00
Benchmark Logical Scan rate [MB/s]
Read time to scan all user data [s]
2.50
6,000
4,820
5,000
4,000
4,759
3,422
1.50
3,000
2,000
1,000
0
1.00
1,425
0.50
0.36
0.30
0.32
0.25
1.72
1P6C
2P12C
3P18C
4P24C
2P20C
NEC PCIe SSD Appliance for Microsoft SQL Server
(E7-8893v2 3.4GHz)
Page 12
2.00
© NEC Corporation 2014
0.00
Fast-track Data ware house
NEC Express5800 A1080a-S
(E7-8870 2.4GHz)
Benefit as Data Warehouse system
Performance boost with significant TCO saving
Express5800 A1080a-S base
Microsoft Fast-track Data Warehouse
NEC PCIe SSD Appliance for
Microsoft® SQL Server ®
2P20C (2.4GHz) + 128GB RAM
4P24C (3.4GHz) + 512GB RAM
To Balance CPU power
and IO bandwidth,
12 FC channels and
72 HDDs are required
8G-FC channels x12
(2p 8G-FC HBA x6)
Total 72 SAS-HDD drives
13U
69% less rack space
4U
2,125W
27% less power consumption
1,551W
4,759MB/s
173% better performance
8,234MB/s
Rack space
Power Consumption
Benchmark logical
scan rate
• Microsoft Fast-track Data ware
house benchmark tool
Page 13
© NEC Corporation 2014
All-n-a-box
Solution
4. Performance of
NEC PCIe SSD Appliance for
Microsoft® SQL Server®
as single node OLTP system
Page 14
© NEC Corporation 2014
Single Node OLTP Performance (1/2)
OLTP performance comparison with “Express5800/A1080a-S + external
FC storage system” using “Hammer DB” benchmark tool.
Configuration for
OLTP performance
Comparison
NEC PCIe SSD Appliance
for Microsoft® SQL Server®
Express5800/A1080a-S
+ External FC storage
E7-8893 v2 (6C, 3.4GHz)
E7-8870 (10C, 2.4GHz)
CPU
CPU socket count
1
2
3
4
2
Total core count
6
12
18
24
20
128GB
256GB
384GB
512GB
256GB
Physical memory capacity
1.1TB FlashMAX II (Max Performance Mode)
Mirrored storage pool configuration
1set
2 sets
3sets
4 sets
iStorage M100 x1
(24 450GB 10krpm SAS-HDD)
RAID-10 6 pair
Physical Storage Capacity
0.9TB
1.7TB
2.6TB
3.4TB
5.4TB
Physical User data
Capacity
Transaction log
per
tempdb
Usage
~0.5TB
~1.0TB
~1.5TB
~2.0TB
3.6TB
~0.2TB
~0.4TB
~0.6TB
~0.8TB
1.0TB
~0.2TB
~0.3TB
~0.5TB
~0.6TB
0.8TB
Storage configuration
Page 15
© NEC Corporation 2014
Single Node OLTP Performance (2/2)
Performance comparison using HammerDB benchmark tool
Relative performance
200
Performance down rate
Relative performance of Non-Optimized I/O scenario
150
Performance down rate between 2 scenarios (%)
50%
47%
100
131
46%
95
50
0
100%
Relative performance of Optimized I/O scenario
96%
178
75%
49%
91
50%
100
71
54
25%
50
27
1P6C
4
2P12C
3P18C
NEC PCIe SSD Appliance
for Microsoft SQL Server
(E7-8893v2 3.4GHz)
4P24C
0%
2P20C
NEC Express5800 A1080a-S
(E7-8870 2.4GHz)
+ iStorage M100 x1
ü NEC Express5800 A1080a-S performance of optimized I/O scenario is the base performance of the relative performance comparison
ü Optimized IO scenario
: Minimize IO by enabling Server side memory caches and Storage side cache.
ü Non-optimized IO scenario: Maximize IO by disabling Server side memory caches and Storage side cache.
Page 16
© NEC Corporation 2014
Benefit as single node OLTP system
Achieve high performance even in heavy IO stress, requires
less DB tuning cost (i.e., less system integration cost)
Express5800 A1080a-S
+iStorage M100
2P20C (2.4GHz) + 256GB RAM
NEC PCIe SSD Appliance for
Microsoft® SQL Server ®
4P24C (3.4GHz) + 512GB RAM
8G-FC channels x 4
(2p 8G-FC HBA x 2)
All-in-a-box
solution
Total 24 SAS-HDD drives
Relative OLTP performance (TPS)
comparison for optimized I/O
scenario *1, *2
100
OLTP performance down ratio
between the optimized IO scenario
And Non-optimized IO scenarios *1,3
96%
178% better performance
53% less impact of IO
178
49%
*1 Relative performance comparison using HammerDB benchmark tool
*2 Optimized IO scenario
: Minimize IO by enabling Server side memory caches and Storage side cache.
*3 Non-optimized IO scenario: Maximize IO by disabling Server side memory caches and Storage side cache.
Page 17
© NEC Corporation 2014
5. Performance of
NEC PCIe SSD Appliance for
Microsoft® SQL Server®
as two node cluster OLTP system
Page 18
© NEC Corporation 2014
Two Node OLTP Performance (1/2)
OLTP performance comparison with “Express5800/A1080a-S + external
FC storage system” using “Hammer DB” benchmark tool.
Configuration (per node)
for OLTP performance
Comparison
NEC PCIe SSD Appliance
for Microsoft® SQL Server®
Express5800/A1080a-S
+ External FC storage
CPU
E7-8893 v2 (6C, 3.4GHz)
E7-8870 (10C, 2.4GHz)
CPU socket count
2
3
4
2
Total core count
12
18
24
20
256GB
384GB
512GB
256GB
Physical memory capacity
1.1TB FlashMAX II (Max
Performance Mode)
Storage configuration
Mirrored storage pool
configuration
iStorage M100 x1
(24 450GB 10krpm SAS-HDD)
RAID-10 6 pair
2 sets
3sets
4 sets
Physical Storage Capacity
1.7TB
2.6TB
3.4TB
5.4TB
Physical User data
Capacity
Transaction log
per
tempdb
Usage
~1.0TB
~1.5TB
~2.0TB
3.6TB
~0.4TB
~0.6TB
~0.8TB
1.0TB
~0.3TB
~0.5TB
~0.6TB
0.8TB
Page 19
© NEC Corporation 2014
Two Node OLTP Performance (2/2)
Performance comparison using HammerDB benchmark tool
Relative performance
200
Performance down rate
100%
Relative performance of Optimized I/O scenario
98%
Relative performance of Non-Optimized I/O scenario
150
Performance down rate between 2 scenarios (%)
124
100
50
29%
29%
165
113
88
80
75%
32%
50%
100
25%
57
2
0
2P12C
3P18C
NEC PCIe SSD Appliance
for Microsoft SQL Server
(E7-8893v2 3.4GHz)
4P24C
0%
2P20C
NEC Express5800 A1080a-S
(E7-8870 2.4GHz)
+ iStorage M100 x1
ü NEC Express5800 A1080a-S performance of optimized I/O scenario is the base performance of the relative performance comparison
ü Optimized IO scenario
: Minimize IO by enabling Server side memory caches and Storage side cache.
ü Non-optimized IO scenario: Maximize IO by disabling Server side memory caches and Storage side cache.
Page 20
© NEC Corporation 2014
Benefit as two node OLTP system
Achieve high performance even in heavy IO stress, requires
less DB tuning cost (i.e., less system integration cost)
Express5800 A1080a-S
+iStorage M100
2P20C (2.4GHz) + 256GB RAM
NEC PCIe SSD Appliance for
Microsoft® SQL Server ®
4P24C (3.4GHz) + 512GB RAM
8G-FC channels x 4
(2p 8G-FC HBA x 2)
All-in-a-box
solution
Total 24 SAS-HDD drives
Relative OLTP performance (TPS)
comparison for optimized I/O
scenario *1, *2
100
OLTP performance down ratio
between the optimized IO scenario
And Non-optimized IO scenarios *1,3
98%
165% better performance
34% less impact of IO
165
32%
*1 Relative performance comparison using HammerDB benchmark tool
*2 Optimized IO scenario
: Minimize IO by enabling Server side memory caches and Storage side cache.
*3 Non-optimized IO scenario: Maximize IO by disabling Server side memory caches and Storage side cache.
Page 21
© NEC Corporation 2014
6. Recommendation to improve
availability of the system
Page 22
© NEC Corporation 2014
EXPRESSCLUSTER X SingleServerSafe (SSS)
To reinforce availability of the system, it is recommended
to install EXPRESSSLUSTER X SSS
▌What is EXPRESSCLUSTER X SSS ?
lEXPRESSCLUSTER X SSS is a program product
which monitors abnormality of HW & SW, recovers
them from failure, and improves availability of the
system.
lEXPRESSCLUSTER X SSS contributes to reinforce
fault resiliency of 2 node AlwaysON AG system also.
üAdditional to monitoring targets of the AlwaysOn, improve
fault resiliency of the 2 node system by monitoring
• Disk failure:
Restarts the node, then fail over to the secondary node to
continue operation.
• LAN failure of the secondary “read only” node:
Change routing of “read only access” from the faulty node to
the other heathy node (in 2 node system, primary node) to
continue system operation.
Page 23
© NEC Corporation 2014
Router, other network devices
Ping check monitoring
Application layer
App
Presence monitoring
Response monitoring
Monitoring agent
X SingleServerSafe
OS layer
Stall monitoring
OS space
Access Check
Hardware layer
System
disk
PCIeSSD
NIC
Link-down monitoring
NIC
EXPRESSCLUSTER X SSS’s target of monitoring
▌Accurate monitoring to check the actual status
Router, other network devices
Ping check monitoring
Application layer
• Restarts when process abnormality is
detected
App
Response monitoring
Presence monitoring
Monitoring agent
X SingleServerSafe
OS layer
Hardware layer
• Restarts when abnormality in SQL, HTTP
and so on is detected
▌OS layer
l User space monitoring
• Restarts when abnormality in AP layer
such as hang-up is detected.
• Restarts when disk/PCIeSSD access
abnormality is detected.
Access check
System
disk
PCIeSSD
Page 24
l Monitoring agent
▌Hardware layer
l Access check
Stall monitoring
OS space
▌Application layer:
l Presence monitoring of process
© NEC Corporation 2014
NIC
l Ping monitoring
NIC
l NIC Link UP/DOWN monitoring
Link-down monitoring
• Restarts when ping abnormality is detected.
• Restarts when link-down is detected.
• Availability can be improved by NIC
duplication.
7. Summary
Benefit of
NEC PCIe SSD Appliance for
Microsoft® SQL Server®
Page 25
© NEC Corporation 2014
Benefit of NEC PCIe SSD Appliance for Microsoft® SQL Server ®
Express5800
Scalable Enterprise Server
High-performance
PCIe server-mounted flash storage
X
Balanced and Predictable
Performance
• Balanced and Proven CPU-IO
Performance depending
on number of Flash MAX II.
• Consistent high-performance
even under heavy IO load.
Easy to introduce
TCO reduction
• Customer can choose best
practice configuration
depending on data size.
• Cook book (Configuration
guide, including best
parameter setting) will be
available *
• Less platform integration cost
• Less Rack apace
• Less Power Consumption
Initial Cost
Operating Cost
Time to introduction
* Pre-configuration
and Pre-installation
Are subject of discussion
Page 26
© NEC Corporation 2014
Down
Image of Big Data Analytics Solution
built on NEC PCIe SSD Appliance for Microsoft® SQL Server®
“Microsoft SQL Server 2014” equipped with ETL, DB, BI, Data Mining, easy-to-use
“Excel & Power BI” and “NEC PCIe SSD Appliance for Microsoft® SQL Server®”
provide Low-Cost All-in-One Big Data Analysis Platform
INPUT
Big Data Analytics Platform
Data Extract
Process &
Aggregation
Store data into High-Performance
PCIe server-mounted flash storage
Enterprise
System
TARO SUZUKI
08/07
Customer Data
NEC PCIe SSD Appliance
for Microsoft® SQL Server®
Web site Blog
Data Scientist
SMS
SQL Server 2014
Analysis Services
Smart Device
Page 27
© NEC Corporation 2014
End user
NEC brings together and integrates technology and expertise to create
the ICT-enabled society of tomorrow.
We collaborate closely with partners and customers around the world,
orchestrating each project to ensure all its parts are fine-tuned to local needs.
Every day, our innovative solutions for society contribute to
greater safety, security, efficiency and equality, and enable people to live brighter lives.
Page 28
© NEC Corporation 2014
Page 29
© NEC Corporation 2014