Tải bản đầy đủ (.pdf) (10 trang)

Hands-On Microsoft SQL Server 2008 Integration Services part 71 ppt

Bạn đang xem bản rút gọn của tài liệu. Xem và tải ngay bản đầy đủ của tài liệu tại đây (600.74 KB, 10 trang )

678 Hands-On Microsoft SQL Server 2008 Integration Services
architecture, SSIS
data flow, 20, 644–645
engines for managing workflows in SSIS, 4
object model, 20
overview, 18–19
run time, 20
service component, 19
archives
Archiving Downloaded Files package, 175–179
exercise archiving downloaded files, 156–160
exercise deleting data month by month from archives. See data
deletion exercise
historical data, 657
AssignExpression, For Loops and, 124
asynchronous transformations
blocking asynchronous full row-set-based transformations,
649–650
exercise writing asynchronous output to nonstandard text file,
528–529
partially blocking asynchronous row-set-based
transformations, 648–649
performance enhancement with, 645–647
scripting, 520–525
attributes
of data in data warehouse, 441–443
in ER modeling, 541
setting on files and folders, 155
auditing transformations
Audit, 436–437
data flow and, 5, 355


overview of, 436
Row Count, 437–438
auto-generated scripts, 492–494
AVERAGE
as Aggregate transformation, 429–430
exercise aggregating sales orders, 433
B
Back Up Database task
control flow tasks, 212–213
as maintenance task, 140
backup compression, in data warehousing, 557–558
backward compatibility
Client Tools Backward Compatibility, 602
control flow tasks, 141
batch files
calling using Execute Process task, 152–154
creating, 149
BIDS (Business Intelligence Development Studio)
Adding Connection Managers to packages, 75
adding imported package to, 54–55
based on VSTA, 12
comparing with SQL Server Management Studio, 71
creating blank project for use with, 29–31
as development tool, 4–5
digitally signing packages, 271
exploring imported packages, 53
overview, 28–29
Package Configurations Organizer utility in, 569
running packages during development phase, 244–245
SSIS Designer, 111

starting SQL Server Import and Export Wizard, 43
validation alerts in, 632
windows and tab options in, 31–36
binary data option, Flat File source, 347
BLOB Bytes Read, performance counters, 659
BLOB Bytes Written, performance counters, 659
BLOB Files In Use, performance counters, 659
blocking asynchronous full row-set-based transformations, 649–650
bottom-up design, data warehouse, 539
breakpoints
debugging Script task and, 501–503
exercise setting, 635–638
overview of, 634
Buffer Memory, performance counters, 659
buffers, in memory management, 644
Buffers In Use, performance counters, 659
Buffers Spooled, performance counters, 659–660
BufferSizeTuning, logging event, 661
BufferWrapper class, in Script component, 510
Bulk Insert task
configuring to import text files, 183
as control flow tasks, 169
mapping Dts services to SSIS, 615
overview of, 5
as SQL Server task, 138
Business Intelligence Development Studio. See BIDS (Business
Intelligence Development Studio)
business intelligence transformations
data flow and, 352–353
Data Mining Query, 455–456

exercise configuring SCD transformation, 445–452
exercise executing package for removing duplicates,
478–480
exercise filtering with SCD key, 444–445
exercise loading SCD transformation, 452–455
exercise removing duplicates by Fuzzy Grouping, 474–478
exercise removing exact duplicates, 470–472
exercise removing fuzzy duplicates, 472–474
exercise using SCD transformation, 444
Fuzzy Grouping, 460–463
Fuzzy Lookup, 463–467
overview of, 352–353, 439
SCD (Slowly Changing Dimension), 439–443
Term Extraction, 457–460
Term Lookup, 456–457
Index 679
C
Cache Connection Manager, 76
Cache Mode section, of lookup transformation, 397–399
cache transformation, as split and join transformation, 394–395
Campaign database, attaching to SQL Server 2008, 674
candidate key profile, for multiple-column profiles, 65
case sensitivity, Term Lookup and, 457
CDC (Change Data Capture)
data warehouse enhancements in SQL Server 2008 R2,
562–564
MERGE statement used with, 561
what’s new in SQL Server 2008, 13
change types, available in SCD Wizard, 441–443
changing attribute

attribute change types in SCD Wizard, 441–442, 546
configuring SCD transformation, 450
Character Map transformation
exercise using to convert pipeline data, 407–408
as row transformation, 379–380
standardization of data with, 6
Check Database Integrity task
as control flow task, 213–214
as maintenance task, 140
CheckFile option, DTExec utility, 253
CheckpointFileName property, 312
CheckPointing option, DTExec utility, 252
checkpoints
exercise in seeing effect of transactions on checkpoints,
316–317
restarting packages with, 311–313
CheckpointUsage property, 312
CIF (Corporate Information Factory), 538
CIM (Common Information Model), 196
Client Tools Backward Compatibility, 602
Code window, BIDS, 35
column length distribution profile, single-column profiles, 64, 68
column null ratio profile, single-column profiles, 64
column pattern profile, single-column profiles, 64
column statistics profile, single-column profiles, 64
column value distribution profile, single-column profiles, 64
columns
ADO.NET source, 343
copying input columns to output columns, 378–379
creating new columns on row by row basis (row

transformation), 353
derivations on input column data, 382–385
Excel source, 346
Export Column transformation, 385–386
Import Column transformation, 386
mappings, 58–60
raw file source, 350
SQL Server Import and Export Wizard, 45–46
Columns tab, Fuzzy Grouping transformation, 462
Columns tab, Fuzzy Lookup transformation, 465
comma-separated value (CSV)
exercise writing synchronous output to, 526–528
text file log provider writing information into, 293
command prompt
DTExec utility for running packages, 250–256
installing SSIS from, 25–26
command sourcing phase, DTExec utility, 251
Common Information Model (CIM), 196
Component Wrapper class, in Script component, 510–511
compression, backup compression in SQL Server 2008 R2, 557–558
Conditional Split transformation, as row transformation, 389–390
ConfigFile option, DTExec utility, 253
configuration
Archiving Downloaded Files for sending filenames, 175–179
Bulk Insert task for importing text files, 183
data flow path, 372–375
Data Flow task, 421–424
DTExec utility phase, 252–255
Execute SQL task, 90–92, 115–118
exercise applying checkpoint configurations to packages,

313–316
FTP Connection Manager, 145
FTP task, 143, 144–147
logging, 296–299
package. See package configurations
SCD transformation, 445–452
Transfer Database task, 202–203
XML task areas, 162–165
configuring Script component
as data source, 507–514
as destination, 525–526
as transformation, 515
conformed dimensions, in dimensional modeling, 544
Connection Manager tab, Fuzzy Grouping transformation, 461–462
Connection Managers
adding for contacts email campaign, 114–115
ADO Connection Manager, 75
ADO.NET Connection Manager, 75–76, 213–214, 343
Analysis Services Connection Manager, 78–79
Cache Connection Manager, 76
Connections Project Wizard and, 63
displaying all Connection Managers defined in package, 58
Excel Connection Manager, 76–77, 345
exercise adding, 89–90, 431
File Connection Manager, 77–78
Flat File Connection Manager, 78
FTP Connection Manager, 78, 143, 145
HTTP Connection Manager, 78, 160
Microsoft Connector 1.0 for SAP BI, 81–82
Microsoft Connector for Oracle by Attunity, 82–83

Microsoft Connector for Teradata by Attunity, 83–84
MSMQ Connection Manager, 78
680 Hands-On Microsoft SQL Server 2008 Integration Services
Connection Managers (cont.)
Multiple Files Connection Manager, 79
Multiple Flat Files Connection Manager, 79–80
ODBC Connection Manager, 80
OLE DB Connection Manager, 80, 114–115, 369, 461
overview of, 74–75
raw file source, 350
SMO Connection Manager, 80–81
SMTP Connection Manager, 81, 195
SQL Server Compact Edition Connection Manager, 81
types of objects in SSIS packages, 71
WMI Connection Manager, 81, 197–199
Connection option, DTExec utility, 253
connection strings, types of sensitive data, 271
Connections Managers tab, BIDS windows
displaying all Connection Managers defined in package, 58
overview of, 35
connections, mapping Dts services to SSIS, 616
Connections Project Wizard, 63
ConnectionString property, evaluating at runtime, 106
consistency of data. See data integrity
ConsoleLog option, DTExec utility, 253–254
constrained executable, 99–100
constraints. See precedence constraints
contacts email campaign
adding Connection Managers for, 114–115
adding Foreach Loop Container, 119–120

adding Send Mail Task and executing package, 119–122
configuring Execute SQL Task, 115–118
method, 113
review, 123
control flow
data flow and, 334–335
managing workflow with, 55–56
running parallel task in, 669–670
types of objects in SSIS packages, 70
control flow containers
Event Handler, 110
exercise avoiding inconsistency in single container, 306
exercise avoiding inconsistency over multiple container,
306–310
exercise creating e-mail campaign. See contacts email
campaign
exercise deleting data month by month from archives. See data
deletion exercise
Foreach Loop, 111–113, 119–120
For Loop, 123–126
packages, 110–111
Sequence, 131–132
summary, 133
Task Host, 132
types of, 110
control flow engine
comparing with data flow engine, 482–483
engines for managing workflows in SSIS, 3–4
separation from data flow engine, 334–335
in SSIS architecture, 20

working programmatically and, 483
Control Flow tab, BIDS SSIS Designer, 33, 111
control flow tasks
analysis services, 139
Back Up Database task, 212–213
backward compatibility and, 141
Bulk Insert task, 169
Check Database Integrity task, 213–214
custom, 141–142
data flow, 137
data preparation, 137
Execute Package task, 187–188
Execute Process task, 148
Execute SQL Server Agent Job task, 214–215
Execute SQL task, 165–168
Execute T-SQL Statement task, 215–216
exercise archiving downloaded files, 156–160
exercise consolidating workflow packages, 188–195
exercise downloading zipped files from FTP server, 144–148
exercise expanding zipped files, 149–154
exercise importing expanded files. See importing
expanded zip files
exercise reading application log, 197–200
File System task, 155–156
FTP task, 142–143
History Cleanup task, 216–217
maintaining, 140–141
Maintenance Cleanup task, 217–218
managing from Control Flow pane, 482
Message Queue task, 169–173

Notify Operator task, 218–219
overview, 136
Rebuild Index task, 220–221
Reorganize Index task, 221–222
scripting, 139
Send Mail task, 195
Shrink Database task, 222–223
SQL Server, 138
summary, 223–224
tasks as smallest unit for restarting packages, 312
transfer, 139–140
Transfer Database task, 202–203
Transfer Error Messages task, 203–205
Transfer Jobs task, 205–206
Transfer Logins task, 206–208
Transfer Master Stored Procedures task, 208–209
Transfer SQL Server Objects task, 210–211
Update Statistics task, 223–224
Web Service task, 160–161
WMI Data Reader task, 196–197
WMI Event Watcher task, 200–202
workflow, 138
XML task, 161–165
Index 681
Copy Column, as row transformation, 378–379
COPY command, dtutil utility, 238–239
Copy SQL Server Objects task, mapping Dts services to SSIS, 615
Corporate Information Factory (CIF), 538
COUNT, as Aggregate transformation, 429–430
COUNT DISTINCT

as Aggregate transformation, 429–430
exercise aggregating sales orders, 433
CPUs
32-bit vs. 64-bit systems, 17, 641–643
parallel processing. See parallel processing
CreateDeploymentUtility property, deployment utility, 588
CRM (customer support systems)
contact management and, 113
data management and, 2
cryptography. See encryption
CSV (comma-separated value)
exercise writing synchronous output to, 526–528
text file log provider writing information into, 293
CUBE operators, GROUP BY supporting, 561
custom deployment, 592–594
customer support systems (CRM)
contact management and, 113
data management and, 2
customization. See programming SSIS
D
data
analyzing data quality with Data Profiling Task, 63–64
consolidation features, 7
converting raw data into meaningful information, 7
Data Flow pane managing movement of, 482
preparing for control flow tasks, 137
segmentation using Percentage Sampling transformation,
417–418
standardization features in SSIS, 6
timestamps, 436

data cleansing
business intelligence operations, 352
Fuzzy Grouping transformation and, 460
Data Conversion transformation
converting data formats or locales, 7
exercise converting Excel data and combining with flat files,
403–405
as row transformation, 381–382
standardization of data, 6
data deletion exercise
adding Execute SQL task for deleting monthly records,
127–130
adding For Loop Container for, 125–126
method, 124–125
review, 130
Data Driven Query task, mapping Dts services to SSIS, 615
data flow
ADO.NET as destination, 357
ADO.NET as source, 343–345
architecture of, 644–645
auditing transformations and, 355
bringing data into, 337–341
business intelligence transformations and, 352–353
component interfaces, 335–336
control flow and, 137, 334–335
data mining model training destination, 357–358
DataReader destination, 358
destinations. See destinations, data flow
dimension processing destination, 358–360
error output interface, 337

Excel as destination, 360
Excel as source, 345–346
exercise adding OLE DB source and Flat File destination, 370–372
exercise configuring data flow path and executing package,
372–375
exercise configuring OLE DB Connection Manager and adding
data flow path, 369
exercise executing package, 412–413
external metadata interface, 336
flat files as destination, 360–361
flat files as source, 346–348
input interface, 336–337
multiple data flows with parallel processing, 670
OLE DB as destination, 361–363
OLE DB as source, 348–349
output interface, 337
overview of, 334
partition processing destination, 363
paths. See paths, data flow
raw files as destination, 364–365
raw files as source, 349–350
recordset destination, 365
row transformations, 353
rowset transformations, 354
Script components as destination, 365
Script components as source, 350
sources, 342–343
split and join transformations, 354–355
SQLServer Compact destination, 365–366
SQLServer destination, 366–367

summary, 375
transformations. See transformations, data flow
types of objects in SSIS packages, 70
XML files as source, 351
data flow engine
comparing with control flow engine, 482–483
engines for managing workflows in SSIS, 3–4
managing pipeline activities with, 57–58
overview of, 334
separation from control flow engine, 334–335
in SSIS architecture, 20
working programmatically and, 483
Data Flow tab, BIDS SSIS Designer, 33, 111
682 Hands-On Microsoft SQL Server 2008 Integration Services
Data Flow task
checkpoints and, 312–313
creating multiple data flows with parallel processing, 670
exercise adding, 431, 444
exercise configuring, 421–424
exercise enabling logging on, 663–665
extracting data from disparate data sources, 5
overview of, 335
performance and, 644–645
source and destination adapters, 335
data integrity
exercise avoiding inconsistency in single container, 306
exercise avoiding inconsistency over multiple container, 306–310
exercise creating simulation package for data consistency issues,
302–306
maintaining with transactions, 301–302

data marts. See also data warehouses
bottom-up design, 539
centralized EDW with dependent data marts, 539–540
dimension types and, 544
distributed independent data marts, 540–541
hub-and-spoke architecture and, 556–557
integrating with parallel data warehouses, 552
star schema and, 547
top-down design, 538
data mining model training, 357–358
Data Mining Prediction task, mapping Dts services to SSIS, 615
Data Mining Query
as Analysis Services task, 139
as business intelligence transformations, 455–456
data model schemas
building star schema, 549–550
overview of, 547
snowflake model, 548–549
star schema, 547–548
data models
dimension types, 544–545
dimensional modeling, 542–544
ER modeling, 541–542
loading dimensions using SCD, 545–546
overview of, 541
data parsing, converting source data to SSIS data types, 338–339
Data Profile Viewer
overview of, 12
review statistics generated by Data Profiling Task, 66–67
reviewing data in XML format, 42

Data Profiling task
analyzing data quality with, 63–64
as data preparation task, 137
exercise profiling imported data, 66–68
function of, 42
multiple-column profiles, 65
overview of, 5
single-column profiles, 64
what’s new, 12
Data Protection API (DPAPI), 272
data source views, 85–86
data sources
ADO.NET, 343–345
configuring Script component as, 507–514
as data flow component, 335–336
design-time connections to, 84–85
enabling SSIS as, 328–329
Excel, 345–346
exercise addings, 370–372, 401–403, 444
flat files, 346–348
Import and Export Wizard data source options, 43–44
OLE DB, 348–349
overview of, 342–343
raw files, 349–350
Script components, 350
SSIS as data source for Reporting Services, 328
SSIS packages as, 8, 330–331
XML files, 351
data storage systems, 2
data streams, exercise combining two, 403–405

data types
converting source data to SSIS data types, 338–339, 381–382
exercise converting Excel data and combining with flat files,
403–405
new date and time data types, 14
user-defined variables and, 95
data viewers
as debugging tool, 640
exercises adding, 412, 424–426
data warehouses
architecture, 539
backup compression in SQL Server 2008 R2, 557–558
bottom-up design, 539
building star schema, 549–550
centralized EDW with dependent data marts, 539–540
change data capture, 562–564
data model schemas, 547
data models for, 541
dimension types, 544–545
dimensional modeling, 542–544
distributed independent data marts, 540–541
ER modeling, 541–542
Fast Track Data Warehouse solution, 553–554
GROUP BY extensions, 561
loading data into, 5–6
loading dimension using SCD, 545–546
managing data attributes. See SCD (Slowly Changing
Dimension) transformation
MERGE statement, 559–561
need for, 536–538

overview of, 536
Parallel Data Warehouse solution, 554–557
partitioned table parallelism, 564–565
snowflake model, 548–549
Index 683
SQL Server 2008 R2 editions, 551–552
SQL Server 2008 R2 enhancements, 557
SQL Server 2008 R2 solutions, 552
star join query processing, 562
star schema, 547–548
summary, 566
top-down design, 538
database-level roles
applying access control with user-defined roles, 283
exercise using roles for granular access control, 287
exercise using to assign execute permissions to packages,
285–287
permissions and, 282–283
server storage and, 50
databases
attaching Campaign database (for exercises in this book) to
SQL Server 2008, 674
backing up. See Back Up Database task
checking integrity of. See Check Database Integrity task
cleaning up backup or maintenance report files. See
Maintenance Cleanup task
normalization of data, 419–420
Pivot transformation, 421–424
running T-SQL statements against. See Execute T-SQL
Statement task

shrinking. See Shrink Database task
transferring from source SQL Server to destination SQL Server.
See Transfer Database task
Unpivot transformation, 427–428
Datacenter Edition, SQL Server 2008 editions, 17
DataReader destination
data flow and, 358
using SSIS package as data source, 8
date data types, 14
date/time functions
in Expression Builder, 104
performing derivations on input column data, 383
DBCC CHECKDB, T-SQL statement, 213–214
DBCC SHRINKDATABASE, T-SQL statement, 222
db_ssisadmin role, 282–283
db_ssisltduser role, 282
db_ssisoperator role
exercise assigning execute permissions to packages, 285–287
overview of, 282
DCOM (Distributed COM)
Message Queuing DCOM Proxy, 172
permissions for accessing packages on remote servers, 229
debugging. See also troubleshooting packages
configurable debugging features, 634
default debugging features, 632–633
enhancements to, 14
package configurations, 575–576
Script component, 530–533
Script task, 501–503
decision support system (DSS), 537

default debugging features, 632–633
default settings, package management, 227
degenerate dimensions, in dimensional modeling, 545
DEL, deleting packages with dtutil utility, 240
delimited file formats. See also CSV (comma-separated value), 73
deploying packages
custom deployment, 592–594
direct and indirect configurations, 577–578
exercise assigning values to and debugging package
configurations, 575–576
exercise building projects, 591
exercise configuring project properties, 590
exercise creating computer-specific package configuration,
579–582
exercise creating indirect package configuration, 583–585
exercise deploying and executing the package, 586
exercise enabling and adding package configurations,
572–575
exercise installing and running project package, 591–592
exercise setting up environment variable as pointer to
configuration file, 582
exercise using property expressions to update exported
properties, 582–583
overview of, 10, 568
package configurations and, 568–570
Parent Package Variable configuration type, 571
project deployment, 589
Registry Entry configuration type, 571
setting up deployment utility, 587–588
SQLServer configuration type, 571–572

steps in creating indirect configurations, 578–579
summary, 592–594
XML Configuration File, 570–571
deployment utility
overview of, 587–588
properties for controlling project deployment, 588
DeploymentOutputPath property, deployment utility, 588
Derived Column transformation
exercise using, 406–407
as row transformation, 382–385
for standardization of data, 6
design, data warehouses
bottom-up, 539
top-down, 538
designer, SSIS. See SSIS Designer
destinations, data flow
ADO.NET destination, 357
bringing data into data flows and, 340–341
configuring Script component as, 525–526
as data flow component, 335–336
data mining model training destination, 357–358
DataReader destination, 358
dimension processing destination, 358–360
Excel destination, 360
exercise adding flat files as, 370–372
684 Hands-On Microsoft SQL Server 2008 Integration Services
destinations, data flow (cont.)
exercise writing asynchronous output to nonstandard text file,
528–529
exercise writing synchronous output to CSV file, 526–528

Flat File destination, 360–361
OLE DB destination, 361–363
overview of, 355–357
partition processing destination, 363
raw file destination, 364–365
recordset destination, 365
Script component destination, 365
SQL Server Import and Export Wizard, 46–49
SQLServer Compact destination, 365–366
SQLServer destination, 366–367
Developer Edition, SQL Server 2008 editions, 17
Diff operations, comparing XML documents, 164–165
digital certificates
digitally signing packages, 270–271
package security features, 7–8
dimension hierarchies, in dimensional modeling, 543–544
dimension members, in dimensional modeling, 543
Dimension Processing destination
data flow and, 358–360
data warehousing and, 5
dimension tables, in dimensional modeling, 439
dimensional modeling
dimension tables, 439
dimension types, 544–545
dimensions, 543–544
fact tables, 439, 542–543
loading dimensions using SCD, 545–546
measures, 543
overview of, 542
dimensions

in dimensional modeling, 543–544
loading using SCD, 545–546
types of, 544–545
direct package configurations, 577–578
directories
Active Directory, 172
exercise creating, 95–99
performing operations on using File System tasks, 155
Directory Service Integration, 172
Distributed COM (DCOM)
Message Queuing DCOM Proxy, 172
permissions for accessing packages on remote servers, 229
distributed independent data marts, 540–541
Distributed Management Task Force (DMTF), 196
DML operations, 559
DMTF (Distributed Management Task Force), 196
DMX prediction queries
business intelligence operations, 352
Data Mining Query transformation, 455–456
Document Type Definition (DTD), 163
DPAPI (Data Protection API), 272
DSS (decision support system), 537
DTD (Document Type Definition), 163
DTExec utility
command sourcing phase, 251
configuration phase, 252–255
overview of, 250–251
package load phase, 251–252
validation and execution phase, 256
DTExecUI (Execute Package Utility)

exercise running package with, 246–249
overview, 245
review, 249–250
DTS 2000 package migration
embedding DTS 2000 packages in SSIS packages, 609–610
Execute DTS 2000 Package task, 610
exercise configuring Execute DTS 2000 Package task, 610–614
exercise editing and executing DTS 2000 package, 607–609
exercise enumerating and importing DTS 2000 package,
605–607
exercise executing migrated package, 621–622
exercise using Package Migration Wizard, 618–621
exercise using Upgrade Advisor for analyzing, 597–599
installing Dts support components, 601–604
mapping DTS services to SSIS, 615–617
options for, 600–601
Package Migration Wizard and, 614, 617
running DTS 2000 packages as-is with run-time
support, 604
Dts (Data Transformation Services)
ActiveX Script task for extending, 484, 486–487
comparing SSIS to, 270
connecting to SSIS and managing Dts packages, 37–39
legacy support, 10
limitations in package deployment, 568
run time, 602
SQL Server 2008 and, 14–15
upgrading from, 27
Dts Designer Components, 602, 607–608
Dts object, properties and methods of, 495–496

Dts Package Migration Wizard, 601
DtsDebugHost, 245
DTSInstall.exe, installing packages with, 591–592
DTSWizard.exe, starting SQL Server Import and Export Wizard, 43
dtutil utility
copying packages, 238–239
creating folders, 242
deleting folders, 243–244
deleting packages, 240
/Dump option, 14
encrypting packages, 241
importing and exporting packages, 233
listing folder contents, 242
moving packages, 239
overview, 235–236
renaming folders, 243
Index 685
signing packages, 240–241
verifying existence of packages, 237–238
Dump errorcode option, DTExec utility, 256
Dump option, dtutil utility, 14
DumpOnError option, DTExec utility, 256
Dynamic Properties task, mapping Dts services to SSIS, 615
E
editions, SQL Server 2008, 15–17
EDW (Enterprise Data Warehouse), 539–540
embedding DTS 2000 packages, in SSIS packages, 609–610
encryption
with dtutil utility, 241
exercise encrypting all information with package password,

280–281
exercise encrypting sensitive information with package
password, 277–279
exercises encrypting all information with user key, 279–280
exercises encrypting sensitive information with user key, 275–276
package security features, 7–8
ProtectionLevel property options, 272–273
of sensitive data, 49–50
Engine Threads property, parallel processing, 670
Enterprise Data Warehouse (EDW), 539–540
Enterprise Edition, SQL Server 2008 editions, 17
entities, in ER modeling, 541
enumerators, Foreach Loops, 112–113
environment variable
exercise setting up as pointer to package configuration file, 582
types of package configurations, 571
ER (entity-relationship) modeling, 541–542
error events, scripting, 503
Error List window
BIDS windows, 35
default debugging features, 633
error messages, transferring between SQL Servers. See Transfer Error
Messages task
error output
ADO.NET source, 344
data flow paths and, 639
Excel source, 346
interface for data flow, 337
types of errors and, 339–340
ETL (extracting, transforming, loading) tools

data consolidation with, 42
Data Flow task and, 137
data management and, 2–3
EvalExpression, For Loops and, 124
EvalOp property, constraint options, 100
event handlers
exercise working with, 325–328
handling events at package run time, 323–325
scripting events and, 503
types of objects in SSIS packages, 71
Event Handlers tab, BIDS SSIS Designer, 34, 111, 324
events
handling at package run time, 323–325
logging, 298
raising with scripts, 503–504, 530–532
Excel
data source options for SQL Server Import and Export Wizard, 43
data sources, 345–346
destination in data flow, 360
exercise converting Excel data and combining with flat files,
403–405
exercise using Pivot transformation, 421–424
Excel Connection Manager, 76–77, 345
executables, precedence executable, 99–100
Execute DTS 2000 Package task
as backward compatibility task, 141
exercise configuring Execute DTS 2000 Package task, 610–614
overview of, 610
Execute Package task
control flow tasks, 187–188

embedding DTS 2000 packages in SSIS packages, 609–610
mapping Dts services to SSIS, 615
transactions and, 301
as workflow task, 138
as wrapper for using packages within packages, 308
Execute Package Utility. See DTExecUI (Execute Package Utility)
Execute Process task
calling batch files with, 152–154
control flow tasks, 148
mapping Dts services to SSIS, 615
as workflow task, 138
Execute SQL Server Agent Job task
as control flow task, 214–215
as maintenance task, 140
Execute SQL task
configuring for contacts email campaign, 115–118
for deleting monthly records, 127–130
exercise configuring, 90–92
expressions page, 168
general section, 166–168
mapping Dts services to SSIS, 615
overview of, 165–166
parameter mapping page, 168
result set page, 168
as SQL Server task, 138
SQL statements and, 387
Execute T-SQL Statement task
compared with Execute SQL task, 166
control flow tasks, 215–216
as maintenance task, 140

execution trees, as performance monitoring tool, 662–663
ExecutionLog file, 665–668
Export Column transformation, as row transformation,
385–386
Express Edition, SQL Server 2008 editions, 15
686 Hands-On Microsoft SQL Server 2008 Integration Services
Expression Builder
building expressions with, 103
functions and operators, 104–105
expression field, constraint options, 101
expressions
building, 103–104
evaluating, 104, 106
exercise building property expressions for mailing
opportunities packages, 318–323
exercise using to update exported properties, 582–583
exercise using to update properties at run time, 105–107
functions and operators, 104–105
uses of, 102
variables and, 317–318
external metadata, interface for data flow, 336
extracting, transforming, loading (ETL) tools
data consolidation with, 42
Data Flow task and, 137
data management and, 2–3
F
fact tables, in dimensional modeling, 439, 542–543
fast parsing
converting source data to SSIS data types, 338–339
Flat File source, 347

Fast Track Data Warehouse solution, in SQL Server 2008 R2, 553–554
FC, creating folders with dtutil utility, 242
FDe, deleting folders with dtutil utility, 243
FDi, showing directory contents in dtutil utility, 242
File Connection Manager, 77–78
File System folder, as root-level folders, 230
File System task
archiving downloaded files, 158–160
configuring with hard-coded values, 105
control flow tasks, 155–156
creating folder using, 96–97
as data preparation task, 137
user-defined variable for passing folder path to, 97–99
File Transfer Protocol task, mapping Dts services to SSIS, 615
files/file system
default settings, 227
delimited file formats, 73
exercise archiving downloaded files, 156–160
exercise downloading zipped files from FTP server, 144–148
exercise expanding zipped files, 149–154
exercise importing expanded files. See importing expanded
zip files
fixed width file formats, 73
performing operations on using File System tasks, 155
ragged right file formats, 74
saving packages to, 290
fixed attribute, attribute change types in SCD Wizard, 441, 546
fixed database-level roles. See database-level roles
fixed width file formats, 73
Flat Buffer Memory, performance counters, 660

Flat Buffers In Use, performance counters, 660
Flat File Connection Manager, 78
Flat File source, 44
flat files
data sources, 346–348
destination in data flow, 360–361
exercise adding as data flow destination, 370–372
importing data from flat file into SQL Server, 169
flat files, importing into SQL Server 2008
column and row options, 45–46
data source selection, 43–44
destination options (new database), 46–49
mapping options, 49
report on steps in import process, 50–51
security options, 49–50
folders. See also files/file system
adding root-level folder to stored packages, 232–233
creating with dtutil, 242
deleting with dtutil, 243–244
exercise creating directory folder, 95–99
listing folder contents with dtutil, 242
renaming with dtutil, 243
SSIS service folder structure, 230
For Loops
applying repeating logic to units of work, 7
checkpoints and, 313
container, 123–126
Foreach ADO enumerator, 119–120
Foreach Loops
adding for contacts email campaign, 119–120

applying repeating logic to units of work, 7
archiving downloaded files, 156–158
checkpoints and, 313
enumerating multiple zipped files, 150–152
using Execute Process task inside, 152–154
FR, renaming folders with dtutil utility, 243
FTP Connection Manager
configuring, 145
connecting FTP Task Editor to FTP server, 143
overview of, 78
FTP task
configuring, 143
configuring for downloading zipped files, 144–147
as data preparation task, 137
exercise downloading zipped files from FTP server, 144–148
file transfer operations, 142
full outer joins, T-SQL, 392
functional dependency profile, multiple-column profiles, 65
functions
in Expression Builder, 104–105
performing derivations on input column data, 384
Fuzzy Grouping transformations
Advanced tab, 462–463
Columns tab, 462
Index 687
Connection Manager tab, 461–462
exercise removing duplicates by Fuzzy Grouping, 474–478
overview of, 460–461
Fuzzy Lookup transformations
Advanced tab, 466

Columns tab, 465
considerations before running, 467
exercise removing fuzzy duplicates, 472–474
operations for standardization of data, 6
overview of, 463–464
Reference Table tab, 464–465
G
GAC (global assembly cache), 485
GROUP BY
as Aggregate transformation, 429–430
data warehouse enhancements in SQL Server 2008 R2, 561
exercise aggregating sales orders, 431–433
GROUPING SETS, 561
GUI (graphical user interface)
for package management, 226
for Script task, 489–492
in SQL Server Management Studio, 229
H
handshake, scripting functionality for in data warehousing scenario,
488–492
hardware, Parallel Data Warehouse, 554–556
historical attribute
attribute change types in SCD Wizard, 442, 546
configuring SCD transformation, 451–452
historical data, archiving, 657
History Cleanup task
as control flow task, 216–217
as maintenance task, 140
HTTP Connection Manager, 78, 160
HTTP, Message Queuing service supporting, 172

hub-and-spoke architecture, Parallel Data Warehouse, 556–557
I
Import Column transformation, as row transformation, 386
importing and exporting packages, 233–235
importing expanded zip files
building expanded files package, 179–186
configuring Archiving Downloaded Files to send filenames,
175–179
installing Message Queuing service, 174–175
method, 173–174
overview of, 173
review, 186–187
indexes
partition-aligned indexed views, 564–565
rebuilding. See Rebuild Index task
reorganizing. See Reorganize Index task
indirect package configurations
exercise creating, 583–585
overview of, 577–578
steps in creating, 578–579
inferred member updates output
attribute change types in SCD Wizard,
442–443
configuring SCD transformation, 450
informational messages, raising with scripts, 530
InitExpression, For Loops and, 123
Inmon, Bill, 538
inner joins, T-SQL, 392
input
interface for data flow, 336–337

SQL Server Import and Export Wizard, 60–62
Input section, XML task, 162
installing project package, 591–592
installing SSIS
from command prompt, 25–26
overview, 21
side by side install with earlier versions of SSIS
or Dts, 26–27
upgrade installation, 27–28
using installation wizard on clean system, 22–25
interfaces, data flow
error output interface, 337
external metadata interface, 336
input interface, 336–337
output interface, 337
overview of, 335–336
J
jobs, SQL Server Agent
cleaning up job history. See History Cleanup task
copying jobs. See Transfer Job task
creating and adding package to, 257–258
executing. See Execute SQL Server Agent Job task
review of job automation process, 265–266
scheduling, 258–261
using proxy account to run jobs, 261–264
viewing job history, 264–265
join transformations. See split and join transformations
junk dimensions, in dimensional modeling, 544
K
Kimball, Ralph, 539

L
left outer joins, T-SQL, 392
legacy support, 10, 605–607
LOB (line of business) applications, 536
Locals window
BIDS windows, 36
default debugging features, 633

×