data preprocessing
Home > SQL Server Definitions - Data preprocessing
SearchSQLServer.com Definitions (Powered by WhatIs.com)
EMAIL THIS
LOOK UP TECH TERMS Powered by: WhatIs.com
Search listings for thousands of IT terms:
Browse tech terms alphabetically:
A B C D E F G H I J K L M N O P Q R S T U V W X Y Z #

data preprocessing



Digg This!    StumbleUpon Toolbar StumbleUpon    Bookmark with Delicious Del.icio.us   

DEFINITION - Data preprocessing describes any type of processing performed on raw data to prepare it for another processing procedure. Commonly used as a preliminary data mining practice, data preprocessing transforms the data into a format that will be more easily and effectively processed for the purpose of the user -- for example, in a neural network. There are a number of different tools and methods used for preprocessing, including: sampling, which selects a representative subset from a large population of data; transformation, which manipulates raw data to produce a single input; denoising, which removes noise from data; normalization, which organizes data for more efficient access; and feature extraction, which pulls out specified data that is significant in some particular context.

In a customer relationship management (CRM) context, data preprocessing is a component of Web mining. Web usage logs may be preprocessed to extract meaningful sets of data called user transactions, which consist of groups of URL references. User sessions may be tracked to identify the user, the Web sites requested and their order, and the length of time spent on each one. Once these have been pulled out of the raw data, they yield more useful information that can be put to the user's purposes, such as consumer research, marketing, or personalization.

LAST UPDATED: 01 Apr 2005

Read more about data preprocessing:
- A paper at DePaul University discusses data preprocessing in "Automatic Personalization Based on Web Usage Mining."


Do you have something to add to this definition? Let us know.
Send your comments to techterms@whatis.com


Digg This!    StumbleUpon Toolbar StumbleUpon    Bookmark with Delicious Del.icio.us   


RELATED CONTENT
Ensuring high availability of SSAS databases
In part three of this series, Baya Dewald hones in on the administrative side of SQL Server Analysis Services, with details on the various methods on...
Building a data warehousing and BI solution
In part two of his series, Baya Dewald breaks down the process of creating a data warehousing and business intelligence solution, with details on the...
An overview of SQL Server Report Builder 2.0
In this chapter excerpt from "Learning SQL Server 2008 Reporting Services", find out about the interface details and controls available in Report...

RELATED GLOSSARY TERMS
Terms from Whatis.com − the technology online dictionary
comma-separated values file  (SearchSQLServer.com)
data aggregation  (SearchSQLServer.com)




data preprocessing Solutions - SQL White Paper Library
HomeNewsTopicsITKnowledge ExchangeTipsAsk the ExpertsMultimediaWhite PapersIT Downloads
About Us  |  Contact Us  |  For Advertisers  |  For Business Partners  |  Site Index  |  RSS
SEARCH 
TechTarget provides technology professionals with the information they need to perform their jobs - from developing strategy, to making cost-effective purchase decisions and managing their organizations' technology projects - with its network of technology-specific websites, events and online magazines.

TechTarget Corporate Web Site  |  Media Kits  |  Site Map




All Rights Reserved, Copyright 2005 - 2009, TechTarget | Read our Privacy Policy
  TechTarget - The IT Media ROI Experts