data preprocessing
Home > SQL Server Definitions - Data preprocessing
SearchSQLServer.com Definitions (Powered by WhatIs.com)
EMAIL THIS
LOOK UP TECH TERMS Powered by: WhatIs.com
Search listings for thousands of IT terms:
Browse tech terms alphabetically:
A B C D E F G H I J K L M N O P Q R S T U V W X Y Z #

data preprocessing



Digg This!    StumbleUpon Toolbar StumbleUpon    Bookmark with Delicious Del.icio.us   

DEFINITION - Data preprocessing describes any type of processing performed on raw data to prepare it for another processing procedure. Commonly used as a preliminary data mining practice, data preprocessing transforms the data into a format that will be more easily and effectively processed for the purpose of the user -- for example, in a neural network. There are a number of different tools and methods used for preprocessing, including: sampling, which selects a representative subset from a large population of data; transformation, which manipulates raw data to produce a single input; denoising, which removes noise from data; normalization, which organizes data for more efficient access; and feature extraction, which pulls out specified data that is significant in some particular context.

In a customer relationship management (CRM) context, data preprocessing is a component of Web mining. Web usage logs may be preprocessed to extract meaningful sets of data called user transactions, which consist of groups of URL references. User sessions may be tracked to identify the user, the Web sites requested and their order, and the length of time spent on each one. Once these have been pulled out of the raw data, they yield more useful information that can be put to the user's purposes, such as consumer research, marketing, or personalization.

LAST UPDATED: 01 Apr 2005

Read more about data preprocessing:
- A paper at DePaul University discusses data preprocessing in "Automatic Personalization Based on Web Usage Mining."


Do you have something to add to this definition? Let us know.
Send your comments to techterms@whatis.com


Digg This!    StumbleUpon Toolbar StumbleUpon    Bookmark with Delicious Del.icio.us   


RELATED CONTENT
Tips for tuning SQL Server 2005 to improve reporting performance
Configure SQL Server 2005 for improved performance when reporting against tables with large rows of data. Tips include creating SQL indexes and report...
Parent-child dimensions in SQL Server 2005 with Analysis Services MDX
Retrieving a hierarchy with recursive relationships can be difficult in SQL. Easily implement these relationships with a parent-child dimension using...
Troubleshoot Web service issues in SQL Server 2005 Reporting Services
Learn why your Web service in SQL Server 2005 Reporting Services (SSRS) is not responding. You could be missing Reporting Services configurations.

RELATED GLOSSARY TERMS
Terms from Whatis.com − the technology online dictionary
comma-separated values file  (SearchSQLServer.com)
data aggregation  (SearchSQLServer.com)




HomeNewsTopicsITKnowledge ExchangeTipsAsk the ExpertsMultimediaWhite PapersIT Downloads
About Us  |  Contact Us  |  For Advertisers  |  For Business Partners  |  Site Index  |  RSS
SEARCH 
TechTarget provides enterprise IT professionals with the information they need to perform their jobs - from developing strategy, to making cost-effective IT purchase decisions and managing their organizations' IT projects - with its network of technology-specific Web sites, events and magazines.

TechTarget Corporate Web Site  |  Media Kits  |  Reprints  |  Site Map




All Rights Reserved, Copyright 2005 - 2008, TechTarget | Read our Privacy Policy
  TechTarget - The IT Media ROI Experts