Unit 2: Data
Processing Concepts
Outlines
•Introduction
•Data Processing Concepts
•Data Processing Activities
•Data Processing Cycle
•Data Hierarchy
•Data File structures
•Data File Structures
•Application Portfolio Management
•Introduction to Micro Database
Manager
3.1 Data Processing Concepts
Data
The word "data" is the plural of datum, which
means facts, observations, occurrence and
observations. Data are representations of facts
pertaining to people, things, ideas and events.
Data are represented by symbols such as
alphabets, numerals or special symbols.
Data Processing
Data processing is the act of manipulating or
handling data in some manner. Thus, the idea of
processing is to transform data into
information Thus, data processing can be defined
as a series of actions or steps, which converts
data into useful information.
Information
Information can be defined as 'data transformed
into useful and meaningful form for specific
purpose'. Thus, data is not useful until it is
organised and manipulated because after that
only data becomes information.
3.2 Data Processing Activities
Data processing consists of all those activities,
which are required to convert data into
information. There are some tools, which help
in processing of data; these tools can be
manual, mechanical, electromechanical or
electronic such as manual tools as paper and
pencil, mechanical tools as filing cabinets,
electromechanical tools such as typewriters and
adding machines and electronic tools such as
calculators and computers.
3.3 Data Processing Cycle
1.Input: The term input refers to the activities required
to record data and to make it available for processing.
The input can also include the steps necessary to
check, verify and validate data contents.
2.Processing: The term processing denotes the
actual data manipulation techniques such as
classifying, sorting, calculating,summarising,
comparing, etc. that convert data into information.
3.Output: It is a communication function, which
transmits the information generated after processing
of data, to persons who need the information.
•Sometimes output also includes decoding activity,
which converts the electronically generated
information into human-readable form.
4.Storage: It involves the filing of data and information
for future use.
2.7 Data Hierarchy
It shows arrangement of data in hierarchical form
having following fields,records, files and so on.
A data field will keep date as September 8, 1971.
A record is shows details about an employee with
name fields, address fields, date of birth field etc.
A file carries related records. It helps in keeping track of
employee details.
Files are arranged in database using DBMS.
In case of data storage, data fields have bytes which
consist of bits.
3.8 Data File Structures
This is a particular way that information is encoded
for storage in a computer file. Since a disk drive or indeed
any computer storage, can store only bits, the computer
must have some way of converting information to 0s and
1s and vice-versa. There are different kinds of formats
for different kinds of information.
Within any format type, e.g. word processor
documents, there will typically be several different
formats. File formats are divided into proprietary and
open formats.
1. Generality
It is seen that there are certain file formats which are
used to store only particular type of data. The
formats such as JPEG are used to store static
photographic images while GIF format is used to
store images and animations. Apart from this,
QuickTime format stores certain types of multimedia.
2. Specifications
Many file formats, including some of the most well-
known file formats, often have a published
specification document (often with a reference
implementation) that describes exactly how the data is to
be encoded and which can be used to determine whether
or not a particular program treats a particular file format
correctly.
3. Identifying the Type of a File
A method is required to determine the format of a
particular file within the file system- an example of
metadata.
4. Filename extension
An important feature in operating systems is to
determine the format of file based on section of its name
following final period which is known as the filename
extension.
We see that an HTML documents will have extension
of .html or .htm, which an image file is having an
extension of .gif, .PNG etc.
For example, a company logo may be needed in
both .tif format (for publishing) and .gif format (for web
sites). With the extensions visible, these would appear
at the unique filenames "CompanyLogo.tif" and
"CompanyLogo.gif". With the extensions hidden, these
would both appear to have the identical filename
"Company Logo", making it more difficult to determine,
which one to select for a particular application.
5. Internal metadata
A second way to identify a file format is to store information
regarding the format inside the file itself. Usually, such
information is written in one (or more) binary string(s),
tagged or raw texts placed in fixed, specific locations
within the file. Since the easiest place to locate them is at
the beginning of it, this area is usually called a file header
when it is greater than a few bytes or a magic number if it
is just a few bytes long.
6. File header
First, the meta-data contained in a file header are not
necessarily stored only at the beginning of it but might be
present in other areas too, often including the end of the
file that depends on the file format or the type of data it
contains. Character-based (text) files have character-
based human-readable headers,whereas binary
formats usually feature binary headers, the need to read
all the bytes/records before it.
7. External metadata
A good way to store a file format is to store the information
about format in file system instead of keeping within the file
itself. Such idea not only keeps the metadata away from
main data and name but also not very safe as compared to
either file extensions.
8. Mac OS type-codes
Mac OS File System will able to store codes for creator
and types that serves as part of directory entry for each
file. They are called as OS Types or HyperCard stack
file. Type of code will show the format of file, while
creator code specifies default program to run.
9. OS/2 Extended Attributes
It found that HPFS, FAT12 and FAT16 file systems
allows storage of extended attributes with files which
contains an arbitrary set of triplets with name, coded type
for value and a value having different names.
10. POSIX extended attributes
On UNIX and Unix-like systems, ext2, ext3, ReiserFS
version 3, XFS, JFS, FFS and HFS+ file systems allow
storage of extended attributes with files.
11. PRONOM Unique Identifiers (PUIDs)
PRONOM Persistent Unique Identifier is an extension of
constant, special and definite identifiers which is for
file formats that was created by National Archives of
UK which serve as part of its PRONOM technical registry
service.
12. MIME types
MIME types are useful in Internet-related applications
that carries standardised system of identifiers having type
and sub-type which is separated by slash.
13. File content based format identification
It is another way of finding file format by way of file
contents that shows distinguishable patterns.
14. File format identifiers (FFIDs)
File format identifiers is also a way to find file formats
as per their origin and file category. It has several digits
which are of form NNNNNNNNN-XX-YYYYYYY, where
NNNNNNNN as per company/standards organisation
database and XX and YYYYYYY shows file type in
hexadecimal.
3.9 Application Portfolio Management.
Application Portfolio Management is a system
which is applied in medium to large size
Information Technology organisations. Such
system uses lessons of financial portfolio
management in order to verify financial benefits of
application to be compared with costs of the
application maintenance and operations.
Portfolio
Definition of an application
Application in case of application portfolio
management is an important aspect.
Application software: There are certain
executable software components which are used to
create, update, manage, calculate or display
information for particular business.
Software component: This is an executable
computer instructions having single deployment
container which cannot be broken further
Inclusions
The purpose of inclusion is:
To provide three web services: Invoice Create,
Invoice Search and Invoice Detail Get.
To have a service-oriented business application
where a user interface to get invoice which calls
Invoice Create service.
To have a legacy system having rich client,
server-based middle tier and database.
To pull data from database using website publishing
system and publishes it to HTML format as sub-site
on public URL.
Exclusions
The following are not applications:
HTML website
Having database which is not part of series of steps
for business values.
Web service which is incapable of being set of
steps.
Having a stand-alone batch script which compares
contents of databases by
making calls
3.10 Introduction to Micro Data Base Manager
Micro DB Manager is a database abstraction
class written in PHP using object-oriented
technologies.
The class realises the following functionalities:
Connecting to the database
Executing queries
Converting results to associative array
Getting selected rows
Getting affected rows
Getting last insert id
Getting the number of executed queries
Getting the execution time of the queries
Getting error messages and codes.
3.11 Glossary
1. Data - It is related to facts, observations,
occurrence of information.
2. Data processing - It is a processing of carrying
out data in particular
manner.
3. Processing - It is a procedure of giving some
meaning to data.
4. Application Portfolio Management - It is a system
applied in medium to large size Information
Technology organisations.

More Related Content

PPTX
DATABASE MANAGEMENT
PPTX
PDF vs. TIFF, An Evaluation of Document Scanning File Formats
PPTX
File organization and introduction of DBMS
DOCX
File organization in database
PPT
Lecture #1 Introduction
PPT
2ndlec.database
PPT
Digital Object Identifiers: Affecting How Libraries Connect to Online Digital...
PPTX
Current trends in DBMS
DATABASE MANAGEMENT
PDF vs. TIFF, An Evaluation of Document Scanning File Formats
File organization and introduction of DBMS
File organization in database
Lecture #1 Introduction
2ndlec.database
Digital Object Identifiers: Affecting How Libraries Connect to Online Digital...
Current trends in DBMS

What's hot (20)

PPT
Chapter10
PPTX
Multimedia database
PDF
A basic course on Reseach data management, part 2: protecting and organizing ...
PPTX
8 Document Capture Must Haves, a Document Management Tutorial
PPTX
Multimedia database
PPT
data resource management
PPT
5 data resource management
PPT
Uit9 ppt ch08_au_rev
PPTX
Automatic file naming and routing for scanned documents and existing files.
PDF
PPTX
An Introduction to Document Scanning, Understanding Your Requirements
PPT
Database Systems
PPT
Multimedia db system
PPT
Alternate Data Streams
PPTX
Batch Document Processing with ImageRamp Batch
PPTX
Windows 7
PPT
Chapter 05 pertemuan 7- donpas - manajemen data
PDF
Mobile Cloud Capture: Customize your Data Capture on Mobile Devices with Proc...
PPT
Database Systems
PPTX
Chapter10
Multimedia database
A basic course on Reseach data management, part 2: protecting and organizing ...
8 Document Capture Must Haves, a Document Management Tutorial
Multimedia database
data resource management
5 data resource management
Uit9 ppt ch08_au_rev
Automatic file naming and routing for scanned documents and existing files.
An Introduction to Document Scanning, Understanding Your Requirements
Database Systems
Multimedia db system
Alternate Data Streams
Batch Document Processing with ImageRamp Batch
Windows 7
Chapter 05 pertemuan 7- donpas - manajemen data
Mobile Cloud Capture: Customize your Data Capture on Mobile Devices with Proc...
Database Systems

Viewers also liked (14)

PDF
Micro-Controller based jimma university Interactive voice response System
DOCX
Hosting the archangels in your home
PPTX
5. pre production(20
PPTX
Bba203 unit 2 operating system concepts
PPTX
Data cleansing
PPTX
Equipo 5 Diferidos
PPTX
HOW TO PROCESS DATA IN VARIOUS GEO'S A COMPARATIVE ANALYSIS BY SANJEEV SINGH...
PDF
Comprehensive Validation with Laravel 4
PPT
Bitt I 2008 Lec1
PPTX
HIPPA COMPLIANCE (SANJEEV.S.BHARWAN)
PPT
Image Analysis using Definiens Professional 4
PPTX
Hyperspectral Imaging
PDF
CVSuite Art of Data webinar series: Presenting Data - Crafting Compelling Sto...
Micro-Controller based jimma university Interactive voice response System
Hosting the archangels in your home
5. pre production(20
Bba203 unit 2 operating system concepts
Data cleansing
Equipo 5 Diferidos
HOW TO PROCESS DATA IN VARIOUS GEO'S A COMPARATIVE ANALYSIS BY SANJEEV SINGH...
Comprehensive Validation with Laravel 4
Bitt I 2008 Lec1
HIPPA COMPLIANCE (SANJEEV.S.BHARWAN)
Image Analysis using Definiens Professional 4
Hyperspectral Imaging
CVSuite Art of Data webinar series: Presenting Data - Crafting Compelling Sto...

Similar to Bba203 unit 2data processing concepts (20)

PPTX
Database Management and it is definition
PPTX
Data Science Process.pptx
PPT
Metadata For Preservation Delos
PPTX
PPT
Understanding EDP (Electronic Data Processing) Environment
PPTX
File organization 01
PPTX
File management
PPTX
Presentation and architecture FDBMS Unit-1.pptx
PPT
D I T211 Chapter 1
PPT
D I T211 Chapter 1 1
PPT
Database, Lecture-1.ppt
PPTX
Dbms mca-section a
PPTX
DATABASE SYSTEMS VS ROUTINE FILE SYSTEMS
PPTX
Information management systems
PDF
Degonto file management
PPTX
chapter 1 HARDWARE AND NETWORKING SERVICE.pptx
PDF
Information Centric Network And Developing Channel Coding...
PPTX
Application portfolio development.advadisadvan.pptx
PPTX
Module 1_Overview of Database Management System
Database Management and it is definition
Data Science Process.pptx
Metadata For Preservation Delos
Understanding EDP (Electronic Data Processing) Environment
File organization 01
File management
Presentation and architecture FDBMS Unit-1.pptx
D I T211 Chapter 1
D I T211 Chapter 1 1
Database, Lecture-1.ppt
Dbms mca-section a
DATABASE SYSTEMS VS ROUTINE FILE SYSTEMS
Information management systems
Degonto file management
chapter 1 HARDWARE AND NETWORKING SERVICE.pptx
Information Centric Network And Developing Channel Coding...
Application portfolio development.advadisadvan.pptx
Module 1_Overview of Database Management System

Recently uploaded (20)

PDF
GIÁO ÁN TIẾNG ANH 7 GLOBAL SUCCESS (CẢ NĂM) THEO CÔNG VĂN 5512 (2 CỘT) NĂM HỌ...
PDF
V02-Session-4-Leadership-Through-Assessment-MLB.pdf
PPTX
macro complete discussion with given activities
PDF
FAMILY PLANNING (preventative and social medicine pdf)
PDF
Developing speaking skill_learning_mater.pdf
PPT
hsl powerpoint resource goyloveh feb 07.ppt
PDF
BSc-Zoology-02Sem-DrVijay-Comparative anatomy of vertebrates.pdf
PPTX
MMW-CHAPTER-1-final.pptx major Elementary Education
PPTX
Unit1_Kumod_deeplearning.pptx DEEP LEARNING
PDF
New_Round_Up_6_SB.pdf download for free, easy to learn
PDF
Global strategy and action plan on oral health 2023 - 2030.pdf
PDF
Jana-Ojana Finals 2025 - School Quiz by Pragya - UEMK Quiz Club
PDF
horaris de grups del curs 2025-2026 de l'institut
PDF
Physical pharmaceutics two in b pharmacy
PDF
The 10 Most Inspiring Education Leaders to Follow in 2025.pdf
PDF
Design and Evaluation of a Inonotus obliquus-AgNP-Maltodextrin Delivery Syste...
PDF
Kalaari-SaaS-Founder-Playbook-2024-Edition-.pdf
PDF
3-Elementary-Education-Prototype-Syllabi-Compendium.pdf
PPTX
Juvenile delinquency-Crim Research day 3x
PDF
NGÂN HÀNG CÂU HỎI TÁCH CHỌN LỌC THEO CHUYÊN ĐỀ TỪ ĐỀ THI THỬ TN THPT 2025 TIẾ...
GIÁO ÁN TIẾNG ANH 7 GLOBAL SUCCESS (CẢ NĂM) THEO CÔNG VĂN 5512 (2 CỘT) NĂM HỌ...
V02-Session-4-Leadership-Through-Assessment-MLB.pdf
macro complete discussion with given activities
FAMILY PLANNING (preventative and social medicine pdf)
Developing speaking skill_learning_mater.pdf
hsl powerpoint resource goyloveh feb 07.ppt
BSc-Zoology-02Sem-DrVijay-Comparative anatomy of vertebrates.pdf
MMW-CHAPTER-1-final.pptx major Elementary Education
Unit1_Kumod_deeplearning.pptx DEEP LEARNING
New_Round_Up_6_SB.pdf download for free, easy to learn
Global strategy and action plan on oral health 2023 - 2030.pdf
Jana-Ojana Finals 2025 - School Quiz by Pragya - UEMK Quiz Club
horaris de grups del curs 2025-2026 de l'institut
Physical pharmaceutics two in b pharmacy
The 10 Most Inspiring Education Leaders to Follow in 2025.pdf
Design and Evaluation of a Inonotus obliquus-AgNP-Maltodextrin Delivery Syste...
Kalaari-SaaS-Founder-Playbook-2024-Edition-.pdf
3-Elementary-Education-Prototype-Syllabi-Compendium.pdf
Juvenile delinquency-Crim Research day 3x
NGÂN HÀNG CÂU HỎI TÁCH CHỌN LỌC THEO CHUYÊN ĐỀ TỪ ĐỀ THI THỬ TN THPT 2025 TIẾ...

Bba203 unit 2data processing concepts

  • 2. Outlines •Introduction •Data Processing Concepts •Data Processing Activities •Data Processing Cycle •Data Hierarchy •Data File structures •Data File Structures •Application Portfolio Management •Introduction to Micro Database Manager
  • 3. 3.1 Data Processing Concepts Data The word "data" is the plural of datum, which means facts, observations, occurrence and observations. Data are representations of facts pertaining to people, things, ideas and events. Data are represented by symbols such as alphabets, numerals or special symbols. Data Processing Data processing is the act of manipulating or handling data in some manner. Thus, the idea of processing is to transform data into information Thus, data processing can be defined as a series of actions or steps, which converts data into useful information.
  • 4. Information Information can be defined as 'data transformed into useful and meaningful form for specific purpose'. Thus, data is not useful until it is organised and manipulated because after that only data becomes information.
  • 5. 3.2 Data Processing Activities Data processing consists of all those activities, which are required to convert data into information. There are some tools, which help in processing of data; these tools can be manual, mechanical, electromechanical or electronic such as manual tools as paper and pencil, mechanical tools as filing cabinets, electromechanical tools such as typewriters and adding machines and electronic tools such as calculators and computers.
  • 6. 3.3 Data Processing Cycle 1.Input: The term input refers to the activities required to record data and to make it available for processing. The input can also include the steps necessary to check, verify and validate data contents. 2.Processing: The term processing denotes the actual data manipulation techniques such as classifying, sorting, calculating,summarising, comparing, etc. that convert data into information. 3.Output: It is a communication function, which transmits the information generated after processing of data, to persons who need the information. •Sometimes output also includes decoding activity, which converts the electronically generated information into human-readable form.
  • 7. 4.Storage: It involves the filing of data and information for future use.
  • 8. 2.7 Data Hierarchy It shows arrangement of data in hierarchical form having following fields,records, files and so on. A data field will keep date as September 8, 1971. A record is shows details about an employee with name fields, address fields, date of birth field etc. A file carries related records. It helps in keeping track of employee details. Files are arranged in database using DBMS. In case of data storage, data fields have bytes which consist of bits.
  • 9. 3.8 Data File Structures This is a particular way that information is encoded for storage in a computer file. Since a disk drive or indeed any computer storage, can store only bits, the computer must have some way of converting information to 0s and 1s and vice-versa. There are different kinds of formats for different kinds of information. Within any format type, e.g. word processor documents, there will typically be several different formats. File formats are divided into proprietary and open formats.
  • 10. 1. Generality It is seen that there are certain file formats which are used to store only particular type of data. The formats such as JPEG are used to store static photographic images while GIF format is used to store images and animations. Apart from this, QuickTime format stores certain types of multimedia. 2. Specifications Many file formats, including some of the most well- known file formats, often have a published specification document (often with a reference implementation) that describes exactly how the data is to be encoded and which can be used to determine whether or not a particular program treats a particular file format correctly.
  • 11. 3. Identifying the Type of a File A method is required to determine the format of a particular file within the file system- an example of metadata. 4. Filename extension An important feature in operating systems is to determine the format of file based on section of its name following final period which is known as the filename extension. We see that an HTML documents will have extension of .html or .htm, which an image file is having an extension of .gif, .PNG etc.
  • 12. For example, a company logo may be needed in both .tif format (for publishing) and .gif format (for web sites). With the extensions visible, these would appear at the unique filenames "CompanyLogo.tif" and "CompanyLogo.gif". With the extensions hidden, these would both appear to have the identical filename "Company Logo", making it more difficult to determine, which one to select for a particular application. 5. Internal metadata A second way to identify a file format is to store information regarding the format inside the file itself. Usually, such information is written in one (or more) binary string(s), tagged or raw texts placed in fixed, specific locations within the file. Since the easiest place to locate them is at the beginning of it, this area is usually called a file header when it is greater than a few bytes or a magic number if it is just a few bytes long.
  • 13. 6. File header First, the meta-data contained in a file header are not necessarily stored only at the beginning of it but might be present in other areas too, often including the end of the file that depends on the file format or the type of data it contains. Character-based (text) files have character- based human-readable headers,whereas binary formats usually feature binary headers, the need to read all the bytes/records before it. 7. External metadata A good way to store a file format is to store the information about format in file system instead of keeping within the file itself. Such idea not only keeps the metadata away from main data and name but also not very safe as compared to either file extensions.
  • 14. 8. Mac OS type-codes Mac OS File System will able to store codes for creator and types that serves as part of directory entry for each file. They are called as OS Types or HyperCard stack file. Type of code will show the format of file, while creator code specifies default program to run. 9. OS/2 Extended Attributes It found that HPFS, FAT12 and FAT16 file systems allows storage of extended attributes with files which contains an arbitrary set of triplets with name, coded type for value and a value having different names. 10. POSIX extended attributes On UNIX and Unix-like systems, ext2, ext3, ReiserFS version 3, XFS, JFS, FFS and HFS+ file systems allow storage of extended attributes with files.
  • 15. 11. PRONOM Unique Identifiers (PUIDs) PRONOM Persistent Unique Identifier is an extension of constant, special and definite identifiers which is for file formats that was created by National Archives of UK which serve as part of its PRONOM technical registry service. 12. MIME types MIME types are useful in Internet-related applications that carries standardised system of identifiers having type and sub-type which is separated by slash. 13. File content based format identification It is another way of finding file format by way of file contents that shows distinguishable patterns.
  • 16. 14. File format identifiers (FFIDs) File format identifiers is also a way to find file formats as per their origin and file category. It has several digits which are of form NNNNNNNNN-XX-YYYYYYY, where NNNNNNNN as per company/standards organisation database and XX and YYYYYYY shows file type in hexadecimal.
  • 17. 3.9 Application Portfolio Management. Application Portfolio Management is a system which is applied in medium to large size Information Technology organisations. Such system uses lessons of financial portfolio management in order to verify financial benefits of application to be compared with costs of the application maintenance and operations. Portfolio
  • 18. Definition of an application Application in case of application portfolio management is an important aspect. Application software: There are certain executable software components which are used to create, update, manage, calculate or display information for particular business. Software component: This is an executable computer instructions having single deployment container which cannot be broken further
  • 19. Inclusions The purpose of inclusion is: To provide three web services: Invoice Create, Invoice Search and Invoice Detail Get. To have a service-oriented business application where a user interface to get invoice which calls Invoice Create service. To have a legacy system having rich client, server-based middle tier and database. To pull data from database using website publishing system and publishes it to HTML format as sub-site on public URL.
  • 20. Exclusions The following are not applications: HTML website Having database which is not part of series of steps for business values. Web service which is incapable of being set of steps. Having a stand-alone batch script which compares contents of databases by making calls
  • 21. 3.10 Introduction to Micro Data Base Manager Micro DB Manager is a database abstraction class written in PHP using object-oriented technologies. The class realises the following functionalities: Connecting to the database Executing queries Converting results to associative array Getting selected rows Getting affected rows Getting last insert id Getting the number of executed queries Getting the execution time of the queries Getting error messages and codes.
  • 22. 3.11 Glossary 1. Data - It is related to facts, observations, occurrence of information. 2. Data processing - It is a processing of carrying out data in particular manner. 3. Processing - It is a procedure of giving some meaning to data. 4. Application Portfolio Management - It is a system applied in medium to large size Information Technology organisations.