IR First Chapter
Admas University
Hargeisa Somaliland
Department of ICT
Concepts of Information
Data
Think of data as a "raw material": it needs to be
processed before it can be turned into something useful,
hence the need for "data processing".
Data comes in many forms: numbers, words, symbols.
Data relates to transactions, events and facts.
On its own, it is not very useful.
Information
• Information is data that has been processed in
such a way as to be meaningful to the person who
receives it.
• Note the key words: "processed" and "meaningful".
• It is not enough for data simply to be processed; it
has to be of use to someone. Otherwise, why bother?
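The step from data to information can be illustrated with a small sketch. The transaction records and the `summarize_sales` helper below are hypothetical and exist only for illustration: raw records (data) are processed into revenue totals per product, which a reader such as a shop manager could act on (information).

```python
from collections import defaultdict

# Hypothetical raw data: (product, quantity, unit price) per transaction.
transactions = [
    ("pen", 10, 0.5),
    ("notebook", 3, 2.0),
    ("pen", 4, 0.5),
]

def summarize_sales(records):
    """Process raw transactions into total revenue per product."""
    totals = defaultdict(float)
    for product, quantity, unit_price in records:
        totals[product] += quantity * unit_price
    return dict(totals)

# The summary is the "information": processed and meaningful to its reader.
summary = summarize_sales(transactions)
for product, revenue in sorted(summary.items()):
    print(f"{product}: {revenue:.2f}")
```

The individual tuples say little on their own; the per-product totals are what a reader would actually use.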
Attributes of Information
Characteristics of good information are as follows:
reliable,
timely,
accessible,
cost-effective,
accurate,
fit for purpose,
relevant, and
understandable by the user.
Information life cycle
[Figure: information life cycle. Stages shown: Information Creation, Information Acquisition, Information Organization, Information Storage, Information Distribution, Information Use]
Motivation Behind IR System
• Information explosion
– Retrieval mechanisms have not kept pace with the
growth in information
– The resulting overload makes storage and retrieval of
information difficult
– Because of the overload, the search space becomes
large
– The search space contains information items,
which could be in the form of books,
journals, etc.
Information Retrieval (IR)
Information Retrieval Systems
Activities of IR
• Content analysis
– Concerned with describing the contents of documents
– Deals with the representation of the document
– Involves the analysis and assignment of terms or identifiers that
are capable of representing document content and that can be used
as access points to that document
– Indexing and cataloguing are some of the processes used to
represent the thought content of the document
• Information structure
– Concerned with exploiting relationships between documents to
improve the efficiency and effectiveness of retrieval strategies
• Evaluation
– Deals with measuring the effectiveness of retrieval
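As an illustration of the evaluation activity, the sketch below computes precision and recall. The slides do not name a specific measure, so the choice of these two is an assumption; they are widely used ways to quantify retrieval effectiveness. The document identifiers and the `precision_recall` helper are hypothetical.

```python
def precision_recall(retrieved, relevant):
    """Return (precision, recall) for one query.

    precision = relevant items retrieved / all items retrieved
    recall    = relevant items retrieved / all relevant items
    """
    retrieved = set(retrieved)
    relevant = set(relevant)
    hits = retrieved & relevant
    precision = len(hits) / len(retrieved) if retrieved else 0.0
    recall = len(hits) / len(relevant) if relevant else 0.0
    return precision, recall

# Hypothetical result list and relevance judgements for a single query.
retrieved_docs = ["d1", "d2", "d3", "d4"]
relevant_docs = ["d2", "d4", "d7"]

p, r = precision_recall(retrieved_docs, relevant_docs)
print(f"precision={p:.2f} recall={r:.2f}")
```

Here two of the four retrieved documents are relevant, and two of the three relevant documents were found.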
Data retrieval vs. information retrieval systems

Aspect               Data retrieval                                  Information retrieval
Data organization    Structured (clear semantics: name, age, ...)    Unstructured (no fields other than text)
Context              Data                                            Information
Data object          Table                                           Document
Matching             Exact match                                     Partial match, best match
Items wanted         Matching                                        Relevant
Query language       SQL (artificial)                                Free text (natural language, Boolean)
Query specification  Complete                                        Incomplete
Accuracy             100% (results are always correct)               <50%
Error response       Sensitive                                       Non-sensitive
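The difference between exact matching and best matching can be sketched in a few lines. Everything below is a simplified assumption for illustration: the `people` table, the `documents` collection, and the term-overlap score are hypothetical, and real IR systems use far richer ranking functions.

```python
# Hypothetical structured table (data retrieval side).
people = [
    {"name": "Amina", "age": 21},
    {"name": "Yusuf", "age": 34},
]

# Hypothetical unstructured document collection (information retrieval side).
documents = {
    "doc1": "information retrieval systems rank documents",
    "doc2": "database systems store structured data",
    "doc3": "cooking recipes for beginners",
}

def exact_match(rows, field, value):
    """Data retrieval: a row either satisfies the condition or it does not."""
    return [row for row in rows if row[field] == value]

def best_match(docs, query):
    """Information retrieval: rank documents by number of shared query terms."""
    query_terms = set(query.lower().split())
    scored = []
    for doc_id, text in docs.items():
        overlap = len(query_terms & set(text.lower().split()))
        if overlap > 0:
            scored.append((overlap, doc_id))
    # Highest overlap first; ties broken by document id for a stable order.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [doc_id for _, doc_id in scored]

exact = exact_match(people, "age", 34)
ranked = best_match(documents, "retrieval systems documents")
print(exact)
print(ranked)
```

The exact-match query returns only rows that satisfy the condition completely, while the best-match query also returns `doc2`, which shares just one term with the query, ranked below `doc1`.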
Basic Concepts of IR
The effective retrieval of relevant information is directly affected by
two things:
1) The user task
– The user is anyone who needs to find some information
– User groups can be distinguished
• By their knowledge of the system
– Novice vs. experienced users
– End users vs. information specialists
• By their domain knowledge
– Domain experts vs. the general public
• By their information needs
– Need to locate a particular item, need some information, or need all information on a
subject
2) Logical view of the documents
• Documents in a collection are frequently represented through a set of index
terms or keywords
Index terms
• An index term is a keyword or group of related words that has some
meaning of its own
• It is simply a word whose semantics help in remembering the
document's main theme
• It might be extracted directly from the text of the documents or
specified by a human expert
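Extracting index terms directly from document text can be sketched as follows. This is a minimal illustration under stated assumptions: the stopword list, the sample documents, and the helper names are hypothetical, and the inverted index (a mapping from each term to the documents containing it) is a common structure that the slides do not mention by name.

```python
from collections import defaultdict

# Hypothetical, deliberately tiny stopword list.
STOPWORDS = {"a", "an", "and", "of", "the", "in", "to", "is", "for"}

# Hypothetical document collection keyed by document id.
documents = {
    1: "The growth of information and the retrieval of documents",
    2: "Indexing documents for retrieval",
}

def extract_index_terms(text):
    """Lowercase the text, strip punctuation, and drop stopwords."""
    words = (word.strip(".,;:!?").lower() for word in text.split())
    return sorted({w for w in words if w and w not in STOPWORDS})

def build_inverted_index(docs):
    """Map each index term to the sorted ids of documents containing it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for term in extract_index_terms(text):
            index[term].add(doc_id)
    return {term: sorted(ids) for term, ids in index.items()}

index = build_inverted_index(documents)
for term in sorted(index):
    print(term, index[term])
```

Each index term then acts as an access point: looking up a term returns the documents it represents.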
Structure of an IRS
An information retrieval system (IRS) serves as a bridge between the world of
authors and the world of readers/users.
That is, writers present a set of ideas in a document using a set of concepts.
Users then query the IR system for relevant documents that satisfy their
information need.
Typical IR Task
Indexing…
• Documents
– Items in the file
• Requests (queries)
– Expressions of the users' information needs
IR Functions
IR Challenges
11/2/2022
Thank You !!!