Recent years have seen a dramatic growth of natural language text data, including web pages, news articles, scientific literature, emails, enterprise documents, and social media such as blog articles, forum posts, product reviews, and tweets. Text data are unique in that they are usually generated directly by humans rather than a computer system or sensors, and are thus especially valuable for discovering knowledge about people’s opinions and preferences, in addition to many other kinds of knowledge that we encode in text.
Text Retrieval and Search Engines
This course is part of Data Mining Specialization
Instructor: ChengXiang Zhai
Sponsored by Louisiana Workforce Commission
59,360 already enrolled
(955 reviews)
Skills you'll gain
Details to know
Add to your LinkedIn profile
14 assignments
See how employees at top companies are mastering in-demand skills
Build your subject-matter expertise
- Learn new concepts from industry experts
- Gain a foundational understanding of a subject or tool
- Develop job-relevant skills with hands-on projects
- Earn a shareable career certificate
Earn a career certificate
Add this credential to your LinkedIn profile, resume, or CV
Share it on social media and in your performance review
There are 7 modules in this course
You will become familiar with the course, your classmates, and our learning environment. The orientation will also help you obtain the technical skills required for the course.
What's included
2 videos6 readings2 assignments1 plugin
During this week's lessons, you will learn of natural language processing techniques, which are the foundation for all kinds of text-processing applications, the concept of a retrieval model, and the basic idea of the vector space model.
What's included
6 videos1 reading2 assignments
In this week's lessons, you will learn how the vector space model works in detail, the major heuristics used in designing a retrieval function for ranking documents with respect to a query, and how to implement an information retrieval system (i.e., a search engine), including how to build an inverted index and how to score documents quickly for a query.
What's included
6 videos1 reading2 assignments
In this week's lessons, you will learn how to evaluate an information retrieval system (a search engine), including the basic measures for evaluating a set of retrieved results and the major measures for evaluating a ranked list, including the average precision (AP) and the normalized discounted cumulative gain (nDCG), and practical issues in evaluation, including statistical significance testing and pooling.
What's included
6 videos2 readings2 assignments1 programming assignment
In this week's lessons, you will learn probabilistic retrieval models and statistical language models, particularly the detail of the query likelihood retrieval function with two specific smoothing methods, and how the query likelihood retrieval function is connected with the retrieval heuristics used in the vector space model.
What's included
7 videos1 reading2 assignments
In this week's lessons, you will learn feedback techniques in information retrieval, including the Rocchio feedback method for the vector space model, and a mixture model for feedback with language models. You will also learn how web search engines work, including web crawling, web indexing, and how links between web pages can be leveraged to score web pages.
What's included
8 videos1 reading2 assignments
In this week's lessons, you will learn how machine learning can be used to combine multiple scoring factors to optimize ranking of documents in web search (i.e., learning to rank), and learn techniques used in recommender systems (also called filtering systems), including content-based recommendation/filtering and collaborative filtering. You will also have a chance to review the entire course.
What's included
10 videos1 reading2 assignments1 programming assignment1 plugin
Instructor
Offered by
Why people choose Coursera for their career
Learner reviews
955 reviews
- 5 stars
65.89%
- 4 stars
23.64%
- 3 stars
6.69%
- 2 stars
1.67%
- 1 star
2.09%
Showing 3 of 955
Reviewed on Aug 23, 2018
Need indetail inputs on algorithm usage and correct MeTA assignments with working scripts. That makes learning a complete curve.
Reviewed on May 18, 2020
A bit difficult to complete as the Quiz questions were tougher. But when you go through all, you might feel good.
Reviewed on Sep 16, 2019
This course goes through the basics of text retrieval systems with an appropriate speed. However, the contents are quite out-dated for 2019.
Recommended if you're interested in Data Science
Coursera Instructor Network
Amazon Web Services
DeepLearning.AI
Open new doors with Coursera Plus
Unlimited access to 10,000+ world-class courses, hands-on projects, and job-ready certificate programs - all included in your subscription
Advance your career with an online degree
Earn a degree from world-class universities - 100% online
Join over 3,400 global companies that choose Coursera for Business
Upskill your employees to excel in the digital economy