Learner Reviews & Feedback for Big Data Integration and Processing by University of California San Diego

4.4 stars (2,402 ratings)

About the Course

At the end of the course, you will be able to:

* Retrieve data from example database and big data management systems

* Describe the connections between data management operations and the big data processing patterns needed to utilize them in large-scale analytical applications

* Identify when a big data problem needs data integration

* Execute simple big data integration and processing on Hadoop and Spark platforms

This course is for those new to data science. Completion of Intro to Big Data is recommended. No prior programming experience is needed, although the ability to install applications and utilize a virtual machine is necessary to complete the hands-on assignments. Refer to the specialization technical requirements for complete hardware and software specifications.

Hardware Requirements: (A) Quad Core Processor (VT-x or AMD-V support recommended), 64-bit; (B) 8 GB RAM; (C) 20 GB disk free. How to find your hardware information: (Windows): Open System by clicking the Start button, right-clicking Computer, and then clicking Properties; (Mac): Open Overview by clicking on the Apple menu and clicking “About This Mac.” Most computers with 8 GB RAM purchased in the last 3 years will meet the minimum requirements. You will need a high speed internet connection because you will be downloading files up to 4 GB in size.

Software Requirements: This course relies on several open-source software tools, including Apache Hadoop. All required software can be downloaded and installed free of charge (except for data charges from your internet provider). Software requirements include: Windows 7+, Mac OS X 10.10+, Ubuntu 14.04+ or CentOS 6+ VirtualBox 5+....

Top reviews

SB

Oct 21, 2020

Hello Gentlemen,

This course was very helpful for me. It enhanced my knowledge of Big Data Integration. Thank you so much for providing me with such important knowledge. Thank you once again.

AA

Mar 5, 2018

It was a good course, but it could have been better if some Spark examples were also provided in other languages such as Java; people without a background in Python may find it difficult.

376 - 400 of 509 Reviews for Big Data Integration and Processing

By Jürgen B

Oct 31, 2018

Good overview.

By Alejandro S M

Apr 23, 2020

Great for db

By Mario L

Aug 6, 2017

has bugs

By Rohit K S

Oct 12, 2020

Nice!!

By HONGWEI Z

Oct 18, 2017

G

By Johan A P O

Nov 10, 2019

Last week was a disaster in terms of giving the necessary educational resources. I found it extremely hard to finish the assignment because I couldn't understand the knowledge set required to do it.

I think you must work on making sure students are trained in the functions that you will ask them to use at the end. It was tremendously disappointing to find such interesting tasks and yet be unable to see any clear path to perform even the first actions.

I had to do a lot of research outside the platform and dig up old replies in the forum just to get hints about what I had to do to find the answers you were requesting. If you consider what you explained to be sufficient, you're applying an unfair filter to students.

If you didn't mean that, please adjust either this whole module to focus on

* PySpark syntax

* clear use cases in Data retrieval and analysis

* evaluating the syntax of each function that you will request later

Or just change the last module so that it matches what you've taught. Thanks; despite these struggles, I was able to learn.

By Sarwar A

Oct 7, 2020

I am writing a review not only for this course but for the previous two courses as well.

The points that I want to make:

The first two courses were okay as far as the theory is concerned, but I am very disappointed with this course for the following reasons:

1. Not enough exercises for MongoDB.

2. That means we have to go elsewhere to learn more about MongoDB.

3. Too many tools are outlined in this course, but in return there are only a few quizzes, each comprising hardly more than six questions.

4. The instructors could have opted for more quizzes on Apache Spark, SparkSQL, MongoDB, and Spark Streaming.

5. The creator of this specialization should add two more courses down the line: one named "Querying Databases using SparkSQL and MongoDB" and another on "Spark streaming and Splunk".

Overall I didn't like this course at all.

I would like to tell future learners: don't register for this course if you want to take lessons on MongoDB, Spark SQL, Spark Streaming, and Splunk. Look for other courses on Coursera if you want to take lessons on the above frameworks.

By Dana B

Jul 14, 2021

I really enjoy working on the topic of Big Data. I also think that the course structure and theoretical content as such are very useful and logical. Hence the 3 stars. However, the hands-on assignments and packages provided are outdated, and getting the environment to run properly takes a lot of time and programming knowledge that I, for one, do not have. Also, the data in the hands-on assignments has changed, so it is not always possible to reproduce results from the assignments, which is really annoying if these results are part of a quiz. Generally, I do not think that solutions to circumvent errors due to outdated packages and data should be sourced and applied by the student through the forum. It should be in the interest of Coursera and / or the instructors to test the environment and provide updates where necessary. I really have to consider whether I want to continue with the next modules and Coursera in general, given that most of my time is spent on getting the environment for the hands-on assignments running.

By Tina L

Jan 16, 2018

The explanations in the video lectures are sometimes too complicated to understand. The course should consider that students come from different industries. For example, the disease/gene relationships could simply be replaced by GeneA, DiseaseA, etc. Also, the slides are not clear enough for students to capture the key points. They are not good for review, since the relationships between the list items are truly vague. Overall, the lectures are just difficult to understand, and sometimes even cause confusion.

By ZHE C

May 7, 2017

The course content is critical, as it appears in many interviews, and a fundamental understanding is important for beginners learning this new area. However, I think the software (Spark or MongoDB) could be taught in a more systematic way (or the course could at least point out some resources that help people learn them based on individual needs). I understand this course is for beginners and people are supposed to learn more deeply by themselves, but a road map would be helpful and would reduce the pain of finishing the tests.

By Lomiarz

Feb 4, 2017

The course was good enough... but the exercises were very simple. Only the final one was a little bit challenging. For a guy who has been in the IT business for a while, it's rather too simple. Besides, I've learned Spark basics, which is super cool... so thanks for that.

Maybe you could consider building a Docker image instead of using virtual machines. A VM is OK, but I think that Docker could simplify everything by avoiding unnecessary downloads, installations, etc.

Looking forward to the next Spark challenges :)

By To P H

Dec 24, 2018

Too many software issues/installation bugs hamper the learning process. The setup procedures for every quiz take up around 80% of the time, leaving only 20% for actually answering the quiz. Please reduce the number of quizzes or consolidate them so that learners only need to do the setup once. Mentors and instructors are almost entirely absent from the discussions in which students report setup/installation issues, and many students are left to figure out the problems themselves.

By Gustavo V

Oct 12, 2020

This course gives an introductory overview of Big Data processing and explains a variety of tools with little depth. Concepts are well explained, but the workshops take extra effort to complete because the tool versions are outdated, some questionnaires don’t match the Python workbooks, and some assignments for the final project don’t have practical examples in the lessons, so you have to use other learning resources.

By Bojan N

May 11, 2020

Good content, good instructors - they have a nice way of conveying a message, making it easy to follow. I'm rating this course 3 stars because the content is not kept up to date at all: materials, files, technical dependencies, versioning of the tools. It takes MUCH, MUCH more time to get the tools set up correctly (so that you are able to run the hands-on exercises) than the actual time spent studying.

By Tomas M

Jul 27, 2017

While the contents are very interesting and the lectures very thorough, the practical side has many drawbacks. For instance, connections to PostgreSQL did not work even after reading the FAQs, and the same was true of streaming data in Spark. There are not enough examples of syntax and coding to do the assignments correctly. Overall I am happy with the course, but it needs some improvements.

By Mauricio H

Sep 8, 2019

So, in general, the course provides you with significant knowledge about big data integration and processing. However, there were simple exercises that could have been done faster if there had been no problems executing the commands. This problem leads students to quit the course.

I request the staff correct those errors in order to increase the approval rate.

By David T

Oct 23, 2016

Good experience of using the big data tools, but a total lack of engagement in the forums by the instructors and community mentors makes it hard going if anything goes wrong. The final quiz took me over 8 hours, mostly because there was no one to ask for hints when I was totally stuck and confused!

By Pranav K

May 23, 2021

The hands on exercises were very helpful and the course content was great. However, there were many issues while downloading datasets and configuring other applications. This should be updated so that students don't have to go to the forums every single time.

By Joren Z

Jul 5, 2017

The course covers interesting materials and seems thorough. It's mostly lectures and reading, and not so much actually working with the technology. Since the latter tends to be the hardest part, the overall difficulty remains on the low end of the scale.

By Rashmi U

Nov 28, 2016

I feel the contents of this course were great, no second thoughts about it. It makes your concepts crystal clear. But I faced lots of issues while practicing the hands-on exercises and did not get proper feedback or a response to any of my queries.

By Shruthi R

Jul 7, 2019

The hands-on dataset installation had lots of problems, and Spark and MongoDB hardly worked even after multiple installations. I tried many ways to get them to work, but there was no benefit.

By Ken C

Oct 15, 2017

Lots of technical issues with assignments. Spent a lot of time troubleshooting issues that have been around for 9 months or more and never addressed. Seems like this course has been abandoned by creators.

By A R

Sep 19, 2019

Content was up to date, but the practice exercises are limited to the Cloudera platform and are too old. They need to be updated with more use cases and more exercises.

Thanks Coursera :)

By Francesca S

May 6, 2018

The explanations for the hands-on exercises are poor. I had to waste a lot of time and frequently consult forum discussions as well as other online tutorials.

By Rahul R

Jun 8, 2017

The course material is not sufficient to work out the exercises. For the Spark final quiz you will have to take up another course to pass this one.