您的位置:首页 > 大数据

大数据业务学习笔记_学习业务成为一名出色的数据科学家

2020-08-09 04:06 302 查看

大数据业务学习笔记

意见 (Opinion)

A lot of aspiring Data Scientists think what they need to become a Data Scientist is :

许多有抱负的数据科学家认为,成为一名数据科学家需要具备以下条件:

  • Coding

    编码
  • Statistic

    统计
  • Math

    数学
  • Machine Learning

    机器学习
  • Deep Learning

    深度学习

And any other technical skills.

以及其他任何技术技能。

The above list is accurate; most of the Data Scientist qualification you need right now is what I list above. It is unavoidable, as many job listing right now always list these skills as a prerequisite. Just look at the example of Data Scientist job requirements and preferences below.

上面的清单是准确的; 我上面列出的是您现在需要的大多数数据科学家资格。 这是不可避免的,因为现在很多工作清单总是将这些技能列为前提条件。 只需看下面的数据科学家工作要求和偏好示例。

Taken from indeed.com 摘自确实网站

Most of the requirements sound technical; degree, coding, math, and stats. Although, there is an underlying business understanding requirement that you might not realize at first from this job advertisement.

大部分要求听起来都是技术性的; 学位,编码,数学和统计信息。 但是,有一个潜在的业务理解要求,您可能首先不会从此招聘广告中意识到。

If you look closely, they require someone that had experience in applying the analytical method to solve practical business problems. It implies your everyday task would consisting of solving the business problem, which in turn, you need to understand what kind of business the company runs and how the process itself works.

如果您仔细观察,他们会要求那些具有应用分析方法来解决实际业务问题的经验的人。 这意味着您的日常任务将包括解决业务问题 ,而这又需要您了解公司经营哪种业务以及流程本身如何运作。

You might ask, “Why do I need to understand it? Just create the machine learning model and the problem is solved, isn’t it?” Well, that line of thinking is dangerous, and I would explain why.

您可能会问:“为什么我需要了解它? 只需创建机器学习模型即可解决问题,不是吗?” 好吧,这种思路很危险,我将解释原因。

Just for a reminder, I would argue what makes you great as a Data Scientist is not only how well your coding skill is or how much you understand the statistical theory or even the master of business understanding, but it is a combination of many.

提醒您, 让我成为数据科学家的不仅仅在于您的编码技能如何,或者您对统计理论甚至对业务理解的掌握有多少而且还包括很多方面。

Anybody, of course, could agree or not with my opinion as I believe there are no specific skills that make you a great Data Scientist.

当然,任何人都可以同意或不同意我的观点,因为我相信没有特定的技能可以使您成为一名出色的数据科学家。

Data Scientist employment is hard. It would not easy to get in this field. With many applicants and people with a similar set of skills, you need to stand out. Business Understanding is the skill that would certainly separate you from all the fish in the ponds.

数据科学家的工作很难。 进入这个领域并不容易。 由于许多申请人和具有类似技能的人,您需要脱颖而出。 业务理解能力无疑会使您与池塘中的所有鱼区分开。

In my experience as a Data Scientist, there is no skill that I felt underrated as much as the business understanding skill. I even thought that you don’t need to understand the business in my early career. How wrong I was.

根据我作为数据科学家的经验,没有什么比业务理解技能低估了。 我什至以为您在我的早期职业中不需要了解业务。 我错了

I am not ashamed, though, to admit that I did not consider the business aspect essential at first because many data science education and books did not even teach us about this.

但是,我并不感到ham愧,因为我一开始并不认为业务方面是必不可少的,因为许多数据科学教育和书籍甚至都没有教过我们这一点。

So, why is it crucial to learn the business and how it impacts your employment as a Data Scientist?

那么,为什么学习业务至关重要,它又如何影响您作为数据科学家的工作呢?

Just imagine this situation. You work in the data department of the food industry with candy as their main product, and the company plans to release a new sour candy product. The company then ask the sales department to sell the product. Now, the sales department know that the company had a data department and requesting the data team to give new leads where they can sell sour candy.

试想一下这种情况。 您在食品工业的数据部门工作时,以糖果为主要产品,并且该公司计划发布一种新的酸味糖果产品。 然后,公司要求销售部门出售产品。 现在,销售部门知道该公司有一个数据部门,并要求数据团队提供新的线索以销售酸味糖果。

Before anybody complains that “This is not our job, we create a machine learning model!” or “I work as a data scientist, not in the sales department.” No, this is precisely what Data scientists do in the company; many of the projects are to work with another department for solving the company problem.

在有人抱怨“这不是我们的工作之前,我们创建了机器学习模型!” 或“我是数据科学家,而不是在销售部门。” 不,这正是数据科学家在公司中所做的; 许多项目将与另一个部门合作解决公司问题。

Back to our scenario, how do you correctly approach this problem then? You might think, “Just create a machine learning model to generate the leads.” Yes, it is on the right track, but how exactly you create the model? On what basis? Is the business question even viable enough to solved using the machine learning model?

回到我们的情况,那么您如何正确解决此问题? 您可能会想,“只要创建一个机器学习模型来生成线索即可。” 是的,它是在正确的轨道上,但是您如何精确地创建模型? 在什么基础上? 业务问题是否足够可行,可以使用机器学习模型解决?

You can’t just suddenly using a machine learning model, right? This is why business understanding is so crucial as a Data Scientist. You need to understand how the candy business in more detail. Keep asking a question like,

您不能只是突然使用机器学习模型,对吗? 这就是为什么业务理解对数据科学家如此重要的原因。 您需要更详细地了解糖果业务。 继续问一个问题,

  • What kind of business question exactly we want to solve?”

    我们到底想解决什么样的业务问题?”

  • “Would we even need a machine learning model?”

    “我们甚至需要机器学习模型吗?”

  • “What kind of attributes related to candy sales?”

    “与糖果销售相关的属性是什么?”

  • “How is the candy selling strategy and practice within and outside of the company?”.

    “公司内部和外部的糖果销售策略和实践如何?”

And many more business questions you could think of related to the business.

还有更多您可能想到的与业务相关的业务问题。

It is important to know what kind of business your company run and everything related to the business as your work as a data scientist would need you to make sense of the data.

了解您的公司经营哪种业务以及与该业务相关的所有事项非常重要,因为作为数据科学家,您需要了解数据

While it is easy to say that business understanding skill is essential, it is not easy to gain one.

虽然容易理解业务理解技能是必不可少的,但要获得一项技能却并不容易。

Education is one thing; for example, you might have a higher chance to stand out to applying for a data science position in the PR company if your educational background is communication compared to someone with a biology degree.

教育是一回事; 例如,与具有生物学学位的人相比,如果您的教育背景是交流,那么您可能有更大的机会脱颖而出在PR公司申请数据科学职位。

Although work experience quickly covers this. Working experience with another job title in a similar business industry would provide significant leverage, as you already understand the business process.

尽管工作经验很快就涵盖了这一点。 由于您已经了解业务流程,因此在类似的业务行业中拥有另一个职务的工作经验将提供重要的影响。

For a fresher, it might be a hard industry to break in, but in hindsight, there are many benefits as a fresher as well. I remember Tyler Folkman’s post on his LinkedIn why the industry should consider recent graduates, and I agree. The recent graduate could:

对于新生,这可能是一个很难进入的行业,但是事后看来,新生也有很多好处。 我记得泰勒·福克曼(Tyler Folkman)在其LinkedIn帖子,为什么该行业应考虑应届毕业生,我也同意。 应届毕业生可以:

  1. Come with preparation

    附带准备
  2. Hungry to learn about the business

    渴望了解业务
  3. Make an impact

    产生影响

Freshers should a target for companies that have established their data journeys. The company could teach many things about business more easily as fresher have no experience at all in the business world. In my opinion, never count out the freshers.

新生应该成为建立数据旅程的公司的目标。 该公司可以更轻松地教授有关业务的许多事情,因为刚开始的新手根本没有业务领域的经验。 我认为,永远不要指望新生。

I also would tell you about my experience, as well. When I first get the data project, I was not thinking about the business at all and just tried to build the machine learning model. And how disastrous it turns out to be.

我也将告诉您我的经历。 当我第一次获得数据项目时,我根本没有考虑业务,只是尝试构建机器学习模型。 事实证明这是多么的灾难。

I present the model to the related parties with hype in my brain. My model result is good, I know everything about the data, and I know the theory of the model I used. Easy peasy, right? So, wrong. It turns out that the user did not care about the model I used. They are more interested in knowing if I already consider a business approach “A” or why I used the data that should not relate at all to the business. It ends with a discussion that I need more business training.

我在脑海中大肆宣传该模型。 我的模型结果很好,我了解所有有关数据的知识,并且知道我使用的模型的理论。 轻轻松松吧? 大错特错。 事实证明,用户并不关心我使用的模型。 他们更想知道我是否已经考虑过业务方法“ A”,或者为什么我使用了与业务根本不相关的数据。 最后,我需要更多的业务培训。

It is embarrassing, but I am not ashamed at all to admit that it is my fault not to consider business understanding. I could be the best in model creation or statistic, but not knowing the business turns out to be a disaster. Since that day, I try to learn more about the business process itself, even before considering any of the technical things.

令人尴尬,但我完全不as愧承认不考虑业务了解是我的错。 在模型创建或统计方面,我可能是最好的,但我不知道这业务真是一场灾难。 从那天开始,即使在考虑任何技术问题之前,我也会尝试进一步了解业务流程本身。

结论 (Conclusion)

In my opinion, fresher or not, try to learn the business as much as possible.

我认为,无论是否新鲜,都应尽可能多地学习业务。

Focus on one industry you feel interested in; finance, banking, credit, automotive, candy, oil, etc. Every single business has a different approach and strategy; you just need to focus on learning the industry you like.

专注于您感兴趣的一个行业; 金融,银行,信贷,汽车,糖果,石油等。每一项业务都有不同的方法和策略; 您只需要专注于学习自己喜欢的行业即可。

Data scientist employment is hard. It was not easy to get into this field. With many applicants and many people with a similar set of skills, you need to stand out. Business understanding is the skill that will undoubtedly separate you from all the fish in the pond.

数据科学家的工作很难。 进入这个领域并不容易。 在许多申请人和具有相似技能的许多人中, 您需要脱颖而出。 业务理解能力无疑会使您与池塘中的所有鱼类区分开。

翻译自: https://towardsdatascience.com/learn-the-business-to-become-a-great-data-scientist-635fa6029fb6

大数据业务学习笔记

内容来自用户分享和网络整理,不保证内容的准确性,如有侵权内容,可联系管理员处理 点击这里给我发消息
标签: