Skip to content

Latest commit

 

History

History
202 lines (176 loc) · 17.4 KB

File metadata and controls

202 lines (176 loc) · 17.4 KB

1.About the Course:

2.Targeted Audience:

  • Anyone interested in Machine Learning.
  • Any data analysts who want to level up in Machine Learning.
  • Students who have at least high school knowledge in math and who want to start learning Machine Learning.
  • Any intermediate level people who know the basics of machine learning, including the classical algorithms like linear regression or logistic regression, but who want to learn more about it and explore all the different fields of Machine Learning.
  • Any people who are not that comfortable with coding but who are interested in Machine Learning and want to apply it easily on datasets.
  • Any people who are not satisfied with their job and who want to do something new in Machine Learning.

3.What you will Learn:

  • Master Machine Learning on Python.
  • Have a great intuition of many Machine Learning models.
  • Make accurate predictions by powerful analysis methods.
  • Make robust Machine Learning models.
  • Handle specific topics like Reinforcement Learning, NLP and Deep Learning and Handle advanced techniques like Dimensionality Reduction

4.What is Machine Learning:

  • Machine learning is a method of data analysis that automates analytical model building.
  • It is a branch of artificial intelligence based on the idea that systems can learn from data, find patterns and make decisions with minimal human intervention.
  • While artificial intelligence (AI) is the broad science of mimicking human abilities, machine learning is a specific subset of AI that trains a machine how to learn.
  • The processes involved in machine learning are similar to data mining and predictive modeling. Both require searching through data to look for patterns and adjusting program actions accordingly.
  • Many people are familiar with machine learning from shopping on the internet and being served ads related to their purchase. This happens because recommendation engines use machine learning to personalize online ad delivery in almost real-time.
  • Machine learning algorithms are often categorized as Supervised or Unsupervised or Semi-Supervised Learning.

4.1 Supervised Learning:

  • Supervised algorithms require a data scientist or data analyst with machine learning skills to provide both input and desired output, in addition to furnishing feedback about the accuracy of predictions during algorithm training.
  • Data scientists determine which variables, or features, the model should analyze and use to develop predictions.
  • Once training is complete, the algorithm will apply what was learned to new data.
  • Supervised learning is so named because the data scientist acts as a guide to teach the algorithm what conclusions it should come up with. and it is the more commonly used between the three.
  • It includes such algorithms as linear and logistic regression, multi-class classification, and support vector machines and etc.
  • Supervised learning requires that the algorithm’s possible outputs are already known and that the data used to train the algorithm is already labeled with correct answers.
  • For Example: Classification algorithm will learn to identify animals after being trained on a dataset of images that are properly labeled with the species of the animal and some identifying characteristics

4.2 Unsupervised Learning:

  • Unsupervised algorithms do not need to be trained with desired outcome data. Instead, they use an iterative approach called deep learning to review data and arrive at conclusions.
  • Unsupervised learning algorithms also uses an area called neural networks, they used for more complex processing tasks than supervised learning systems, including image recognition, speech-to-text and natural language generation.
  • These neural networks work by combing through millions of examples of training data and automatically identifying often subtle correlations between many variables. Once trained, the algorithm can use its bank of associations to interpret new data.
  • These algorithms have only become feasible in the age of big data, as they require massive amounts of training data.
  • Some examples of unsupervised machine learning algorithms include k-means clustering, principal and independent component analysis, and 8association rules*.

4.3 Which algorithm to choose for my problem:

  • Choosing to use either a supervised or unsupervised machine learning algorithm typically depends on factors related to the structure and volume of your data and the use case of the issue at hand.
  • A well-rounded data science program will use both types of algorithms(Known as Ensemble Learning) to build predictive data models that help stakeholders make decisions across a variety of business challenges.

4.4 Semi-Supervised Learning:

  • Semi-supervised learning is a class of machine learning tasks and techniques that also make use of unlabeled data for training – typically a small amount of labeled data with a large amount of unlabeled data.
  • Semi-supervised learning falls between unsupervised learning (without any labeled training data) and supervised learning (with completely labeled training data).
  • Many machine-learning researchers have found that unlabeled data, when used in conjunction with a small amount of labeled data, can produce considerable improvement in learning accuracy over unsupervised learning but without the time and costs needed for supervised learning.
  • As you may have guessed, semi-supervised learning algorithms are trained on a combination of labeled and unlabeled data. This is useful for the following reasons.
    1. the process of labeling massive amounts of data for supervised learning is often prohibitively time-consuming and expensive.
    2. What’s more, too much labeling can impose human biases on the model.
    3. That means including lots of unlabeled data during the training process actually tends to improve the accuracy of the final model while reducing the time and cost spent building it.
  • For the above Reasons, semi-supervised learning is a win-win for use cases like webpage classification, speech recognition, or even for genetic sequencing.

4.5 What is required to create a good machine learning systems:

  • Data preparation capabilities.
  • Algorithms – basic and advanced.
  • Automation and iterative processes.
  • Scalability.
  • Ensemble modeling.

4.6 Who's using it ?

  • Most industries working with large amounts of data have recognized the value of machine learning technology. Below listed are some of area's Machine Learning models are being used widely.
  • Financial services:
    • Banks and other businesses in the financial industry use machine learning technology for two key purposes:
    • to identify important insights in data, and prevent fraud.
    • The insights can identify investment opportunities, or help investors know when to trade.
    • Data mining can also identify clients with high-risk profiles, or use Cyber-Surveillance to pinpoint warning signs of fraud.
  • Health care:
    • Machine learning is a fast-growing trend in the health care industry, thanks to the advent of wearable devices and sensors that can use data to assess a patient's health in real-time.
    • The technology can also help medical experts analyze data to identify trends or red flags that may lead to improved diagnoses and treatment.
  • Retail:
    • Websites recommending items you might like based on previous purchases are using machine learning to analyze your buying history.
    • Retailers rely on machine learning to capture data, analyze it and use it to personalize a shopping experience, implement a marketing campaign, price optimization, merchandise supply planning, and for customer insights.
  • Oil and gas:
    • Finding new energy sources.
    • Analyzing minerals in the ground.
    • Predicting refinery sensor failure.
    • Streamlining oil distribution to make it more efficient and cost-effective.
    • The number of machine learning use cases for this industry is vast – and still expanding.

4.7 Did you know ?

  • In machine learning, a target Variable is called a label. where in statistics, it is called known as dependent variable.
  • A Independent variable in statistics is called a Input feature in machine learning.
  • A transformation in statistics is called feature creation in machine learning.

5.Installing Anaconda and Python in your Machine:

  • It is recommended to have at-least machine with 8GB-RAM. i prefer to go with Anaconda since it gives you all the libraries more handy
  • Open the following URL in our browser, Anaconda Installing User Guide.
  • Find 2 Different types of Installation modes available in the Doc Page as shown below.

  • If you are installing Anaconda in your Personal PC/Laptop you can proceed to download Anaconda for Python 3.7 and above, run the .Exe will ask you for Anaconda installation path don't change anything leave things as it and completed the installation.
  • Usually installation will take 20-30 min. Please find the below image for your reference.

  • If you want to use anaconda in PC/Laptop which does not have Admin access, recommend to use Minicoda a silent version of Anaconda will install everything by default in the machine. (Note: you are the complete responsibility for this action.)
  • please find the below images to Download and install Miniconda in your machine.

  • Miniconda Installation command start /wait "" Miniconda4-latest-Windows-x86_64.exe /InstallationType=JustMe /RegisterPython=0 /S /D=%UserProfile%\Miniconda3. Here replace the Miniconda version with your version

6.How to start with Spyder IDE:

  • If you want to get in hand with, Machine Learning and Data Science practices one of the best tool that i can suggest is Spyder IDE, which comes along with Anaconda installation.
  • As a pre-requisite before proceeding this topic, i suggest yout to look in to the following link to install anaconda in your machine. link to install Anaconda.
  • Once you completed installing anaconda in your machine, navigate to start and type Anaconda you will find the following Icon in start menu

  • By clicking on above Anaconda Navigator icon will open he following Dashboard screen.
  • please find Spyder IDE and click launch button to launch spyder IDE or simply you can search for sypder in Windows search and launch Spyder IDE.

  • Once you open the IDE, it is always recommended to move to your project directory. to navigate to the project folder use the File Explorer as follows, and create your py file by clicking New file creator icon and set it as your current working directory, if you want to switch Drive use URL navigator in top right corner of the IDE.

  • Image description below:

    1. Open New Py file
    2. Set Current working Directory
    3. File Explorer
    4. URL Navigator
    5. Ipython Console Output
    6. Writing area
    7. Save changes
  • It is recommended to follow the below steps when you get in handy with Spyder IDE.

    1. Open Spyder IDE
    2. Use file explorer or URL navigator and navigate to your project directory.
    3. open new File
    4. right click and "Set Current working Directory"
    5. type your program in writing area and check your output in IPython console
    6. save your program