PDF formatDay 8: Final Assignment Submit Assignment Instructions Use dataset from day 7...

Question

PDF formatDay 8: Final Assignment Submit Assignment Instructions Use dataset from day 7 -world_cities_pop.csv a) Remove all countries with Population as NaN. Call your DataFrame as df. Use df.shape to find the number of countries remaining in your DataFrame. You should work with this smaller DataFrame for the following questions. b) Select a country (it can be the name of your country or any other of your choice). For this country, find the number of cities and the population. c) Create a dot plot showing the cities in this country. No need to adjust the bounding box. d) Create a new DataFrame for this country with the cities with population  more than 1e5 only. Adjust the bounding box to be as tight as possible to the country boundaries. Create a dot plot. e) Create a Delaunay triangulation plot for this new DataFrame. Use same bounding box as in part (d). All parts weight 20 points.  Submit the jupyter notebook with the Python script and including your name and all plots as a single pdf labeled as: FirstName_LastName - Final Assignment.pdf Day 8: Content Overview This session is a wrap-up of the course. We will review the various roles and skills useful for Analytics. Readings · The reference  for this session is here Presentation Slides Big Data Visualization and Analytics -day 8   UCI DIVISION OF CONTINUING EDUCATION  STUDENT SERVICES  VACATION QUARTER REQUEST FORM      FORM INSTRUCTIONS: You are required to complete and submit this form to ImmigrationOfficials@ce.uci.edu before the end  of the current quarter to complete your vacation quarter request. Failure to submit this form prior to the end of the current  quarter will make you no longer eligible to apply for a vacation quarter. After our office has received your completed form, you  will receive an official email notification confirming your vacation quarter approval.    STUDENT INFORMATION Last (Family) Name:  First (Given) Name:     ID Number:  Current Program:      Phone Number:  Email:      SEVIS ID:  I-20 End Date:        Returning Program (the program you will attend after vacation quarter):     I AM REQUESTING VACATION QUARTER FOR (list the TERM YEAR – ie: Summer 2020):     VACATION QUARTER INFORMATION:   The non-refundable $200 tuition deposit will apply towards tuition payment for my returning program after my vacation quarter.   If I am academically dismissed from my current program, before my vacation quarter begins, my vacation quarter approval will  be voided and my I-20 may be terminated.   If I decide to transfer to another school during my vacation quarter, I forfeit my vacation quarter.  o I am required to notify ImmigrationOfficials@ce.uci.edu within 60-days of my current program end date to complete  the transfer-out process, otherwise I am not eligible to request to transfer-out.   If I decide not to continue my studies during or after my vacation quarter, I forfeit my vacation quarter.  o I am required to notify ImmigrationOfficials@ce.uci.edu to cancel my vacation quarter and depart from the U.S. within  60-days of my current program end date.   I am responsible for paying my returning quarter fees by the payment deadlines. If I am SACM or Embassy sponsored, I must  submit an updated financial guarantee letter if my current one has expired.   After my vacation quarter has ended, I must return on time to my studies at UCI Division of Continuing Education to maintain  active F-1 status.    STUDENT SIGNATURE  I am officially requesting vacation quarter and I have read and understand all of the information listed above. After approval for a vacation  quarter, I understand it is my responsibility to abide by all requirements set forth by UCI Division of Continuing Education.                                      Student’s Signature         Date    For Office Use Only    IMMIGRATION     Vacation Quarter Checklist    ISA Initials/ Date: ________________________  mailto:ImmigrationOfficials@ce.uci.edu mailto:ImmigrationOfficials@ce.uci.edu mailto:ImmigrationOfficials@ce.uci.edu amanda zhao Zhao amanda zhao 970198 amanda zhao Data science amanda zhao 9496789318 amanda zhao Amandazhao66@gmail.com amanda zhao 1653-0038 amanda zhao 09-04-2020 amanda zhao Innovation management & Entrepreneurship amanda zhao summer 2020 amanda zhao Jianghong zhao amanda zhao 05-27-2020 amanda zhao Jianghong

Ishvina · Accepted Answer

{
 "cells": [
  {
   "cell_type": "code",
   "execution_count": 1,
   "metadata": {},
   "outputs": [],
   "source": [
    "import pandas as pd"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 2,
   "metadata": {},
   "outputs": [],
   "source": [
    "import numpy as np"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 3,
   "metadata": {},
   "outputs": [],
   "source": [
    "#installing the libraries"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 4,
   "metadata": {},
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "Requirement already satisfied: geoplotlib in c:\users\dell\anaconda3\lib\site-packages (0.3.2)
"
     ]
    }
   ],
   "source": [
    "!pip install geoplotlib"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 5,
   "metadata": {},
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "Requirement already satisfied: pyglet in c:\users\dell\anaconda3\lib\site-packages (1.5.5)
"
     ]
    }
   ],
   "source": [
    "!pip install pyglet"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 6,
   "metadata": {},
   "outputs": [],
   "source": [
    "import geoplotlib"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 7,
   "metadata": {},
   "outputs": [],
   "source": [
    "#to display the maps in the jupyter notebook
",
    "from IPython.display import Image"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 8,
   "metadata": {},
   "outputs": [],
   "source": [
    "#reading the data"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 9,
   "metadata": {},
   "outputs": [
    {
     "name": "stderr",
     "output_type": "stream",
     "text": [
      "C:\Users\DELL\Anaconda3\lib\site-packages\IPython\core\interactiveshell.py:3058: DtypeWarning: Columns (3) have mixed types. Specify dtype option on import or set low_memory=False.
",
      "  interactivity=interactivity, compiler=compiler, result=result)
"
     ]
    }
   ],
   "source": [
    "#data is saved at the same location as the current file location
",
    "
",
    "df = pd.read_csv("world_cities_pop.csv")"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 10,
   "metadata": {},
   "outputs": [],
   "source": [
    "#because we have mixed datatypes  , so we force 
",
    "#python to consider it as a character because of the mixed data types
",
    "#reading data again with modifications
",
    "df = pd.read_csv("world_cities_pop.csv" , dtype = {'Region' : np.str})"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 11,
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "array(['06', '07', '04', '05', '02', '03', '08', '01', '29', '10', '24',
",
       "       '09', '35', '42', '11', '27', '39', '28', '26', '17', '41', '33',
",
       "       '30', '13', '40', '18', '23', '19', '37', '14', '32', '36', '31',
",
       "       '34', '38', nan, '00', '51', '46', '49', '43', '47', '44', '45',
",
       "       '50', '48', '15', '12', '20', '16', '21', '22', '62', '68', '65',
",
       "       '64', '66', '58', '60', '61', '71', '57'], dtype=object)"
      ]
     },
     "execution_count": 11,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "pd.unique(df['Region'])[:62]"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 12,
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "(3173958, 7)"
      ]
     },
     "execution_count": 12,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "#size of the orignal dataset
",
    "df.shape"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 13,
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "Country        object
",
       "City           object
",
       "AccentCity     object
",
       "Region         object
",
       "Population    float64
",
       "Latitude      float64
",
       "Longitude     float64
",
       "dtype: object"
      ]
     },
     "execution_count": 13,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "#checking the data types
",
    "df.dtypes"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 14,
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/html": [
       "
",
       "
",
       "    .dataframe tbody tr th:only-of-type {
",
       "        vertical-align: middle;
",
       "    }
",
       "
",
       "    .dataframe tbody tr th {
",
       "        vertical-align: top;
",
       "    }
",
       "
",
       "    .dataframe thead th {
",
       "        text-align: right;
",
       "    }
",
       "
",
       "
",
       "  
",
       "    
",
       "      
",
       "      Country
",
       "      City
",
       "      AccentCity
",
       "      Region
",
       "      Population
",
       "      Latitude
",
       "      Longitude
",
       "    
",
       "  
",
       "  
",
       "    
",
       "      0
",
       "      ad
",
       "      aixas
",
       "      Aixàs
",
       "      06
",
       "      NaN
",
       "      42.483333
",
       "      1.466667
",
       "    
",
       "    
",
       "      1
",
       "      ad
",
       "      aixirivali
",
       "      Aixirivali
",
       "      06
",
       "      NaN
",
       "      42.466667
",
       "      1.500000
",
       "    
",
       "    
",
       "      2
",
       "      ad
",
       "      aixirivall
",
       "      Aixirivall
",
       "      06
",
       "      NaN
",
       "      42.466667
",
       "      1.500000
",
       "    
",
       "    
",
       "      3
",
       "      ad
",
       "      aixirvall
",
       "      Aixirvall
",
       "      06
",
       "      NaN
",
       "      42.466667
",
       "      1.500000
",
       "    
",
       "    
",
       "      4
",
       "      ad
",
       "      aixovall
",
       "      Aixovall
",
       "      06
",
       "      NaN
",
       "      42.466667
",
       "      1.483333
",
       "    
",
       "    
",
       "      5
",
       "      ad
",
       "      andorra
",
       "      Andorra
",
       "      07
",
       "      NaN
",
       "      42.500000
",
       "      1.516667
",
       "    
",
       "  
",
       "
",
       ""
      ],
      "text/plain": [
       "  Country        City  AccentCity Region  Population   Latitude  Longitude
",
       "0      ad       aixas       Aixàs     06         NaN  42.483333   1.466667
",
       "1      ad  aixirivali  Aixirivali     06         NaN  42.466667   1.500000
",
       "2      ad  aixirivall  Aixirivall     06         NaN  42.466667   1.500000
",
       "3      ad   aixirvall   Aixirvall     06         NaN  42.466667   1.500000
",
       "4      ad    aixovall    Aixovall     06         NaN  42.466667   1.483333
",
       "5      ad     andorra     Andorra     07         NaN  42.500000   1.516667"
      ]
     },
     "execution_count": 14,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "#viewing the first 6 rows of the dataframe
",
    "df[0:6]"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 15,
   "metadata": {},
   "outputs": [],
   "source": [
    "#removing one of the two columns - City / AccentCity because
",
    "#they are the same
",
    "df = df.drop(['AccentCity'], axis = 1)"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 16,
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/html": [
       "
",
       "
",
       "    .dataframe tbody tr th:only-of-type {
",
       "        vertical-align: middle;
",
       "    }
",
       "
",
       "    .dataframe tbody tr th {
",
       "        vertical-align: top;
",
       "    }
",
       "
",
       "    .dataframe thead th {
",
       "        text-align: right;
",
       "    }
",
       "
",
       "
",
       "  
",
       "    
",
       "      
",
       "      Country
",
       "      City
",
       "      Region
",
       "      Population
",
       "      Latitude
",
       "      Longitude
",
       "    
",
       "  
",
       "  
",
       "    
",
       "      0
",
       "      ad
",
       "      aixas
",
       "      06
",
       "      NaN
",
       "      42.483333
",
       "      1.466667
",
       "    
",
       "    
",
       "      1
",
       "      ad
",
       "      aixirivali
",
       "      06
",
       "      NaN
",
       "      42.466667
",
       "      1.500000
",
       "    
",
       "    
",
       "      2
",
       "      ad
",
       "      aixirivall
",
       "      06
",
       "      NaN
",
       "      42.466667
",
       "      1.500000
",
       "    
",
       "    
",
       "      3
",
       "      ad
",
       "      aixirvall
",
       "      06
",
       "      NaN
",
       "      42.466667
",
       "      1.500000
",
       "    
",
       "  
",
       "
",
       ""
      ],
      "text/plain": [
       "  Country        City Region  Population   Latitude  Longitude
",
       "0      ad       aixas     06         NaN  42.483333   1.466667
",
       "1      ad  aixirivali     06         NaN  42.466667   1.500000
",
       "2      ad  aixirivall     06         NaN  42.466667   1.500000
",
       "3      ad   aixirvall     06         NaN  42.466667   1.500000"
      ]
     },
     "execution_count": 16,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "#viewing the dataset again
",
    "df[0:4]"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "# a)Remove all countries with Population as NaN. Call your DataFrame as df."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 18,
   "metadata": {},
   "outputs": [],
   "source": [
    "#removing countries with population as NaN
",
    "#dataframe is named as df
",
    "df = df.dropna(subset = ['Population'])"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 19,
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "(47980, 6)"
      ]
     },
     "execution_count": 19,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "#Use df.shape to find the number of countries remaining in your DataFrame
",
    "df.shape"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 20,
   "metadata": {},
   "outputs": [],
   "source": [
    "#we see that number of countries has been reduced to 47980 from 3173958"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 21,
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/html": [
       "
",
       "
",
       "    .dataframe tbody tr th:only-of-type {
",
       "        vertical-align: middle;
",
       "    }
",
       "
",
       "    .dataframe tbody tr th {
",
       "        vertical-align: top;
",
       "    }
",
       "
",
       "    .dataframe thead th {
",
       "        text-align: right;
",
       "    }
",
       "
",
       "
",
       "  
",
       "    
",
       "      
",
       "      Country
",
       "      City
",
       "      Region
",
       "      Population
",
       "      Latitude
",
       "      Longitude
",
       "    
",
       "  
",
       "  
",
       "    
",
       "      6
",
       "      ad
",
       "      andorra la vella
",
       "      07
",
       "      20430.0
",
       "      42.500000
",
       "      1.516667
",
       "    
",
       "    
",
       "      20
",
       "      ad
",
       "      canillo
",
       "      02
",
       "      3292.0
",
       "      42.566667
",
       "      1.600000
",
       "    
",
       "    
",
       "      32
",
       "      ad
",
       "      encamp
",
       "      03
",
       "      11224.0
",
       "      42.533333
",
       "      1.583333
",
       "    
",
       "    
",
       "      49
",
       "      ad
",
       "      la massana
",
       "      04
",
       "      7211.0
",
       "      42.550000
",
       "      1.516667
",
       "    
",
       "  
",
       "
",
       ""
      ],
      "text/plain": [
       "   Country              City Region  Population   Latitude  Longitude
",
       "6       ad  andorra la vella     07     20430.0  42.500000   1.516667
",
       "20      ad           canillo     02      3292.0  42.566667   1.600000
",
       "32      ad            encamp     03     11224.0  42.533333   1.583333
",
       "49      ad        la massana     04      7211.0  42.550000   1.516667"
      ]
     },
     "execution_count": 21,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "#Viewing the first 4 records after removing Population with NaN
",
    "df[0:4]"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "# b) Select a country (it can be the name of your country or any other of your choice).For this country, find the number of cities and the population.
"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 22,
   "metadata": {},
   "outputs": [],
   "source": [
    "#country to uppercase
",
    "df['Country'] = df['Country'].str.upper()"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 23,
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/html": [
       "
",
       "
",
       "    .dataframe tbody tr th:only-of-type {
",
       "        vertical-align: middle;
",
       "    }
",
       "
",
       "    .dataframe tbody tr th {
",
       "        vertical-align: top;
",
       "    }
",
       "
",
       "    .dataframe thead th {
",
       "        text-align: right;
",
       "    }
",
       "
",
       "
",
       "  
",
       "    
",
       "      
",
       "      Country
",
       "      City
",
       "      Region
",
       "      Population
",
       "      Latitude
",
       "      Longitude
",
       "    
",
       "  
",
       "  
",
       "    
",
       "      2323330
",
       "      RO
",
       "      edera de jos
",
       "      16
",
       "      3925.0
",
       "      45.033333
",
       "      25.633333
",
       "    
",
       "    
",
       "      2106084
",
       "      PH
",
       "      bancasi
",
       "      02
",
       "      3607.0
",
       "      8.966667
",
       "      125.466667
",
       "    
",
       "    
",
       "      2115472
",
       "      PH
",
       "      clarin
",
       "      42
",
       "      7125.0
",
       "      8.202300
",
       "      123.858200
",
       "    
",
       "    
",
       "      157926
",
       "      AU
",
       "      nerang
",
       "      04
",
       "      17684.0
",
       "      -27.989410
",
       "      153.336334
",
       "    
",
       "    
",
       "      304229
",
       "      BR
",
       "      lajeado
",
       "      23
",
       "      65408.0
",
       "      -29.450000
",
       "      -51.966667
",
       "    
",
       "    
",
       "      2924582
",
       "      US
",
       "      steamboat springs
",
       "      CO
",
       "      9349.0
",
       "      40.485000
",
       "      -106.831111
",
       "    
",
       "  
",
       "
",
       ""
      ],
      "text/plain": [
       "        Country               City Region  Population   Latitude   Longitude
",
       "2323330      RO       edera de jos     16      3925.0  45.033333   25.633333
",
       "2106084      PH            bancasi     02      3607.0   8.966667  125.466667
",
       "2115472      PH             clarin     42      7125.0   8.202300  123.858200
",
       "157926       AU             nerang     04     17684.0 -27.989410  153.336334
",
       "304229       BR            lajeado     23     65408.0 -29.450000  -51.966667
",
       "2924582      US  steamboat springs     CO      9349.0  40.485000 -106.831111"
      ]
     },
     "execution_count": 23,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "df.sample(6)"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 24,
   "metadata": {},
   "outputs": [],
   "source": [
    "#creating a dataframe df2 to store data for country India
",
    "df2 = df[df.Country == 'IN']"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 25,
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "(2995, 6)"
      ]
     },
     "execution_count": 25,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "df2.shape"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 26,
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "259227307.0"
      ]
     },
     "execution_count": 26,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "#population of the country selected (India)
",
    "df2['Population'].sum()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "we see the total population of India is 259227307"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 27,
   "metadata": {},
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "2995
"
     ]
    }
   ],
   "source": [
    "#number of cities in country selected (India)
",
    "print(df2['City'].count())"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 28,
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "City
",
       "abhayapuri     15803.0
",
       "abiramam        6837.0
",
       "abohar        130613.0
",
       "abu road       50266.0
",
       "achalpur      111287.0
",
       "                ...   
",
       "zahirabad      46509.0
",
       "zaidpur        33400.0
",
       "zamania        32011.0
",
       "ziro           13895.0
",
       "zunheboto      29500.0
",
       "Name: Population, Length: 2899, dtype: float64"
      ]
     },
     "execution_count": 28,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "#For this country, the population per city is as follows
",
    "df2.groupby('City')['Population'].agg('sum')"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "# c)Create a dot plot showing the cities in this country. No need to adjust the bounding box. "
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 29,
   "metadata": {},
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "Requirement already satisfied: pycountry in c:\users\dell\anaconda3\lib\site-packages (19.8.18)
"
     ]
    }
   ],
   "source": [
    "!pip install pycountry"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 30,
   "metadata": {},
   "outputs": [],
   "source": [
    "import pycountry"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 31,
   "metadata": {},
   "outputs": [],
   "source": [
    "india = pycountry.countries.get(alpha_2 = 'IN') "
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 32,
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "Country(alpha_2='IN', alpha_3='IND', name='India', numeric='356', official_name='Republic of India')"
      ]
     },
     "execution_count": 32,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "india"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 33,
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "'India'"
      ]
     },
     "execution_count": 33,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "india.name"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 34,
   "metadata": {},
   "outputs": [],
   "source": [
    "#changing it to labels lat and long because geoplotlib requires to 
",
    "#identify coordinates by labels
",
    "
",
    "df2 = df2.rename(columns = {'Latitude' : 'lat' , 
",
    "                          'Longitude' : 'lon'})"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 35,
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/html": [
       "
",
       "
",
       "    .dataframe tbody tr th:only-of-type {
",
       "        vertical-align: middle;
",
       "    }
",
       "
",
       "    .dataframe tbody tr th {
",
       "        vertical-align: top;
",
       "    }
",
       "
",
       "    .dataframe thead th {
",
       "        text-align: right;
",
       "    }
",
       "
",
       "
",
       "  
",
       "    
",
       "      
",
       "      Country
",
       "      City
",
       "      Region
",
       "      Population
",
       "      lat
",
       "      lon
",
       "    
",
       "  
",
       "  
",
       "    
",
       "      1321980
",
       "      IN
",
       "      adoni
",
       "      02
",
       "      163649.0
",
       "      15.633333
",
       "      77.283333
",
       "    
",
       "    
",
       "      1357885
",
       "      IN
",
       "      sundarnagar
",
       "      11
",
       "      25340.0
",
       "      31.533333
",
       "      76.883333
",
       "    
",
       "    
",
       "      1357819
",
       "      IN
",
       "      sultanpur
",
       "      35
",
       "      9420.0
",
       "      23.150000
",
       "      77.933333
",
       "    
",
       "  
",
       "
",
       ""
      ],
      "text/plain": [
       "        Country         City Region  Population        lat        lon
",
       "1321980      IN        adoni     02    163649.0  15.633333  77.283333
",
       "1357885      IN  sundarnagar     11     25340.0  31.533333  76.883333
",
       "1357819      IN    sultanpur     35      9420.0  23.150000  77.933333"
      ]
     },
     "execution_count": 35,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "df2.sample(3)"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 36,
   "metadata": {},
   "outputs": [],
   "source": [
    "geoplotlib.dot(df2 , color = 'b')
",
    "geoplotlib.show()"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 52,
   "metadata": {},
   "outputs": [],
   "source": [
    "n = 400"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 53,
   "metadata": {},
   "outputs": [
    {
     "data": {
      "image/png":

Day 8: Final Assignment Submit Assignment Instructions Use dataset from day 7 -world_cities_pop.csv a) Remove all countries with Population as NaN. Call your DataFrame as df. Use df.shape to find the...

Answer To: Day 8: Final Assignment Submit Assignment Instructions Use dataset from day 7 -world_cities_pop.csv...

Answer To This Question Is Available To Download

Related Questions & Answers

Submit New Assignment

	Country	City	AccentCity	Region	Population	Latitude	Longitude
0	ad	aixas	Aixàs	06	NaN	42.483333	1.466667
1	ad	aixirivali	Aixirivali	06	NaN	42.466667	1.500000
2	ad	aixirivall	Aixirivall	06	NaN	42.466667	1.500000
3	ad	aixirvall	Aixirvall	06	NaN	42.466667	1.500000
4	ad	aixovall	Aixovall	06	NaN	42.466667	1.483333
5	ad	andorra	Andorra	07	NaN	42.500000	1.516667

	Country	City	Region	Population	Latitude	Longitude
6	ad	andorra la vella	07	20430.0	42.500000	1.516667
20	ad	canillo	02	3292.0	42.566667	1.600000
32	ad	encamp	03	11224.0	42.533333	1.583333
49	ad	la massana	04	7211.0	42.550000	1.516667

	Country	City	Region	Population	Latitude	Longitude
2323330	RO	edera de jos	16	3925.0	45.033333	25.633333
2106084	PH	bancasi	02	3607.0	8.966667	125.466667
2115472	PH	clarin	42	7125.0	8.202300	123.858200
157926	AU	nerang	04	17684.0	-27.989410	153.336334
304229	BR	lajeado	23	65408.0	-29.450000	-51.966667
2924582	US	steamboat springs	CO	9349.0	40.485000	-106.831111

	Country	City	Region	Population	lat	lon
1321980	IN	adoni	02	163649.0	15.633333	77.283333
1357885	IN	sundarnagar	11	25340.0	31.533333	76.883333
1357819	IN	sultanpur	35	9420.0	23.150000	77.933333