Network dataset

Upload the downloaded rental_agencies.csv.zip. Name the dataset rental_agencies. Here, the data is almost completely clean, and the main task is to geolocate the agencies; that is, for each row, retrieve the longitude and latitude corresponding to the address and zipcode.

To do this, create a Prepare recipe with the default output name rental_agencies_prepared and the following steps:

  1. Use the Simplify text processor on the adress column. This normalizes the text.

  2. Use the same processor with city as the input column.

  3. Using the Concatenate columns processor, concatenate the columns adress, zipcode, and city into the output column address_full, using a single space as a delimiter.

Run the recipe, updating the schema.

../../../_images/compute_rental_agencies_prepared.png

Now we want to use the Geocoder plugin to compute the latitude and longitude of each station’s address.

  1. From the + Recipe dropdown in the Flow, select Geocoder > Forward geocoding.

  2. Set rental_agencies_prepared as the input dataset and create rental_agencies_geocode as the output.

  3. Select address_full as the address column and select your choice of provider to retrieve the latitude and longitude of each rental agency. Specify geo as the prefix for the generated columns.

../../../_images/compute_rental_agencies_geocode.png