Due Date: March 22th
Instructor: Xiaozhong Sun (xs243@cornell.edu)
Lab TAs: Wenzheng Li (wl563) / Ishan Keskar (iuk3) / Aditi Parihar [ap973@cornell.edu]
Location: Sibley 305, Barclay Gibbs Jones Computer Lab
Total Points: 100
This week’s lab contains three goals:
The first goal is utilizing GIS to undertake census level data analysis. Reinforce what you have learn from previous lab. In this exercise, we will utilize GIS to explore and document issues of environmental justice in Massachusetts. You will required to answer the following questions:
The second goal is about geocoding using coordinates vs. addresses. You will download two datasets, one on crime complaints, one on business operations, from NYC Open Data and try to geocode them.
The third goal is learning integrate ArcGIS Pro and Google Earth Pro, you will learn how to create your own spatial data from KML file.
Now, compare and contrast the African American population living within 1 mile of landfills for the Boston metropolitan area. In defining the metropolitan Boston spatial extent, utilize the boundary of Regional Planning Agency.
Before you refer to the demo below. I think this is a great opportunity for you to think independently about how to do a complex GIS spatial analysis, using the techniques we learned earlier. Try to write down critical steps for you to finish this task. Solving complex spatial analysis will also be an essential ability for you to complete your final project independently.
Tasks:
To undertake this analysis, we will need several datasets:
Moreover, in order to determine whether the impact is disproportionate to a specific population, we will need some sort of benchmark with which to compare our findings and conduct our analysis. In this case, we will compare our findings to the state level data.
To undertake the analysis, we will calculate the percentage of the population living within 1 mile of a landfill that is African American. We will then compare this to the statewide African American population percentage.
As our population data is aggregated through enumerated units identified by the Census Bureau, and not according to distance from landfills, we will have to undertake some geoprocessing before completing our analysis.
Often state governments or public agencies will compile a range of data sets that are relevant to the state and make this data publicly available through a GIS repository. This is often helpful, particularly if we require data from different sources that must be integrated, as the data has already been formatted accordingly. For this lab, we will utilize data made available by the state of Massachusetts.
Now go to the Canvas, download and unzip the Lab 7 data. Find the folder “solidwaste”.
There are a number of datasets included in the zip files, but the one we are interested in is the Solid Waste Land Disposal point data layer (SW_LD_PT), compiled by the Massachusetts Department of Environmental Protection (DEP) to track the locations of land disposal of solid waste. Please be sure you read some of the information associated with each dataset, so you know how you are to utilize them in ArcGIS.
Open ArcGIS Pro. Add the Census Tracts. Note that they are already projected (State_Plane_Massachusetts_Mainland), however, please note that the map units being used are meters. It is only convention that associates State Plane with feet. If you open the attribute table, note that the area is recorded in square feet. Therefore, all units related to area should be square feet when you do the calculation and generate new field.
Add the appropriate SF1 database containing population and race data, i.e., CEN2010_CT_SF1_POP_RACE. Again, be sure you are utilizing tract level attribute information, not blocks or block groups).
Please join the SF1 database to the 2010 Census Tracts. Look at the attribute tables and/or metadata to determine the appropriate join fields.
Add the Solid Waste Land Disposal point layer (SW_LD_PT).
Open the Buffer tool (Analysis/Buffer). Set the input to the Landfill shapefile. Name the output and be sure to send it to your output folder. Set the buffer distance to 1 mile. Select ‘Dissolve all output features into a single feature’ as the Dissolve type. Click Run.
Clip the census tracts to the dissolved buffer layer. Set the output location and name. Click Run. You will recall that the clip function does not affect the original attribute information, therefore, you need to calculate the geometric using steps from previous lab. (if we were operating within a Geodatabase it would, but you shouldn’t save it to Geodatabase.).
Zoom in briefly, and using the identify tool, toggle on and off the original Census tract layer and your newly created clipped tract layer. Notice that the attributes for the clipped census tracts are the same as those original census tract (as a point of comparison, check out the AREA_SQFT field).
We will need to calculate the new areas of the clipped polygon census tracts.
Open the Attribute table of the clipped census tract layer.
Right-click on the field NewArea —> Calculate Geometry.
Based on the relative proportion of each census tract that lies within the 1-mile buffer we have delineated; we will now estimate the population that is African American (use the ‘pop_black’ variable for the African American totals).
Right-click on the field ‘Popblk_cl’ > Calculate Field (Note: different from Calculate Geometry). Make sure you have started editing.
Note: This spatial analysis is not perfect. We basically assume that the population within the census tract is evenly distributed, which is not entirely true. However, suppose the buffer of the landfills covers a large portion of the census tract. In that case, it is reasonable to infer that the majority of the population within the tract is affected.
Referring to the Demo, answer Questions 1-5 (55 points).
Recall the tasks’ requirements:
Note: for all the numbers you will answer below, DO NOT keep for more than 2 decimal places.
Question 1 (10 points):
Calculate the population percentage of African-American for the Boston Metro area.
Question 2 (5 points):
What formula did you use to calculate the population that is African American within the 1-mile buffer? Write down both the mathematical formula and Query formula you constructed in ArcGIS Pro.
Question 3 (10 points):
Now you should have the total African American population (count) living within 1-mile buffer of landfills. What percent of population living within 1 mile of landfills that is African American?
Question 4 (10 points):
How does this percentage compare to the statewide percentage of African American population? (10 points).
Question 5 (10 points):
What are your conclusions concerning the spatial distribution of African Americans in Massachusetts in relation to landfills?
Question 6 (10 points):
What are some of the weaknesses of this approach, particularly in terms of accuracy? (5 points)
How could you undertake a more accurate spatial analysis? (5 points)
In this part, you will geocode and map all the crime complaint data in NYC happened during June 2022 by their
Map
–>Add Data
–>
XY Point Data
function.
Generate a map layout, visualize the data by “LAW_CAT_CD” column (categorical) with readable legend settings (10 points).
Now commenting one whether using point data is helpful for describing the crime complaint in NYC. Can you find certain spatial patterns? What would you do to improve the interpretability using this data set? Write down the additional spatial data and ArcGIS operations you might use. (10 points).
Data
–>
Add Data
to ArcGIS Pro as a table.If you cannot use the geocoding credit provided by the ArcGIS Pro, use the new csv (come with full records of coordinates that you manully acquired), geocode the dataset with longitude and latitude data. Make a map layout with readable legends and layouts (10 points)
What if we want to generate a layer to show grocery stores in Ithaca, but we don’t have either x,y coordinate or location information for the grocery stores? Yes, we can use Google Earth to generate locations for grocery stores! This gives us another option for geocoding spatial data.
In the following part, we will explore how to export a location or data from Google Earth. This can be very helpful when you want to generate location data that is not available out there.
The target of this exercise is to draw a map shows the location of grocery stores in the City of Ithaca. However, currently we don’t have any location information about those stores.
We will need to use Google Earth Pro to generate the locations of grocery stores and then import the information into ArcGIS Pro.
We will use data format KML (Keyhole Markup Language) which is the data format that Google Earth uses. Then convert the KML file to shapefile. From there we can either make a map or conducting spatial analysis.
Open ArcGis Pro. Within the Conversion toolbox
, go to
KML toolset and KML To Layer
. This tool creates a file
geodatabase containing a feature class within a feature dataset.
The Dialogue box will ask you to navigate to the kml file and select an output location. Please note that it wants you to place the output within a folder, so be sure to select an actual folder as your output!
You can also specify an output data name. Once you have converted your kml to layer process, it should automatically be added to your map.
Check the attribute table of this point layer. You can see that it contains address information, as well as store name information which is exported from Google Earth.
The Output will be generated in the WGS84 coordinate system (default GCS of Google Earth). We are working on locations in NY, so change the projection of your map to State Plane New York Central.
Note: you will need to change the projections of both your data frame and the grocerystores layer. If you don’t change the projection of your data frame from WGS 1984 to state plane, even though the layer is changed to state plane, it will still be displayed in the projection of WGS 1984. (Again, remember projection on the fly from lab 3 🤓?)
To change the projection of your grocerystores layer, use
Project
. (Note: still remember the difference between
define projection and project?)
Want to get hands on more awesome open geo-data? Check out this tutorial on Youtube.
Create a layout of the grocery stores in the city of Ithaca. Add an appropriate basemap to show the context information. Label the grocery store names on your map. (10 points)
The END