Methods
The goal of the AVA Digitizing Project’s methods is to replicate the official boundary, adhering as closely as possible to the approved maps and official boundary description. Deviations are only made when necessary. Outside data sources should only be used as a last resort, when no other option is available. All data sources used to create any boundary are documented. All input data sources are public domain or open-licensed so we can maintain the open license of our AVA dataset. Contributors agree to uphold these standards.
Teamwork
Given the number and geographic distribution of contributors to this dataset, we have developed methods to help us work together.
Working with the Issues List
Start with the Issues list to find a task to work on.
In the Issues section of the AVA project GitHub page, you will find an Issue for each AVA. All AVAs in need of digitizing have an issue open on the GitHub repository. The issues are closed for AVAs as they are finished. AVAs are given labels to indicate their status. Open issues labeled as “In Progress” are currently being digitized by a contributor. Choose an AVA that does not have an “In Progress” label. Labels for partner organizations such as “UC Davis” or “UC Santa Barbara” indicate that the project coordinators expect these boundaries to be completed by the organization indicated.
In the text of each issue, you will find details of the task to be completed.
Before starting work on any issue, comment on the issue to say that you are going to start working on it. This will notify the project administrators so they can change the status to “In Progress”. Add further comments if any problems or questions arise about that AVA. The issue for Arroyo Seco is a good example of the kinds of challenges and questions to track on an issue for an AVA.
Digitizing Methods
Common Resources:
Boundary Descriptions & Government Documents:
- Hein Online Federal Register Library (UC Davis affiliates may need to log in to the campus VPN to get access)
- GovInfo - free online source for government documents
- Wine Institute’s list of documents establishing the original AVA boundaries
- Alcohol and Tobacco Tax and Trade Bureau’s (TTB) list of currently established AVAs
- Electronic Code of Federal Regulations (ECFR)’s Approved American Viticultural Areas - has the boundary descriptions, but not the full document
Maps:
Digitize New Boundaries
New AVAs and boundary updates for existing AVAs are added to the Federal Register. Our web scraping code checks this document regularly, and each new AVA or boundary revision is given its own issue. If you see one of these issues not yet assigned to a contributor or partner organization, and not labeled “In Progress”, you are encouraged to digitize it! Comment on the issue that you would like to work on and one of our team members will assign it to you. You may then digitize the new AVA following the methods described below.
Review the available Federal Register document for your AVA of interest
- Search the Hein Online Federal Register Library or GovInfo for the name of your chosen AVA. Many of the documents you will need are available there.
- Check the TTB’s list of established AVAs (https://www.ttb.gov/wine/us_by_ava.shtml) for a citation of the AVA’s revision history, found in brackets at the bottom of the page.
- Make note of the approved Maps and the boundary description.
Download the Approved Maps
Note: Google Chrome has occasionally had difficulty downloading files from topoView, resulting in an error message saying “Failed - Network error”. If this happens, try another browser. Microsoft Edge has worked well in the past.
- Go to the USGS National Geologic Map Database’s topoView: https://ngmdb.usgs.gov/maps/TopoView/viewer
- On the right side of the map in the side panel, select the scale of the Approved Map you are looking for.
- Zoom into the general region of the AVA, and the names of the maps will appear inside the index bounding boxes. Alternatively, you can search for the name of the approved map with the Map Name search at the top of the side panel.
- Select the map of interest by clicking inside the box.
In the lower section of the side panel, a dialog box should appear with details about the map you selected and the maps that are available. Identify the option with the date and edition that corresponds to the Approved Map listed in the Federal Register document. Note that TopoView lists the date the original map was created and the year it was printed, NOT the revision year. If you want to be sure you’re getting the exact map, you can preview the map by downloading the .jpg or .pdf option and then look near the bottom of the map for the revision information.
i. For example, one of the Approved Maps for the Coombsville AVA is listed as “Napa Quadrangle, California-Napa Co., 1951, Photorevised 1980”. For this map, select the Napa map with a date of 1951 and edition of 1980.
ii. If the specific year for the Approved Map is not available, first check the USGS store. If it is not available there, you may need to download a map (or multiple smaller maps) that is of a similar age in the same area from TopoView.
- Click the GeoTIFF download option for the map edition closest to the Approved Map listed in the federal documents.
- Save the file to your local computer. The file saved will be a zipped folder containing GeoTIFF files.
- Navigate to the folder where you saved the file. Right-click the file and select “Extract All”. Browse to the folder you would like to unzip the folder to, then click the “Extract” button. The new unzipped folder should contain 4 files.
- Repeat all the substeps for Step 2 for each of the Approved Maps needed for your AVA before you move on to the digitizing process in Step 3.
- Make note of the maps you use to digitize the boundary. You’ll need to enter the list of maps in the used_maps column of the attribute table.
Further assistance in interpreting which scale of map to use can be found in the USGS’ Map Scales Report.
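To make the date and edition matching described above concrete, here is a minimal Python sketch that pulls the years out of an Approved Map citation. The helper name `parse_approved_map` and the rule "first year is the map date, last year is the edition" are assumptions for illustration, based only on the Coombsville example; always confirm against the revision information printed on the map itself.

```python
import re


def parse_approved_map(citation):
    """Return (date_year, edition_year) from an Approved Map citation.

    Hypothetical helper: assumes the first four-digit year in the citation
    is the original map date and the last one is the edition (revision) year.
    """
    years = [int(y) for y in re.findall(r"\b(1[89]\d{2}|20\d{2})\b", citation)]
    if not years:
        raise ValueError("no year found in citation: %r" % citation)
    return years[0], years[-1]


citation = "Napa Quadrangle, California-Napa Co., 1951, Photorevised 1980"
date_year, edition_year = parse_approved_map(citation)
print("TopoView date:", date_year, "edition:", edition_year)
```

A citation with only one year returns that year for both values, which should be treated as a prompt to check the map by eye rather than as a confirmed match.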
Get the most recent project files from GitHub
We’ll describe how to do this with the GitHub for Desktop tool, but you may use the tool of your choice. We’ll also assume you’ve already set up your GitHub account and the GitHub for Desktop program:
- Fork the AVA repository. See GitHub’s forking documentation for details about how to fork a repository and work with it in GitHub Desktop.
- Open GitHub Desktop
- Select the forked AVA repository on the left side of the window.
- In your computer’s file navigation system, navigate to your GitHub folder and open the AVAs folder. The tbd (“to be determined”) folder contains AVAs that haven’t been completed; find the AVA you want to work on and move the .geojson file to the avas folder. Note that the tbd folder is not visible in the GitHub repository when it is empty and there are currently no new boundaries to digitize.
Set Up Your Project File:
- Make sure you’ve moved the geojson file for your chosen AVA (for example, coombsville.geojson) from the tbd folder to the avas folder.
- Open QGIS. Note: we work with QGIS because it can write the standard geojson specification. ESRI tools do not. Please do not submit data in ESRI’s geojson format.
- Load data:
- avas.geojson - for reference and snapping to shared boundaries (vector data)
- Your AVA geojson file - the file you will edit; example: coombsville.geojson (vector data)
- Approved Maps files you downloaded from USGS’ TopoView or Store (raster data)
- Set Project Projection
- Open the Project Properties (Project menu → Project Properties)
- Set the Coordinate Reference System to NAD83 (EPSG:4269)
- Set Snapping
- In the Layers Panel (Table of Contents), select the avas.geojson layer.
- Open the Snapping Options (Settings menu → Snapping Options)
- Set Layer Selection to “Current Layer” (the avas.geojson layer you selected earlier in the Layers Panel), Snap To “To Vertex and Segment”, Tolerance to 20 Pixels (adjust this as needed), and finally check the boxes for “Enable topological editing” and “Enable snapping on intersection”.
Digitizing the Boundary:
Each AVA in the tbd folder has a triangle template, generated automatically and placed as a placeholder at the centroid of the county that the AVA resides in. You’ll replace this triangle with the new boundary.
- Toggle Editing on for the new AVA GeoJSON file.
- You have two options for digitizing the boundary:
- Add a new polygon with the Add Feature tool. Digitize the boundary adding vertices according to the Federal Register Boundary description at a scale sufficient to identify the features needed from the topoquads. You can switch between the Add Feature tool and navigating tools as needed. Right-click to finish. Finally, delete the triangle template.
- Add nodes to the triangle and move them to the places described in the boundary description.
- Some things to keep in mind:
- As you near the edge of a map, you may need to turn off or reorder some of the maps to adjust for the white edges of the scanned maps that obscure the maps underneath.
- If the AVA you are digitizing is very close to another that has already been digitized, read the boundary descriptions for both and determine if they share a boundary. If they share a boundary, refer to the Border Matching Methods section below.
- Pay close attention to the wording describing the boundaries. Some require a straight line between points and use language like “the next point is a straight line to the peak at 1700 feet”. Others require you to digitize along a feature and use language like “follow along Cold Creek until it crosses J Street”.
- When you are done digitizing, fill out the attribute data based on the text of the official Federal Register documents. The definitions of the attributes are found in the Data Page.
- Save Layer Edits.
- Toggle Editing off.
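After saving, it can be worth a quick structural check of the file you edited. The following is a minimal Python sketch, not part of the project's tooling, that checks each polygon ring against two basic rules from the GeoJSON specification (RFC 7946): a ring has at least four positions, and its first and last positions are identical. The function names and sample coordinates are made up for illustration.

```python
import json


def ring_problems(ring):
    """Return a list of problems found in one linear ring."""
    problems = []
    if len(ring) < 4:
        problems.append("ring has fewer than 4 positions")
    if ring and ring[0] != ring[-1]:
        problems.append("ring is not closed (first and last positions differ)")
    return problems


def check_feature_collection(collection):
    """Map feature index -> list of ring problems, for features with problems."""
    report = {}
    for index, feature in enumerate(collection.get("features", [])):
        geometry = feature.get("geometry") or {}
        if geometry.get("type") == "Polygon":
            polygons = [geometry["coordinates"]]
        elif geometry.get("type") == "MultiPolygon":
            polygons = geometry["coordinates"]
        else:
            polygons = []
        issues = []
        for polygon in polygons:
            for ring in polygon:
                issues.extend(ring_problems(ring))
        if issues:
            report[index] = issues
    return report


# Made-up example: the first feature is a closed ring, the second is left open.
example = json.loads("""
{"type": "FeatureCollection", "features": [
  {"type": "Feature", "properties": {}, "geometry": {"type": "Polygon",
    "coordinates": [[[0, 0], [1, 0], [1, 1], [0, 0]]]}},
  {"type": "Feature", "properties": {}, "geometry": {"type": "Polygon",
    "coordinates": [[[0, 0], [1, 0], [1, 1], [0, 1]]]}}
]}
""")
print(check_feature_collection(example))
```

To check a real file, replace the inline example with `json.load` on the opened .geojson file, for example your copy of coombsville.geojson.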
Historical Boundary Methods
The TTB has a process by which AVA boundaries can be amended. Our dataset captures these boundaries because they can help understand the history of a single AVA or the concept of AVAs over time. As these boundary revisions are approved, this is how we digitize them and store the data:
We have one .geojson file for each named AVA. Within the file for an AVA, we create a polygon for each official boundary revision as recorded by the TTB and described in the CFR documents. We use the attribute table to indicate when each polygon was in use. Attributes recorded for each polygon indicate:
- created: the date the AVA officially began
- removed: the date the AVA was terminated
- valid_start: the date the boundary officially began
- valid_end: the last date the boundary was in use (typically the day before the next revision took effect)
- ava_id: for non-current boundaries, append an underscore and the ‘valid_start’ date in YYYYMMDD format. Example: redwood_valley_19970221
- petitioner and cfr_author: the boundary update will most likely have a different petitioner and author, which can be found in the CFR document
- cfr_revision_history: the most up-to-date version can be found on the e-CFR website, underneath the boundary instruction. Example: [T.D. ATF-128, 48 FR 14375, Apr. 4, 1983, as amended by T.D. TTB-163, 85 FR 60361, Sept. 25, 2020]. Each revision should have the CFR history up to that revision (the first version’s history will end after the first date)
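As an illustration of the ava_id and valid_end conventions, here is a minimal Python sketch. The helper names are hypothetical, and the second date is made up purely to show the "day before the next revision" arithmetic; it is not taken from any Federal Register document.

```python
from datetime import date, timedelta


def historical_ava_id(base_id, valid_start):
    """Append an underscore and the valid_start date as YYYYMMDD."""
    return "%s_%s" % (base_id, valid_start.strftime("%Y%m%d"))


def valid_end_before(next_valid_start):
    """The last day a boundary was in use: the day before the next revision."""
    return next_valid_start - timedelta(days=1)


# The example ID from the attribute list.
print(historical_ava_id("redwood_valley", date(1997, 2, 21)))

# Made-up next-revision date, only to show the date arithmetic.
print(valid_end_before(date(2001, 1, 1)).isoformat())
```

Only non-current boundaries get the date suffix; the current boundary keeps the plain ava_id.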
Digitizing Historical Boundaries:
Here is one option for adding historical boundaries to an existing AVA boundary.
- First, load the AVA into QGIS and use the boundary description field to figure out which revision it is. Load the approved maps for this revision and CHECK THE BOUNDARY NOW, as any edits made later on will have to be made to all copies.
- Find the AVA in the avas folder and make as many copies as there are revisions. Rename the files to something like [AVA]1.geojson, [AVA]2.geojson, … , [AVA]n.geojson
- Find the Federal Register documents for each revision. The easiest way is through HeinOnline (requires UCD Library VPN): enter the volume and page from the revision history strings given in the history issue
- For each revision:
- Make updates to the boundary following the methods described in “Digitizing the Boundary” above
- Edit the attribute tables of both layers (fields described above)
- When editing is complete, use the merge tool to combine layers:
- Open with Vector → Data Management Tools → Merge Vector Layers
- Select [AVA]1.geojson, [AVA]2.geojson, … , [AVA]n.geojson as input layers
- Under Merged, click the ellipsis (…) and select “Save to File”, then navigate to the avas folder and save as [AVA].geojson
- Once the merge is complete, double-check that the attribute table for the new layer contains tabs for the original and the update
- The merge will have added extra fields at the bottom; delete these using the Delete Field tool in the attribute table menu (or Ctrl+L)
- Delete [AVA]1.geojson, [AVA]2.geojson, … , [AVA]n.geojson from the avas folder
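For readers who prefer to script the merge, the sketch below shows one possible alternative to the QGIS Merge Vector Layers tool using only Python's standard library. It is an assumption-laden illustration, not the project's method: the function name is hypothetical, the sample properties are made up, and it simply concatenates the features of each revision's FeatureCollection without changing their attributes.

```python
import json


def merge_feature_collections(collections):
    """Concatenate the features of several GeoJSON FeatureCollections."""
    merged = {"type": "FeatureCollection", "features": []}
    for collection in collections:
        if collection.get("type") != "FeatureCollection":
            raise ValueError("expected a GeoJSON FeatureCollection")
        merged["features"].extend(collection["features"])
    return merged


# Made-up stand-ins for the contents of [AVA]1.geojson and [AVA]2.geojson.
revision_1 = {"type": "FeatureCollection", "features": [
    {"type": "Feature", "properties": {"ava_id": "example_19900101"},
     "geometry": None}]}
revision_2 = {"type": "FeatureCollection", "features": [
    {"type": "Feature", "properties": {"ava_id": "example"},
     "geometry": None}]}

merged = merge_feature_collections([revision_1, revision_2])
print(json.dumps([f["properties"]["ava_id"] for f in merged["features"]]))
```

With real files, each input would come from `json.load` and the result would be written with `json.dump`; open the merged file in QGIS afterward to confirm that every revision is present.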
Border Matching Methods
In the interest of keeping the dataset clean and organized, AVAs that share a partial boundary are matched along this boundary. In these cases, the border that is kept is the one with the higher level of detail, i.e., the one digitized from larger-scale maps. If the two AVAs have the same level of detail, priority goes to the more recent AVA. For AVAs with revisions that update maps but keep the same borders, all versions should be matched to the most recent map. Some exceptions can be made, but all border-matching decisions should be documented in the used_maps attribute of the edited AVA.
Border matching can be done manually with QGIS’s snapping tool, but for longer sections of boundary, it is best to copy and paste coordinates between the GeoJSON files. To locate the start and end points of the section of border, add vertices there that are far enough away from the rest of the AVA that the latitude or longitude is at least a full decimal degree away from any other point in the AVA (for example, if the latitudes of the AVA are between 34 and 35 degrees North, create a point at either 33 or 36). You can then search the GeoJSON file (Ctrl+F) for this coordinate; include the decimal point (“33.” or “36.”) or you will get a lot of matches. Do this for the start and end of the section, and you can then select every coordinate in between. Occasionally, the start of the polygon (the first coordinate in the GeoJSON) will be inside the section being matched, in which case the section will be split between the beginning and the end of the coordinate list, with the rest of the polygon in between.
Also be aware of the order of coordinates, or direction of winding. If two AVAs share a boundary but don’t overlap, then the matched borders run in opposite directions, and you will have to change the winding order of one of them using st_reverse in R before copying and pasting coordinates. Make sure to undo the reverse afterward: all AVAs should be wound counterclockwise (see rcode/check_winding_order.R).
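The winding-order idea can be sketched in a few lines of Python. This is a stand-in for illustration only, not the project's R script: it uses the planar shoelace formula on [longitude, latitude] pairs, where a positive signed area means the ring is wound counterclockwise. The function names and the sample square are made up.

```python
def signed_area(ring):
    """Planar signed area of a closed ring of [x, y] positions (shoelace formula)."""
    total = 0.0
    for start, end in zip(ring, ring[1:]):
        total += start[0] * end[1] - end[0] * start[1]
    return total / 2.0


def is_counterclockwise(ring):
    """True when the ring is wound counterclockwise (positive signed area)."""
    return signed_area(ring) > 0


def reversed_ring(ring):
    """Return a copy of the ring with its winding direction flipped."""
    return ring[::-1]


# Made-up unit square, listed counterclockwise and closed.
square = [[0, 0], [1, 0], [1, 1], [0, 1], [0, 0]]
print(is_counterclockwise(square))
print(is_counterclockwise(reversed_ring(square)))
```

Reversing a closed ring keeps it closed, since the shared first and last position simply swap places, so a section copied from a reversed neighbor can be pasted in the right direction and the neighbor restored afterward.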
Quality Control Methods
For quality control, each boundary is checked a second time against the official description and the approved maps. Having a second (or even third) person check each boundary ensures that the data matches the official description as closely as we can make it.
The process should proceed as follows:
- Pick a boundary from the “Quality Control” issues to check.
- Comment on the issue for that boundary that you are reviewing that boundary.
- Load the boundary’s .geojson file into QGIS, along with the maps listed in its Used Maps, downloaded from the USGS National Geologic Map Database’s topoView: https://ngmdb.usgs.gov/maps/TopoView/viewer
- Check that
- the Used Maps were the best match for the official Approved Maps. If not, load up other maps. New options may now be available that weren’t at the time the boundary was initially digitized.
- the attribute table for the boundary file is complete and follows our standard. Note that State and County use | (pipe) as the separator.
- the boundary follows the description as best it can. Make changes as necessary by selecting the boundary and clicking “Zoom to layer(s)”. Toggle Editing and select the Vertex tool to edit specific vertices as needed. Remember to click Save as you make edits.
- Update the Used Maps column to add any additional resources you used. Whenever possible, please only use the Approved Maps.
- Post any questions or discussion points on the issue for the boundary you are reviewing.
- Submit a pull request for changes you made - please submit each boundary in its own pull request.
- If the boundary and attribute table don’t need edits, comment on the Issue for the boundary that you’ve checked it and it doesn’t need edits.
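Part of the attribute check above can be scripted. The sketch below is a hedged illustration in Python: the field names `name`, `state`, and `county` are assumptions (only `used_maps` and the pipe separator are named in this document; the authoritative field definitions are on the Data Page), and the sample values are made up.

```python
def split_pipe_list(value):
    """Split a pipe-separated attribute such as State or County into a list."""
    return [part.strip() for part in value.split("|") if part.strip()]


def missing_attributes(properties, required):
    """Return the required attribute names that are absent or empty."""
    return [name for name in required if properties.get(name) in (None, "")]


# Made-up attribute values; the field names are assumed, not the official schema.
properties = {
    "name": "Example AVA",
    "state": "CA",
    "county": "Napa|Sonoma",
    "used_maps": "",
}
required = ["name", "state", "county", "used_maps"]

print(missing_attributes(properties, required))
print(split_pipe_list(properties["county"]))
```

A check like this only flags empty fields; whether the values match the Federal Register text still needs a reviewer's eyes.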
Notes
If the name of the AVA changes, this should be reflected in the name field. The ava_id field should have the most recent AVA ID followed by the appended date if applicable.
If the approved maps change but the boundary stays the same for any length (follows the same features on different maps), match the older boundaries to the new one and make note of this in the used_maps field: “Border matched to YYYY-MM-DD revision border where applicable”
To see an example of boundaries with revisions, see the closed issues for any issue with a history label. These AVAs’ histories have been completed and are available in the avas folder.
Submitting your changes to the AVA GitHub Repository
- In GitHub for Desktop, you should see a list of changes you’ve made to the files. Fill in the Summary and Description fields at the bottom of the window and then click the Commit button. For details, see https://guides.github.com/activities/forking/#making-changes
- You should now see the “Push” button at the top of your GitHub for Desktop Screen. Click the “Push” button to send your changes to YOUR online repository.
- If you are ready to send finished data to the UC Davis repository, submit a pull request for your fork: https://help.github.com/articles/creating-a-pull-request-from-a-fork/
- The project administrators will review your changes and if your changes are accepted, project administrators will incorporate your changes and close the issue you were working on. If there are any problems or questions, the project administrators will contact you.