Annual Water Yield¶

Summary¶

Hydropower accounts for twenty percent of worldwide energy production, most of which is generated by reservoir systems. InVEST estimates the annual average quantity and value of hydropower produced by reservoirs, and identifies how much water yield or value each part of the landscape contributes annually to hydropower production. The model has three components: water yield, water consumption, and hydropower valuation. The biophysical models do not consider surface – ground water interactions or the temporal dimension of water supply. The valuation model assumes that energy pricing is static over time.

Introduction¶

The provision of fresh water is an ecosystem service that contributes to the welfare of society in many ways, including through the production of hydropower, the most widely used form of renewable energy in the world. Most hydropower production comes from watershed-fed reservoir systems that generally deliver energy consistently and predictably. The systems are designed to account for annual variability in water volume, given the likely levels for a given watershed, but are vulnerable to extreme variation caused by land use and land cover (LULC) changes. LULC changes can alter hydrologic cycles, affecting patterns of evapotranspiration, infiltration and water retention, and changing the timing and volume of water that is available for hydropower production (World Commission on Dams 2000; Ennaanay 2006).

Changes in the landscape that affect annual average water yield upstream of hydropower facilities can increase or decrease hydropower production capacity. Maps of where water yield used for hydropower is produced can help avoid unintended impacts on hydropower production or help direct land use decisions that wish to maintain power production, while balancing other uses such as conservation or agriculture. Such maps can also be used to inform investments in restoration or management that downstream stakeholders, such as hydropower companies, make in hopes of improving or maintaining water yield for this important ecosystem service. In large watersheds with multiple reservoirs for hydropower production, areas upstream of power plants that sell to a higher value market will have a higher value for this service. Maps of how much value each parcel contributes to hydropower production can help managers avoid developments in the highest hydropower value areas, understand how much value will be lost or gained as a consequence of different management options, or identify which hydropower producers have the largest stake in maintaining water yield across a landscape.

The Model¶

The InVEST Water Yield model estimates the relative contributions of water from different parts of a landscape, offering insight into how changes in land use patterns affect annual surface water yield and hydropower production.

Modeling the connections between landscape changes and hydrologic processes is not simple. Sophisticated models of these connections and associated processes (such as the WEAP model) are resource and data intensive and require substantial expertise. To accommodate more contexts, for which data are readily available, InVEST maps and models the annual average water yield from a landscape used for hydropower production, rather than directly addressing the effect of LULC changes on hydropower, as this process is closely linked to variation in water inflow on a daily to monthly timescale. Instead, InVEST calculates the relative contribution of each land parcel to annual average hydropower production and the value of this contribution in terms of energy production. The net present value of hydropower production over the life of the reservoir also can be calculated by summing discounted annual revenues.

How it Works¶

The model runs on a gridded map. It estimates the quantity and value of water used for hydropower production from each subwatershed in the area of interest. It has three components, which run sequentially. First, it determines the amount of water running off each pixel as the precipitation minus the fraction of the water that undergoes evapotranspiration. The model does not differentiate between surface, subsurface and baseflow, but assumes that all water yield from a pixel reaches the point of interest via one of these pathways. This model then sums and averages water yield to the subwatershed level. The pixel-scale calculations allow us to represent the heterogeneity of key driving factors in water yield such as soil type, precipitation, vegetation type, etc. However, the theory we are using as the foundation of this set of models was developed at the subwatershed to watershed scale. We are only confident in the interpretation of these models at the subwatershed scale, so all outputs are summed and/or averaged to the subwatershed scale. We do continue to provide pixel-scale representations of some outputs for calibration and model-checking purposes only. These pixel-scale maps are not to be interpreted for understanding of hydrological processes or to inform decision making of any kind.

Second, beyond annual average runoff, it calculates the proportion of surface water that is available for hydropower production by subtracting the surface water that is consumed for other uses. Third, it estimates the energy produced by the water reaching the hydropower reservoir and the value of this energy over the reservoir’s lifetime.

Figure 1. Conceptual diagram of the simplified water balance method used in the annual water yield model. Aspects of the water balance that are in color are included in the model, those that are in grey are not.

Water Yield Model¶

The water yield model is based on the Budyko curve and annual average precipitation. We determine annual water yield \(Y(x)\) for each pixel on the landscape \(x\) as follows:

\[Y(x) = \left(1-\frac{AET(x)}{P(x)}\right)\cdot P(x)\]

where \(AET(x)\) is the annual actual evapotranspiration for pixel \(x\) and \(P(x)\) is the annual precipitation on pixel \(x\).

For vegetated land use/land cover (LULC) types, the evapotranspiration portion of the water balance, \(\frac{AET(x)}{P(x)}\) , is based on an expression of the Budyko curve proposed by Fu (1981) and Zhang et al. (2004):

(1)¶\[\frac{AET(x)}{P(x)} = 1+\frac{PET(x)}{P(x)} - \left[1+\left(\frac{PET(x)}{P(x)}\right)^\omega\right]^{1/\omega}\]

where \(PET(x)\) is the potential evapotranspiration and \(\omega(x)\) is a non-physical parameter that characterizes the natural climatic-soil properties, both detailed below.

Potential evapotranspiration \(PET(x)\) is defined as:

\[PET(x) = K_c(\ell_x)\cdot ET_0(x)\]

where, \(ET_0(x)\) is the reference evapotranspiration from pixel \(x\) and \(K_c(\ell_x)\) is the plant (vegetation) evapotranspiration coefficient associated with the LULC \(\ell_x\) on pixel \(x\). \(ET_0(x)\) reflects local climatic conditions, based on the evapotranspiration of a reference vegetation such as grass or alfalfa grown at that location. \(K_c(\ell_x)\) is largely determined by the vegetative characteristics of the land use/land cover found on that pixel (Allen et al. 1998). \(K_c\) adjusts the \(ET_0\) values to the crop or vegetation type in each pixel of the land use/land cover map.

\(\omega(x)\) is an empirical parameter that can be expressed as linear function of \(\frac{AWC*N}{P}\), where N is the number of rain events per year, and AWC is the volumetric plant available water content (see Appendix 1 for additional details). While further research is being conducted to determine the function that best describe global data, we use the expression proposed by Donohue et al. (2012) in the InVEST model, and thus define:

(2)¶\[\omega(x) = Z\frac{AWC(x)}{P(x)} + 1.25\]

where:

\(AWC(x)\) is the volumetric (mm) plant available water content. The soil texture and effective rooting depth define \(AWC(x)\), which establishes the amount of water that can be held and released in the soil for use by a plant. It is estimated as the product of the plant available water capacity (PAWC) and the minimum of root restricting layer depth and vegetation rooting depth:

\[AWC(x)= Min(Rest.layer.depth, root.depth)\cdot PAWC\]

Root restricting layer depth is the soil depth at which root penetration is inhibited because of physical or chemical characteristics. Vegetation rooting depth is often given as the depth at which 95% of a vegetation type’s root biomass occurs. PAWC is the plant available water capacity, i.e. the difference between field capacity and wilting point.
\(Z\) is an empirical constant, sometimes referred to as “seasonality factor”, which captures the local precipitation pattern and additional hydrogeological characteristics. It is positively correlated with N, the number of rain events per year. The 1.25 term is the minimum value of \(\omega(x)\), which can be seen as a value for bare soil (when root depth is 0), as explained by Donohue et al. (2012). Following the literature (Yang et al., 2008; Donohue et al. 2012), values of \(\omega(x)\) are capped to a value of 5.

For other LULC types (open water, urban, wetland), actual evapotranspiration is directly computed from reference evapotranspiration \(ET_0(x)\) and has an upper limit defined by precipitation:

(3)¶\[AET(x) = Min(K_c(\ell_x)\cdot ET_0(x),P(x))\]

where \(ET_0(x)\) is reference evapotranspiration, and \(K_c(\ell_x)\) is the evaporation factor for each LULC.

The water yield model generates and outputs the total and average water yield at the subwatershed level.

Realized Supply¶

The Realized Supply option of the model (called Water Scarcity in the tool interface) calculates the water inflow to a reservoir based on calculated water yield and water consumptive use in the watershed(s) of interest. The user inputs how much water is consumed by each land use/land cover type in a table format. Examples of consumptive use include municipal or industrial withdrawals that are not returned to the stream upstream of the outlet. This option may also be used to represent inter-basin transfers out of the study watershed.

For example, in an urban area, consumptive use can be calculated as the product of population density and per capita consumptive use. These land use-based values only relate to the consumptive portion of demand; some water use is non-consumptive such as water used for industrial processes or waste water that is returned to the stream after use, upstream of the outlet. Consumptive use estimates should therefore take into account any return flows to the stream above the watershed outlet:

\[C = \frac{W-R}{n}\]

where, \(C\) = the consumptive use (\(m^3/yr/pixel\)), \(W\) = withdrawals (\(m^3/yr\)), \(R\) = return flows (\(m^3/yr\)), and \(n\) = number of pixels in a given land cover.

For simplicity, each pixel in the watershed is either a “contributing” pixel, which contributes to hydropower production, or a “use” pixel, which uses water for other consumptive uses. This assumption implies that land use associated with consumptive uses will not contribute any yield for downstream use. The amount of water that actually reaches the reservoir for dam \(d\) (called realized supply) is defined as the difference between total water yield from the watershed and total consumptive use in the watershed:

\[V_{in} = Y-u_d\]

where \(V_{in}\) is the realized supply (volume inflow to a reservoir), \(u_d\) is the total volume of water consumed in the watershed upstream of dam \(d\) and \(Y\) is the total water yield from the watershed upstream of dam \(d\).

Note that only anthropogenic uses are considered here, since evapotranspiration (including consumptive use of water by croplands) are accounted for by the \(K_c\) parameter in the water yield model. Users should be aware that the model assumes that all water available for evapotranspiration comes from within the watershed (as rainfall). This assumption holds true in cases where agriculture is either rain-fed, or the source of irrigation water is within the study watershed (not sourced from inter-basin transfer or a disconnected deeper aquifer). See the Limitations section for more information on applying the model in watersheds with irrigated agriculture.

If observed data is available for actual annual inflow rates to the reservoir for dam \(d\), they can be compared to \(V_{in}\).

Hydropower Production and Valuation¶

The Valuation option of the model estimates both the amount of energy produced given the estimated realized supply of water for hydropower production and the value of that energy. A present value monetary estimate is given for the entire remaining lifetime of the reservoir. Net present value can be calculated if hydropower production cost data are available. The energy produced and the revenue is then redistributed over the landscape based on the proportional contribution of each subwatershed to energy production. Final output maps show how much energy production and hydropower value can be attributed to each subwatershed’s water yield over the lifetime of the reservoir.

An important note about assigning a monetary value to any service is that valuation should only be done on model outputs that have been calibrated and validated. Otherwise, it is unknown how well the model is representing the area of interest, which may lead to misrepresentation of the exact value. If the model has not been calibrated, only relative results should be used (such as an increase of 10%) not absolute values (such as 1,523 cubic meters, or 42,900 dollars.)

At dam \(d\), power is calculated using the following equation:

\[p_d = \rho\cdot q_d \cdot g \cdot h_d\]

where \(p_d\) is power in watts, \(\rho\) is the water density (1000 Kg/m³), \(q_d\) is the flow rate (m³/s), \(g\) is the gravity constant (9.81 m/s²), and \(h_d\) is the water height behind the dam at the turbine (m). In this model, we assume that the total annual inflow water volume is released equally and continuously over the course of each year.

The power production equation is connected to the water yield model by converting the annual inflow volume adjusted for consumption (\(V_{in}\)) to a per second rate. Since electric energy is normally measured in kilowatt-hours, the power \(p_d\) is multiplied by the number of hours in a year. All hydropower reservoirs are built to produce a maximum amount of electricity. This is called the energy production rating, and represents how much energy could be produced if the turbines are 100% efficient and all water that enters the reservoir is used for power production. In the real world, turbines have inefficiencies and water in the reservoir may be extracted for other uses like irrigation, retained in the reservoir for other uses like recreation, or released from the reservoir for non-power production uses like maintaining environmental flows downstream. To account for these inefficiencies and the flow rate and power unit adjustments, annual average energy production \(\varepsilon_d\) at dam \(d\) is calculated as follows:

\[\varepsilon_d= 0.00272\cdot \beta \cdot \gamma_d \cdot h_d \cdot V_{in}\]

where \(\varepsilon_d\) is hydropower energy production (KWH), \(\beta\) is the turbine efficiency coefficient (%), \(\gamma_d\) is the percent of inflow water volume to the reservoir at dam \(d\) that will be used to generate energy.

To convert \(\varepsilon_d\), the annual energy generated by dam \(d\), into a net present value (NPV) of energy produced (point of use value) we use the following,

(4)¶\[NPVH_d=(p_e\varepsilon_d-TC_d)\times \sum^{T-1}_{t=0}\frac{1}{(1+r)^t}\]

where \(TC_d\) is the total annual operating costs for dam \(d\), \(p_e\) is the market value of electricity (per kilowatt hour) provided by the hydropower plant at dam \(d\), \(T_d\) indicates the number of years present landscape conditions are expected to persist or the expected remaining lifetime of the station at dam \(d\) (set \(T\) to the smallest value if the two time values differ), and \(r\) is the market discount rate. The form of the equation above assumes that \(TC_d\), \(p_e\), and \(\varepsilon_d\), are constant over time. Any currency may be used, as long as it is consistent across the different inputs.

The model does not perform the following calculations, but energy production over the lifetime of dam \(d\) can be attributed to each subwatershed as follows:

\[\varepsilon_x = (T_d\varepsilon_d)\times(c_x / c_{tot})\]

where the first term in parentheses represents the electricity production over the lifetime of dam \(d\). The second term represents the proportion of water volume used for hydropower production that comes from subwatershed \(x\) relative to the total water volume for the whole watershed. The value of each subwatershed for hydropower production over the lifetime of dam \(d\) can be calculated similarly:

\[NPVH_x=NPVH_d\times (c_x/c_{tot})\]

Limitations and Simplifications¶

The model has a number of limitations. First, it is not intended for devising detailed water plans, but rather for evaluating how and where changes in a watershed may affect hydropower production for reservoir systems. It is based on annual averages, which neglect extremes and do not consider the temporal dimensions of water supply and hydropower production.

Second, the model does not consider the spatial distribution of land use/land cover. The empirical model used for the water balance (based on the Budyko theory) has been tested at larger scales than the pixel dimensions used in InVEST (Hamel & Guswa, in review). Complex land use patterns or underlying geology, which may induce complex water balances, may not be well captured by the model.

Third, the model does not consider sub-annual patterns of water delivery timing. Water yield is a provisioning function, but hydropower benefits are also affected by flow regulation. The timing of peak flows and delivery of minimum operational flows throughout the year determines the rate of hydropower production and annual revenue. Changes in landscape scenarios are likely to affect the timing of flows as much as the annual water yield, and are of particular concern when considering drivers such as climate change. Modeling the temporal patterns of overland flow requires detailed data that are not appropriate for our approach. Still, this model provides a useful initial assessment of how landscape scenarios may affect the annual delivery of water to hydropower production.

Fourth, the model greatly simplifies consumptive demand. For each LULC, a single variable (\(\gamma_d\)) is used to represent multiple aspects of water resource allocation, which may misrepresent the complex distribution of water among uses and over time. In reality, water demand may differ greatly between parcels of the same LULC class. Much of the water demand may also come from large point source intakes, which are not represented by an LULC class at all. The model simplifies water demand by distributing it over the landscape. For example, the water demand may be large for an urban area, and the model represents this demand by distributing it over the urban LULC class. The actual water supply intake, however, is likely further upstream in a rural location. Spatial disparity in actual and modeled demand points may cause an incorrect representation in the realized supply output grid. The distribution of consumption is also simplified in the reallocation of energy production and hydropower value since it is assumed that water consumed along flow paths is drawn equally from every pixel upstream. As a result, water scarcity, energy production patterns, and hydropower values may be incorrectly estimated.

Fifth, water transfers for irrigation, either between subbasins or between seasons, are not well captured by the model. When applying the empirical approach to cropland, irrigation patterns should be considered, which typically fall into one of the following cases:

If there is no irrigation other than direct rain, it can be assumed that croplands respond to climate forcing in a similar way to natural vegetation (i.e. the theory behind the eco-hydrological model used in the InVEST model, linking plant available water and climate forcing, applies, cf. Donohue et al. 2012)
If small reservoirs store water during the wet season to irrigate crops during the dry season, the AET should equal PET during the irrigation season. However, the model predicts AET<PET due to limited water retention in undisturbed catchments (where there is no other reservoir except soil storage). This likely results in the underestimation of evapotranspiration, and therefore the overestimation of yields. To avoid this issue, you can use the alternative equation for AET (equation 2), which sets AET directly as a function of ETo. (In that case, remember that AET is capped by P to avoid predicting negative water yields, which may result in an overestimation of yields).
If the study area contains croplands that are irrigated with water from outside the catchment (either through inter-basin transfer or pumping from a disconnected groundwater source), then AET also equals PET during the irrigation season. Because the model assumes that evapotranspiration is sourced from rainfall, the water yield output is likely overestimated. This situation can also be represented by using the alternative equation for AET (equation 2). Assuming that crops are being irrigated efficiently (i.e. the total volume of imported water is equal to the water deficit, or PET – P, for crop pixels), then the known volume of water irrigated may be added to the modeled water yield to give a better picture of actual yield.
Because seasonality can play a significant role in irrigation water use, use caution when applying the annual model in catchments with large irrigated fields. For options that are not covered above or where complex water transfers may substantially affect the water balance, users are encouraged to use alternative models that will better represent the spatial and temporal water transfers. In particular, great caution should be used when calibrating the model without good data on the different water balance components within your study area (i.e. rainfall, streamflow, irrigation rates and timing).

Finally, the model assumes that hydropower production and pricing remain constant over time. It does not account for seasonal variation in energy production or fluctuations in energy pricing, which may affect the value of hydropower. Even if sub-annual production or energy prices change, however, the relative value between parcels of land in the same drainage area should be accurate.

Data Needs¶

Note

All spatial inputs must have exactly the same projected coordinate system (with linear units of meters), not a geographic coordinate system (with units of degrees).

Note

Raster inputs may have different cell sizes, and they will be resampled to match the cell size of the land use/land cover raster. Therefore, all model results will have the same cell size as the land use/land cover raster.

workspace (directory, required): The folder where all the model’s output files will be written. If this folder does not exist, it will be created. If data already exists in the folder, it will be overwritten.
file suffix (text, optional): Suffix that will be appended to all output file names. Useful to differentiate between model runs.
precipitation (raster, units: mm/year, required): Map of average annual precipitation.
It is strongly recommended to use the same precipitation layer that was used to create the evapotranspiration input raster. If they are based on different sources of precipitation data, this introduces another source of uncertainty in the data, and the mismatch could affect the water balance components computed by the model.
reference evapotranspiration (raster, units: mm, required): Map of reference evapotranspiration values.
It is strongly recommended that the evapotranspiration input raster is based on the same precipitation data as is input to the model. If they are based on different sources of precipitation data, this introduces another source of uncertainty in the data, and the mismatch could affect the water balance components computed by the model.
root restricting layer depth (raster, units: mm, required): Map of root restricting layer depth, the soil depth at which root penetration is strongly inhibited because of physical or chemical characteristics.
plant available water content (raster, required): Map of plant available water content, the fraction of water that can be stored in the soil profile that is available to plants.
land use/land cover (raster, required): Map of land use/land cover codes. Each land use/land cover type must be assigned a unique integer code. All values in this raster must have corresponding entries in the Biophysical Table.
watersheds (vector, polygon, required): Map of watershed boundaries, such that each watershed drains to a point of interest where hydropower production will be analyzed.

Field:
- ws_id (integer, required): Unique identifier for each watershed.
sub-watersheds (vector, polygon/multipolygon, optional): Map of subwatershed boundaries within each watershed in the Watersheds map.

Fields:
- subws_id (integer, required): Unique identifier for each subwatershed.
biophysical table (CSV, required): Table of biophysical parameters for each LULC class. All values in the LULC raster must have corresponding entries in this table.

Columns:
- lucode (integer, required): LULC codes from the LULC raster. Each code must be a unique integer.
- lulc_veg (integer, required): Code indicating whether the the LULC class is vegetated for the purpose of AET. Enter 1 for all vegetated classes except wetlands, and 0 for all other classes, including wetlands, urban areas, water bodies, etc.
  Classes with a value of 1 will have AET calculated according to eq. (1). Classes with a value of 0 will have AET calculated according to eq. (3).
- root_depth (number, units: mm, required): Maximum root depth for plants in this LULC class. Only used for classes with a ‘lulc_veg’ value of 1.
  This is often given as the depth at which 95% of a vegetation type’s root biomass occurs. For land uses where the generic Budyko curve is not used (i.e. where evapotranspiration is calculated from eq.:eq:aet_non_vegetated), rooting depth is not needed. In these cases, the rooting depth field is ignored, and may be set as a value such as -1 to indicate the field is not used.
- kc (number, units: unitless, required): Crop coefficient for this LULC class.
  Used to calculate potential evapotranspiration to modify the reference evapotranspiration.
z parameter (number, units: unitless, required): The seasonality factor, representing hydrogeological characterisitics and the seasonal distribution of precipitation. Values typically range from 1 - 30.
This is \(Z\) in eq. (2). See the Appendix for more information.
water demand table (CSV, conditionally required): A table of water demand for each LULC class. Each LULC code in the LULC raster must have a corresponding row in this table. Required if ‘valuation_table_path’ is provided.
Consumptive water use is that part of water used that is incorporated into products or crops, consumed by humans or livestock, or otherwise removed from the watershed water balance.
Columns:
- lucode (integer, required): LULC code corresponding to the LULC raster
- demand (number, units: m³/(pixel · year), required): Average consumptive water use in this LULC class.
  Note that accounting for pixel area is important since larger pixels will consume more water for the same land cover type.
hydropower valuation table (CSV, optional): A table mapping each watershed to the associated valuation parameters for its hydropower station.
Columns:
- ws_id (integer, required): Unique identifier for the hydropower station. This must match the ‘ws_id’ value for the corresponding watershed in the Watersheds vector. Each watershed in the Watersheds vector must have its ‘ws_id’ entered in this column.
- efficiency (ratio, required): Turbine efficiency, the proportion of potential energy captured and converted to electricity by the turbine.
  May be obtained from the hydropower plant manager. Values generally range from 0.7 to 0.9.
- fraction (ratio, required): The proportion of inflow water volume that is used to generate energy.
  May be obtained from the hydropower plant manager. Managers may release water without generating electricity to satisfy irrigation, drinking water, or environmental demands.
- height (number, units: m, required): The head, measured as the average annual effective height of water behind each dam at the turbine intake.
- kw_price (number, units: currency units/kWh, required): The price of power produced by the station. Must be in the same currency used in the ‘cost’ column.
- cost (number, units: currency units/year, required): Annual maintenance and operations cost of running the hydropower station. Must be in the same currency used in the ‘kw_price’ column.
- time_span (number, units: year, required): Number of years over which to value the hydropower station. This is either the station’s expected lifespan or the duration of the land use scenario of interest.
  This is \(T\) in equation (4).
- discount (percent, required): The annual discount rate, applied for each year in the time span.
  This is \(r\) in equation (4).

Interpreting Results¶

The resolution of the output rasters will be the same as the resolution of the Land use/land cover raster provided as input.

Parameter log: Each time the model is run, a text (.txt) file will be created in the Workspace. The file will list the parameter values and output messages for that run and will be named according to the service, the date and time. When contacting NatCap about errors in a model run, please include the parameter log.
Outputs in the per_pixel folder can be useful for intermediate calculations but should NOT be interpreted at the pixel level, as model assumptions are based on processes understood at the subwatershed scale.
- output\per_pixel\fractp_[Suffix].tif (fraction): Estimated actual evapotranspiration fraction of precipitation per pixel (Actual Evapotranspiration / Precipitation). It is the mean fraction of precipitation that actually evapotranspires at the pixel level.
- output\per_pixel\aet_[Suffix].tif (mm): Estimated actual evapotranspiration per pixel.
- output\per_pixel\wyield_[Suffix].tif (mm): Estimated water yield per pixel.
output\subwatershed_results_wyield_[Suffix].shp and output\subwatershed_results_wyield_[Suffix].csv: Shapefile and table containing biophysical output values per subwatershed, with the following attributes:
- precip_mn (mm): Mean precipitation per pixel in the subwatershed.
- PET_mn (mm): Mean potential evapotranspiration per pixel in the subwatershed.
- AET_mn (mm): Mean actual evapotranspiration per pixel in the subwatershed.
- wyield_mn (mm): Mean water yield per pixel in the subwatershed.
- wyield_vol (m³): Total volume of water yield in the subwatershed. Calculated as wyield_mn x subwatershed area / 1000.
output\watershed_results_wyield_[Suffix].shp and output\watershed_results_wyield_[Suffix].csv: Shapefile and table containing output values per watershed, with the following attributes:
- precip_mn (mm): Mean precipitation per pixel in the watershed.
- PET_mn (mm): Mean potential evapotranspiration per pixel in the watershed.
- AET_mn (mm): Mean actual evapotranspiration per pixel in the watershed.
- wyield_mn (mm): Mean water yield per pixel in the watershed.
- wyield_vol (m³): Total volume of water yield in the watershed. Calculated as wyield_mn x watershed area / 1000.
If the Water Scarcity option is run, the following attributes will also be included for watersheds and subwatersheds:
- consum_vol (m³): Total water consumption for each watershed.
- consum_mn (m³/ha): Mean water consumptive volume per pixel per watershed.
- rsupply_vl (m³): Total realized water supply (water yield – consumption) volume for each watershed.
- rsupply_mn (m³/ha): Mean realized water supply (water yield – consumption) volume per pixel per watershed.
If the Valuation option is run, the following attributes will also be included for watersheds, but not for subwatersheds:
- hp_energy (kWh): The amount of ecosystem service in energy production terms. This is the amount of energy produced annually by the hydropower station that can be attributed to each watershed based on the watershed’s water yield contribution.
- hp_val (currency/timespan): The amount of ecosystem service in economic terms. This shows the value of the landscape per watershed according to its ability to yield water for hydropower production over the specified timespan, and with respect to the discount rate.
intermediate: This directory contains data that represent intermediate steps in calculations of the final data in the output folder. It also contains subdirectories that store metadata used internally to enable avoided re-computation.

The application of these results depends entirely on the objective of the modeling effort. Users may be interested in all of these results or a select one or two. If valuation information is not available or of interest, you may choose to simply run the water yield model and compare biophysical results.

The first several model results provide insight into how water is distributed throughout the landscape. aet_mn describes the actual evapotranspiration depth of the hydrologic cycle, showing how much water (precipitation) is lost annually to evapotranspiration across the watershed or subwatershed.

The wyield_vol field contains the estimated annual average water volume that is ‘yielded’ from each subwatershed within the watershed of interest. This value can be used to determine which subwatersheds are most important to total annual water yield – although at this step the user still will not know how much of that water is benefiting downstream users of any type. The consumptive use (consum_vol) field then shows how much water is used for consumptive activities (such as drinking, bottling, etc.) each year across the landscape per watershed. The realized supply (rsupply_vl) field contains the difference between cumulative water yield and cumulative consumptive use. This value demonstrates where the water supply for hydropower production is abundant and where it is most scarce. Remember that the consumptive use value may not truly represent where water is taken, only where it is demanded. This may cause some misrepresentation of the scarcity in certain locations, but this value offers a general sense of the water balance and whether there is a lack of or abundance of water in the watershed of interest.

The hp_energy and hp_val values are the most relevant model outputs for prioritizing the landscape for investments that wish to maintain water yield for hydropower production. The hp_val field contains the most information for this purpose as it represents the revenue attributable to each watershed over the expected lifetime of the hydropower station, or the number of years that the user has chosen to model. This value accounts for the fact that different hydropower stations within a large river basin may have different customers who pay different rates for energy production. If this is the case, this result will show which watersheds contribute the highest value water for energy production. If energy values do not vary much across the landscape, the hp_energy outputs can be just as useful in planning and prioritization. Comparing any of these values between landuse scenarios allows you to understand how the role of the landscape may change under different management plans.

Calibration/Comparison with observed data¶

The Calibrating the InVEST Freshwater Models chapter of this Guide provides an overview of how to perform sensitivity analysis and calibration.

The water yield model is based on a simple water balance where it is assumed that all water in excess of evaporative loss arrives at the outlet of the watershed. The model is an annual average time step simulation tool applied at the pixel level but reported at the subwatershed level. If possible, calibration of the model should be performed using long term average streamflow. As a rule of thumb, a 10-year period should be used to capture some climate variability, and this 10-year period should coincide with the date of the LULC map and precipitation/ET0 maps. The other inputs - root restricting layer depth and plant available water content - are less susceptible to temporal variability so any available data for these parameters may be used.

Gauge data is often provided in flow units (such as m³/s). Since the model calculates water volume, the observed flow data should be converted into units of m³/year.

As with all models, model uncertainty is inherent and must be considered when analyzing results for decision making. Before starting the calibration process, we highly recommend conducting a sensitivity analysis. The sensitivity analysis will define the parameters that influence model outputs the most. See for example Hamel and Guswa 2015; Sanchez-Canales et al., 2012, and particularly Hamel and Bryant 2017, which provides more general guidance for assessing uncertainty in ecosystem services analyses. The calibration can then focus on highly sensitive parameters.

Appendix 1: Data Sources¶

Precipitation ¶

Reference Evapotranspiration ¶

Kc ¶

Land Use/Land Cover ¶

Watersheds/Subwatersheds ¶

Root restricting layer depth¶

Root restricting layer depth is the soil depth at which root penetration is strongly inhibited because of physical or chemical characteristics. Root restricting layer depth may be obtained from some soil maps. If root restricting layer depth or rootable depth by soil type is not available, soil depth can be used as a proxy. If several soil horizons are detailed, the root restricting layer depth is the sum of the depths of non-restrictive soil horizons.

Global soil data are available from the Soil and Terrain Database (SOTER) Programme (https://data.isric.org:443/geonetwork/srv/eng/catalog.search). They provide some area-specific soil databases, as well as SoilGrids globally. Type “depth” into their Search engine to see a list of layers. For ISRIC SoilGrids 250m (version 2017), Depth to bedrock (R horizon) can be used. Note that the Depth to bedrock values are given in centimeters, which will need to be converted to millimeters to be used in the model. SoilGrids version 2.0 does not currently include a soil depth layer.

The FAO also provides global soil data in their Harmonized World Soil Database: https://webarchive.iiasa.ac.at/Research/LUC/External-World-soil-database/HTML/, but it is rather coarse.

In the United States free soil data is available from the U.S. Department of Agriculture’s NRCS gSSURGO, SSURGO and gNATSGO databases: https://www.nrcs.usda.gov/wps/portal/nrcs/main/soils/survey/geo/. They also provide ArcGIS tools (Soil Data Viewer for SSURGO and Soil Data Development Toolbox for gNATSGO) that help with processing these databases into spatial data that can be used by the model. The Soil Data Development Toolbox (available at https://www.nrcs.usda.gov/resources/data-and-reports/gridded-soil-survey-geographic-gssurgo-database) is easiest to use, and highly recommended if you use ArcGIS and need to process U.S. soil data.

Plant available water content (PAWC)¶

Plant available water content is a fraction obtained from some standard soil maps. It is defined as the difference between the fraction of volumetric field capacity and permanent wilting point. Often plant available water content is available as a volumetric value (mm). To obtain the fraction divide by soil depth.

Soil data in general often has missing data (holes) where there are water bodies. If you want to fill in holes in PAWC data that correspond with water bodies, it is advised to use a value of 1, which is the maximum value for PAWC. This implies that there is no limitation in the amount of water available in the soils under the water for plants to use. There probably is very little vegetation in open water bodies, but some areas classified as water could actually have emergent vegetation or be seasonal wetlands, so assuming that they are almost always wet (which should be a safe assumption if they are classified as open water) would imply maximum PAWC. Also, if the amount of water evapotranspired off of each pixel is calculated as a combination of PAWC and Kc, and if Kc is relatively high for water bodies, then setting PAWC to the maximum value would mean that all water is available for evapotranspiration (or evaporation, if it’s open water), which is the case in reality.

ISRIC SoilGrids 2017 AWC data¶

One global AWC raster is provided by ISRIC, as part of their 2017 SoilGrids product, called SoilGrids250m 2017-03 - “Derived available soil water capacity (volumetric fraction) until wilting point” (https://data.isric.org/geonetwork/srv/eng/catalog.search#/metadata/e33e75c0-d9ab-46b5-a915-cb344345099c). These layers require additional processing to convert the units from percent to fraction, using the soil depth of each layer. If you do not have more local PAWC data, and live outside of the United States (which has a spatial soil data processing tool, listed below), this ISRIC 2017 layer is probably the easiest data source. Note that SoilGrids version 2.0 does not currently provide AWC, so if you prefer to work with version 2.0, you’ll need to find a different method that makes use of the layers that are provided by that version. You can also search for more region-specific ISRIC datasets by typing “available water” into their search engine (https://data.isric.org:443/geonetwork/srv/eng/catalog.search).

If you are using the global SoilGrids 2017 AWC data, following is one way of processing it into the input required for InVEST, using GIS software.

You can find a preprocessed PAWC dataset created using the following instructions from the NatCap Data Hub here: https://data.naturalcapitalalliance.stanford.edu/dataset/?_tags_limit=0&tags=PAWC

SoilGrids 2017 provides AWC layers for 7 soil depth intervals. All 7 depth intervals need to be downloaded, then combined into a single layer for use in the model.

When downloaded from ISRIC, the raw AWC rasters are named as such:

Depth 0cm: WWP_M_sl1_250m_ll.tif
Depth 5cm: WWP_M_sl2_250m_ll.tif
Depth 15cm: WWP_M_sl3_250m_ll.tif
Depth 30cm: WWP_M_sl4_250m_ll.tif
Depth 60cm: WWP_M_sl5_250m_ll.tif
Depth 100cm: WWP_M_sl6_250m_ll.tif
Depth 200cm: WWP_M_sl7_250m_ll.tif

Raster values are given as whole number percentages (such as 25, which means an AWC value of 25%).

The method that is described here is provided in the SoilGrids scientific paper (Hengl 2017):

“Averages over (standard) depth intervals, e.g. 0–5 cm or 0–30 cm, can be derived by taking a weighted average of the predictions within the depth interval using numerical integration, such as the trapezoidal rule:”

\[(\frac{1}{(b-a)})(\frac{1}{2})\sum_{k=1}^{N-1}{(x_{k+1} - x_{k})(f(x_{k}) + f(x_{k+1}))}\]

“where \(N\) is the number of depths, \(x_{k}\) is the k-th depth and \(f(x_{k})\) is the value of the target variable (i.e., soil property) at depth \(x_{k}\).”

Steps

Download all available depth intervals from the ISRIC web site. Depth intervals are 0cm - 200cm. Note that each raster is 1.5GB in size.
Use the GIS Buffer tool to create a buffer around the watershed/area of interest that you’re modeling. Since the SoilGrids data is 250m resolution, make the buffer 250 or 500m wide. This is done to make sure that the soil data completely covers the watershed that you’re modeling, without holes around the border.
Use the buffered watershed to clip all of the raw ISRIC AWC rasters to your area of interest. In ArcGIS this can be done with the Spatial Analyst tool Extract by Mask. In QGIS the tool is called Clip Raster by Mask Layer. For this example, we’ll call the clipped layers AWC_sl1_clip.tif, AWC_sl2_clip.tif … AWC_sl7_clip.tif.
Use the GIS Raster Calculator tool to calculate the combined AWC layer. Substituting into the Hengl equation above gives us

(1/(200-0)) * (1/2) * ( ((5-0) * (AWC_sl1_clip.tif + AWC_sl2_clip.tif)) + ((15-5) * (AWC_sl2_clip.tif + AWC_sl3_clip.tif)) + ((30-15) * (AWC_sl3_clip.tif + AWC_sl4_clip.tif)) + ((60-30) * (AWC_sl4_clip.tif + AWC_sl5_clip.tif)) + ((100-60) * (AWC_sl5_clip.tif + AWC_sl6_clip.tif)) + ((200-100) * ( AWC_sl6_clip.tif + AWC_sl7_clip.tif)) )

Enter this equation into Raster Calculator, adjusting the file names as needed. Note for ArcGIS Desktop users: Instead of the initial fractions “(1/(200-0)) * (1/2)”, you’ll need to enter the equivalent decimal values, i.e. “(.005) * (0.5)” else the result of the calculation will be a raster of all 0s. The equation with fractions works as expected in QGIS and ArcGIS Pro.

The resulting raster should contain values in the range of 0-100, which represent whole number percentages. The model requires that AWC be given as a fraction, so divide the raster calculated in step 4 by 100.
Reproject the AWC fraction layer to have the same projected coordinate system as your other model inputs. This raster can now be used as the Available Water Content input to the model.

Other data sources¶

In the United States, free soil data is available from the NRCS gSSURGO, SSURGO and gNATSGO databases: https://www.nrcs.usda.gov/wps/portal/nrcs/main/soils/survey/geo/. They also provide ArcGIS tools (Soil Data Viewer for SSURGO and Soil Data Development Toolbox for gNATSGO) that help with processing these databases into spatial data that can be used by the model. The Soil Data Development Toolbox (available at https://www.nrcs.usda.gov/resources/data-and-reports/gridded-soil-survey-geographic-gssurgo-database) is easiest to use, and highly recommended if you use ArcGIS Desktop (it does not work in ArcGIS Pro or QGIS) and need to process U.S. soil data. Another option is SSURGO Portal (https://www.nrcs.usda.gov/resources/data-and-reports/ssurgo-portal), which is a new (beta) application independent from ArcGIS.

One other tool of note is SPAW Soil Water Characteristics https://www.ars.usda.gov/research/software/download/?softwareid=492, which helps estimate PAWC when you have soil texture data. However, it does not take in spatial data directly. At a minimum, you provide single values for %sand and %clay and it calculates a single value for Available Water. If you have additional data on organic matter, gravel, etc those may also be entered to refine the result. The Available Water value calculated by the tool will then need to be applied to the spatial soil layer. If your soil data is complex, with many different textures, or combinations of %sand and %clay, then this method will be very tedious and time-consuming. But if you have just a few texture values, it can be applied fairly easily.

Root depth¶

A valuable review of plant rooting depths was done by Schenk and Jackson (2002). Root depth values should be based on depth at which 90% of root biomass occurs, not the maximum depth of the longest tap root. Other rooting depth values for crops and some tree plantations can be found in the FAO 56 guidelines by Allen et al. (1998).

The model determines the minimum of root restricting layer depth and rooting depth for an accessible soil profile for water storage. Values must be integer, converted to mm. For non-vegetated LULCs (e.g. urban), for which Equation 2 above is used, the model will not use the root depth value so any value can be inserted into the table.

Consumptive water use¶

The consumptive water use for each land use/land cover class is the water that is removed from the water balance. It should be estimated based on knowledge of local water transfers (e.g. extraction from groundwater or surface water for urban water supply) in consultation with local professionals in these fields. The value used in the table is an average for each land use type. For agricultural areas, water used by cattle or agricultural processing that is not returned to the watershed must be considered. In urban areas, water use may be calculated based on an estimated water use per person and multiplied by the approximate population area per raster cell. Industrial water use or water exports to other watersheds must also be considered where applicable. For all of these calculations, it is assumed that the agricultural water demand, people, etc. are spread evenly across each land use class.

Hydropower Station Information¶

Detailed information about each hydropower station may only be available from the owner or managing entity of the stations. Some information may be available through public sources, and may be accessible online. In particular, if the hydropower plant is located in the United States some information may be found on the internet.

Exact locations of specific structures, such as reservoirs, should be obtained from the managing entity or may be obtained on the web:

The U.S. National Inventory of Dams: https://nid.sec.usace.army.mil/

Global Reservoir and Dam (GRanD) Database: http://globaldamwatch.org/grand/

World Water Development Report II dam database: https://wwdrii.sr.unh.edu/download.html

Calibration: For calibration, data are needed on how much water actually reaches the (sub)watershed outlets, which can be a hydropower station, on an average annual basis. Data should be available from the managing entity of the hydropower plant. In absence of information available directly from the hydropower operators, data may be available for a stream gage just upstream of the hydropower station. Gages in the U.S. may be managed by the USGS, the state fish and wildlife agency, the state department of ecology or by a local university.
Time_period: The design life span of each hydropower station can be obtained from the station owner or operator. Alternative sources may be available online as described above. This value may instead represent the time period of a scenario of interest, which should be equal to or smaller than the life span of the station.
Discount_rate: This rate is defined as how much value the currency loses per year, which reflects society’s preference for immediate benefits over future benefits.

Z parameter¶

Z is an empirical constant that captures the local precipitation pattern and hydrogeological characteristics, with typical values ranging from 1 to 30. Several studies have determined \(\omega\) empirically (e.g. Xu et al. 2013, Fig. 3; Liang and Liu 2014; Donohue et al. 2012) and can be used to estimate Z. The relationship between \(\omega\) and Z is:

\[Z = \frac{(\omega-1.25) P}{AWC}\]

where P and AWC should be average values of Precipitation and Available Water Capacity, respectively, in the study area. \(AWC\) is the volumetric (mm) plant available water content. The soil texture and effective rooting depth define \(AWC\), which establishes the amount of water that can be held and released in the soil for use by a plant. It is estimated as the product of the plant available water capacity (PAWC) and the minimum of root restricting layer depth and vegetation rooting depth:

\[AWC = Min(Rest.layer.depth, root.depth)\times PAWC\]

Root restricting layer depth is the soil depth at which root penetration is inhibited because of physical or chemical characteristics. Vegetation rooting depth is often given as the depth at which 95% of a vegetation type’s root biomass occurs. PAWC is the plant available water capacity, i.e. the difference between field capacity and wilting point.

Alternatively, following a study by Donohue et al. (2012) encompassing a range of climatic conditions in Australia, Z could be estimated as 0.2*N, where N is the number of rain events per year. The definition of a rain event is the one used by the authors of the study, characterized by a minimum period of 6 hours between two storms. Calibration of the Z coefficient may also be used by comparing modeled and observed data. Note that the Budyko curve theory suggests that the sensitivity of the model to Z is lower when Z values are high, or in areas with a very low or very high aridity index (\(\frac{ET_0}{P}\); see Fig. 5 in Zhang et al. 2004).

References¶

Allen, R.G., Pereira, L.S., Raes, D. and Smith, M., 1998. “Crop evapotranspiration. Guidelines for computing crop water requirements.” FAO Irrigation and Drainage Paper 56. Food and Agriculture Organization of the United Nations, Rome, Italy. Paper available at http://www.fao.org/3/x0490e/x0490e00.htm. Annex 2 available at: http://www.fao.org/3/X0490E/x0490e0j.htm.

Allen, R., Pruitt, W., Raes, D., Smith, M. and Pereira, L., 2005. “Estimating Evaporation from Bare Soil and the Crop Coefficient for the Initial Period Using Common Soils Information.” Journal of Irrigation and Drainage Engineering, 131(1): 14-23.

Donohue, R. J., M. L. Roderick, and T. R. McVicar (2012), Roots, storms and soil pores: Incorporating key ecohydrological processes into Budyko’s hydrological model, Journal of Hydrology, 436-437, 35-50

Droogers, P. & Allen, R.G. 2002. “Estimating reference evapotranspiration under inaccurate data conditions.” Irrigation and Drainage Systems, vol. 16, Issue 1, February 2002, pp. 33–45

Ennaanay, Driss. 2006. Impacts of Land Use Changes on the Hydrologic Regime in the Minnesota River Basin. Ph.D. thesis, graduate School, University of Minnesota.

Fu, B. P. (1981), On the calculation of the evaporation from land surface (in Chinese), Sci. Atmos. Sin., 5, 23– 31.

Hamel, P., & Guswa, A. (2015). Uncertainty analysis of a spatially-explicit annual water-balance model: case study of the Cape Fear catchment, NC. Hydrology and Earth System Sciences. doi:10.5194/hess-19-839-2015

Hamel, P. & Bryant, B. (2017). Uncertainty assessment in ecosystem services analyses: Seven challenges and practical responses. Ecosystem Services, Volume 24. https://doi.org/10.1016/j.ecoser.2016.12.008.

Hengl T, Mendes de Jesus J, Heuvelink GBM, Ruiperez Gonzalez M, Kilibarda M, Blagotić A, et al. (2017) SoilGrids250m: Global gridded soil information based on machine learning. PLoS ONE 12(2): e0169748. https://doi.org/10.1371/journal.pone.0169748

Liang, L., & Liu, Q. (2014). Streamflow sensitivity analysis to climate change for a large water-limited basin. Hydrological Processes, 28(4), 1767–1774. doi:10.1002/hyp.9720

Sánchez-Canales, M., López Benito, A., Passuello, A., Terrado, M., Ziv, G., Acuña, V., Elorza, F. J. (2012). Sensitivity analysis of ecosystem service valuation in a Mediterranean watershed. Science of the Total Environment, 440, 140–53. doi:10.1016/j.scitotenv.2012.07.071

Schenk, H. J., & Jackson, R. B. (2002). Rooting depths, lateral root spreads and below-ground/above-ground allometries of plants in water-limited ecosystems. Journal of Ecology, 90(3), 480–494. doi:10.1046/j.1365-2745.2002.00682.x

World Commission on Dams (2000). Dams and development: A new framework for decision- making. The Report of the World Commission on Dams. Earthscan Publications LTD, London.

Xu, X., Liu, W., Scanlon, B. R., Zhang, L., & Pan, M. (2013). Local and global factors controlling water-energy balances within the Budyko framework. Geophysical Research Letters, 40(23), 6123–6129. doi:10.1002/2013GL058324

Yang, H., Yang, D., Lei, Z., & Sun, F. (2008). New analytical derivation of the mean annual water-energy balance equation. Water Resources Research, 44(3), n/a–n/a. doi:10.1029/2007WR006135

Zhang, L., Hickel, K., Dawes, W. R., Chiew, F. H. S., Western, A. W., Briggs, P. R. (2004) A rational function approach for estimating mean annual evapotranspiration. Water Resources Research. Vol. 40 (2)

InVEST®

Table of Contents

Annual Water Yield¶

Summary¶

Introduction¶

The Model¶

How it Works¶

Water Yield Model¶

Realized Supply¶

Hydropower Production and Valuation¶

Limitations and Simplifications¶

Data Needs¶

Interpreting Results¶

Calibration/Comparison with observed data¶

Appendix 1: Data Sources¶

Precipitation ¶

Reference Evapotranspiration ¶

Kc ¶

Land Use/Land Cover ¶

Watersheds/Subwatersheds ¶

Root restricting layer depth¶

Plant available water content (PAWC)¶

ISRIC SoilGrids 2017 AWC data¶

Other data sources¶

Root depth¶

Consumptive water use¶

Hydropower Station Information¶

Z parameter¶

References¶

Annual Water Yield¶

Summary¶

Introduction¶

The Model¶

How it Works¶

Water Yield Model¶

Realized Supply¶

Hydropower Production and Valuation¶

Limitations and Simplifications¶

Data Needs¶

Interpreting Results¶

Calibration/Comparison with observed data¶

Appendix 1: Data Sources¶

Precipitation¶

Reference Evapotranspiration¶

Kc¶

Land Use/Land Cover¶

Watersheds/Subwatersheds¶

Root restricting layer depth¶

Plant available water content (PAWC)¶

ISRIC SoilGrids 2017 AWC data¶

Other data sources¶

Root depth¶

Consumptive water use¶

Hydropower Station Information¶

Z parameter¶

References¶

Precipitation ¶

Reference Evapotranspiration ¶

Kc ¶

Land Use/Land Cover ¶

Watersheds/Subwatersheds ¶