watex.datasets.load_semien

watex.datasets.load_semien(*, as_frame=True, index_rhoa=0, tag=None, data_names=None, **kws)

DC vertical electrical sounding (resistivity) data collected from SEMIEN during the National Drinking Water Supply Program (PNAEP), carried out in 2014 in Cote d’Ivoire. The data are arranged as illustrated below:

AB/2    MN/2    SE1    SE2    SE…
1       0.4     107    93     75
2       0.4     97     91     49
100     10      79     96     98
110     10      84     104    104

Parameters
  • as_frame (bool, default=True) –

    If True, the data is returned as a pandas DataFrame whose columns have appropriate (numeric) types. The target is a pandas DataFrame or Series, depending on the number of target columns. If False, a Boxspace dictionary-like object is returned, with the following attributes:

    • data: {ndarray, dataframe} of shape (33, 5)

      The data matrix. If as_frame=True, data will be a pandas DataFrame.

    • resistivity: {array-like} of shape (33,)

      The resistivity of the sounding point.

    • MN: {array-like} of shape (33,)

      The potential-electrode spacing at each sounding step, in meters (increasing).

    • AB: {array-like} of shape (33,)

      The current-electrode spacing at each sounding step, in meters (increasing).

    • feature_names: list

      The names of the dataset columns.

      New in version 0.23.

    • DESCR: str

      The full description of the dataset.

    • filename: str

      The path to the location of the data.

      New in version 0.20.

    New in version 0.1.2.

  • index_rhoa (int, default=0) – Index of the resistivity column to retrieve. This is useful when several soundings were collected in the same survey area. index_rhoa=0 fetches the values of the first sounding in the collection.

  • tag (None,) – Always None for API consistency

  • data_names (None,) – Always None for API consistency

  • kws (dict,) – Keyword arguments passed to the _is_readable() function for parsing the data.

Returns

data – Dictionary-like object, with the following attributes:

  • data: {ndarray, dataframe}

    The data matrix. If as_frame=True, data will be a pandas DataFrame.

Return type

Boxspace
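A minimal usage sketch, assuming the watex package is installed and that the Boxspace returned with as_frame=False exposes the AB, MN and resistivity attributes exactly as listed under Parameters. The wrapper name load_first_sounding is hypothetical and is only there to keep the example importable when watex is missing.

```python
def load_first_sounding():
    """Hypothetical wrapper: return (AB, MN, resistivity) for the first
    sounding, or None when the watex package is not available."""
    try:
        # Import path and signature as documented on this page.
        from watex.datasets import load_semien
    except ImportError:
        return None
    # as_frame=False is documented to return a Boxspace object;
    # index_rhoa=0 is documented to select the first sounding (SE1).
    box = load_semien(as_frame=False, index_rhoa=0)
    # Attribute names are assumed from the attribute list in this page.
    return box.AB, box.MN, box.resistivity


result = load_first_sounding()
if result is None:
    print("watex is not installed; nothing was loaded")
else:
    ab, mn, rhoa = result
    print(len(ab), len(mn), len(rhoa))
```

Passing as_frame=True instead should, per the parameter description, give a pandas DataFrame rather than the attribute-style object used here.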

Notes

The array configuration is Schlumberger, and the maximum investigation depth is 110.0 meters for \(AB/2\) (current electrodes). The sounding steps \(AB/2\) range from 1.0 to 110.0 meters, whereas \(MN/2\) (potential electrodes) ranges from 0.4 to 10.0 meters. Three soundings were performed, labeled with the prefix ‘SE’. AB and MN are in meters, and the SE columns hold apparent resistivity values in ohm-meters. Use the index_rhoa parameter to select a sounding by its position. For instance, index_rhoa=0 fetches the first array of resistivity values (SE1).
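The positional selection described in these notes can be sketched in plain Python. This is an illustration only, not the library's implementation: the helper select_sounding is hypothetical, and ROWS holds just the sample rows shown in the illustration at the top of this page, not the full 33-row dataset.

```python
# Sample rows copied from the illustration on this page:
# (AB/2, MN/2, first sounding, second sounding, third sounding).
ROWS = [
    (1, 0.4, 107, 93, 75),
    (2, 0.4, 97, 91, 49),
    (100, 10, 79, 96, 98),
    (110, 10, 84, 104, 104),
]


def select_sounding(rows, index_rhoa=0):
    """Hypothetical helper: keep AB/2, MN/2 and one resistivity column.

    index_rhoa counts sounding columns from zero, so 0 picks the first
    sounding (SE1), 1 the second, and so on.
    """
    n_soundings = len(rows[0]) - 2  # the first two columns are spacings
    if not 0 <= index_rhoa < n_soundings:
        raise IndexError(
            f"index_rhoa must be in [0, {n_soundings - 1}], got {index_rhoa}"
        )
    return [(row[0], row[1], row[2 + index_rhoa]) for row in rows]


first = select_sounding(ROWS, index_rhoa=0)
print(first[0])  # (1, 0.4, 107)
```

Rejecting an out-of-range index explicitly is a design choice of this sketch; how load_semien itself handles such a value is not stated in this page.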