large dataset new

0.0(0)
Studied by 0 people
call kaiCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/46

encourage image

There's no tags or description

Looks like no tags are added yet.

Last updated 5:28 PM on 6/15/26
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No analytics yet

Send a link to your students to track their progress

47 Terms

1
New cards

Regression gradient units

The units of the gradient are units of y divided by units of x. For rainfall y against sunshine x, this is mm per hour.

2
New cards

Large data set locations

The UK weather stations are Camborne, Hurn, Heathrow, Leeming and Leuchars. Camborne, Hurn and Heathrow are in the south. Leeming and Leuchars are in the north. Camborne, Hurn and Leuchars are coastal.

3
New cards

International LDS locations

The international locations are Jacksonville in the USA, Beijing in China and Perth in Australia. Perth is the only southern hemisphere location.

4
New cards

LDS years and months

The large data set records data for 1987 and 2015, from May to October only.

5
New cards

Why the LDS may not represent the whole year

The LDS only covers May to October, so winter is missing. Winter is usually colder, wetter, darker and windier, so the LDS may not represent the whole year.

6
New cards

UK vs international variables

The international locations only have daily mean temperature, daily total rainfall, daily mean pressure and daily mean wind speed. The UK locations have these plus extra variables such as gusts, wind direction, sunshine, humidity, cloud cover and visibility.

7
New cards

Daily mean temperature units

Daily mean temperature is measured in degrees Celsius and rounded to 1 decimal place.

8
New cards

Daily mean temperature trends

Temperature is generally higher in summer. In the UK, northern stations tend to be colder. Perth has reversed seasons because it is in the southern hemisphere.

9
New cards

Perth in the large data set

Perth is in Australia, so it is in the southern hemisphere and its seasons are reversed compared with the UK. June, July and August are winter months for Perth.

10
New cards

Jacksonville large data set

Jacksonville is in Florida, USA, and there were hurricanes affecting the October 1987 and October 2015 data.

11
New cards

Beijing temperature outliers

For Beijing in the large data set, unusually low daily mean temperatures are likely to be from October, because the LDS only covers May to October and October is the coldest month in that range.

12
New cards

Daily total rainfall

Daily total rainfall is measured in millimetres and includes all precipitation, such as rain, snow and sleet.

13
New cards

Trace rainfall

“tr” means trace rainfall, which is less than 0.05 mm. To clean the data, you can replace tr with 0, which slightly underestimates the mean, or use 0.025 for a midpoint estimate.

14
New cards

Rainfall normal model suitability

Rainfall is often not suitable for a normal model because it is skewed and has lots of zero values.

15
New cards

NA values in the LDS

NA means the value is not available or was not recorded. Ignore NA values when doing calculations.

16
New cards

Daily mean wind speed

Daily mean wind speed is measured in knots. UK values are integers, while international values may be decimals.

17
New cards

Maximum gust units

Maximum gust is measured in knots. One knot is approximately 1.15 miles per hour.

18
New cards

Daily maximum gust

Daily maximum gust is the highest wind speed recorded that day. It is measured in knots and is only recorded for UK stations.

19
New cards

Wind speed trends

Wind speed is usually lower in summer and higher in autumn. Coastal UK locations are usually windier.

20
New cards

Beaufort scale

The Beaufort scale converts wind speed into categories such as light, moderate and fresh. It is qualitative/categorical, so it is not normally suitable for a normal model.

21
New cards

In the large data set, what does wind direction mean?

Wind direction means the direction the wind is coming from, not where it is going.

22
New cards

Wind direction bearing

Wind direction bearings are measured clockwise from north and show where the wind is coming from.

23
New cards

Wind direction categories

There are 16 compass directions for wind direction, e.g. N, NNE, NE, ENE, etc. If all are equally likely, each has probability 1/16.

24
New cards

Distribution for equally likely wind directions

Use a discrete uniform distribution, because wind direction is categorical with a fixed set of equally likely outcomes.

25
New cards

Mean wind direction and gust direction

Mean wind direction and gust direction are only recorded for UK stations. They can be given as bearings or compass directions, and they are usually similar.

26
New cards

Pressure units

Pressure is measured in hectopascals, written as hPa. Do not write kPa.

27
New cards

Pressure range in the LDS

Daily mean pressure is measured in hPa and is usually around 988 to 1038 hPa.

28
New cards

How do you handle coded data where y = x - 1010

For the mean, decode after finding the coded mean: x mean = y mean + 1010. For standard deviation, adding or subtracting a constant does not change it.

29
New cards

Pressure systems and wind direction

For pressure systems, high pressure circulates clockwise and low pressure circulates anticlockwise.

30
New cards

Daily total sunshine

Daily total sunshine is measured in hours and rounded to 1 decimal place. It means hours of bright sunshine, not daylight hours.

31
New cards

Sunshine trends

There is usually more sunshine in summer and less sunshine further north in the UK.

32
New cards

Relative humidity

Daily maximum relative humidity is measured as a percentage, usually between 80% and 100%.

33
New cards

High humidity

Humidity above about 95% can lead to mist or fog.

34
New cards

Cloud cover units

Cloud cover is measured in oktas, using integer values from 0 to 8, so it is discrete.

35
New cards

Cloud cover meaning

Cloud cover is measured in oktas from 0 to 8. 0 means no cloud, 8 means complete cloud cover. It must be an integer.

36
New cards

LDS visibility

Visibility means the greatest distance at which an object can be seen and recognised in daylight. It is recorded in decametres, so 1100 means 11,000 m or 11 km.

37
New cards

Visibility rounding

Visibility is measured in decametres and is usually rounded to the nearest 100 decametres.

38
New cards

Great Storm large data set

The Great Storm in the UK was on 15 to 16 October 1987, so October 1987 wind and gust values may be unusually high.

39
New cards

1987 vs 2015 temperature trend

Daily mean temperature is generally higher in 2015 than in 1987, which may be linked to global warming.

40
New cards

Using LDS data for weather models

The LDS may not be reliable for general predictions because it only uses certain months, certain years and specific locations.

41
New cards

Identifying LDS variables by range

Use the range and units to identify variables. Temperature is in Celsius, rainfall is in mm, pressure is around 1000 hPa, cloud cover is 0 to 8, humidity is a percentage, and visibility has large values in decametres.

42
New cards

Normal model suitability for LDS variables

To say a variable is not suitable for a normal model, give a distribution-shape reason, not just “it is discrete”. Good examples: rainfall is skewed with lots of zeroes, date is uniform, Beaufort is qualitative, or pressure/wind speed may be skewed or not symmetric.

43
New cards

Refining a uniform LDS model

If a uniform model is not suitable, refine it by using a non-uniform discrete distribution based on observed frequencies. Mention context, such as variation by month or location.

44
New cards

Cleaning LDS data

Cleaning the data means dealing with trace and NA values. Trace can be replaced with 0 or 0.025, and NA values should be ignored.

45
New cards

Seasonal trends in the LDS

For northern hemisphere locations, summer is generally hotter, brighter, drier and less windy. Autumn is usually windier.

46
New cards

UK regional trends in the LDS

In the UK, northern stations tend to be colder, wetter and darker. Coastal stations tend to be windier.

47
New cards