Monday, August 24, 2015

Census 1970

The census data for 1970 is now completely translated into my XML structure, for all of the publications like the Whole Thailand report I have transcribed table one containing the population and household numbers for each district subdivided by municipal, sanitary districts and non-municipal areas. Of course the publications contain much more data, but that's so much special interest only I have not worked on it at all. Actually, the only other data I have prepared in the XML schema so far is the age distribution, so I might add that later.

Though this XML should be the easiest way to store the data for any further processing by software, the Excel sheets I created as an intermediate step to represent exactly what is visible in the publications are probably more accessible for non-programmers. I collected them all in one archive. I also compiled the downloads from the statistics office into PDF files, however these are around 10 MB each. Due to this, as well as to avoid copyright problems, there's no download link for these, but if you need one or many you can simply ask me by email.

Working through the tables was rather straightforward, there was only one district which was missing - Na Haeo in Loei is not in the list though was created in February 1970 as a minor district, thus before the reference date of April 1st 1970. In a few cases, the type - minor district (King Amphoe, กิ่งอำเภอ) or full district (Amphoe, อำเภอ) - was wrong, but there were also a few where I had mistakes about the actual dates in my data and only spotted them when looking more closely now.

The only cases where the actual numbers are inconsistent are for two province.
  • Surin - the household numbers for private and collective households are wrong by one for sanitary and the nonmunicipal areas.
  • Thonburi - the total population according to the publication on Thonburi was 920,033, whereas the publication on the whole kingdom lists 920,058 - seven women and 18 men more. Also, the sum of the numbers for the districts also return the value of 920,033, so these two documents contradict.
I have also already finished the data for 1960, and I am currently working on 1980 - which however will take much longer than the previous two census as there are both more districts as well as the data is more detailed. I am also continuously updating my spreadsheet with the total population numbers for each district.

No comments: