Text of entries from descriptive gazetteers, mainly 19th century

Table ID:
G_DGAZ_2015     (58064)
Contents:
Text of entries from descriptive gazetteers, mainly 19th century
Approx. number of rows:
96,142
Table type:
Raw Data
Documentation Author:
Humphrey Southall
Chronology:
The data cover the period 1881 to 1911.

Sources:

  1. This table holds entries from the following gazetteers. Unless otherwise noted, we hold ALL entries:
    • John Bartholomew's Gazetteer of the British Isles (1887).
    • John Marius Wilson's Imperial Gazetteer of England & Wales (1872).
    • Frances Groome's The Ordnance Gazetteer of Scotland (1885). This transcription was created through a collaboration with the Gazetteer for Scotland, who own the copyright.
    • Samuel Lewis's Topographical Dictionary of Ireland (1837). This gazetteer was transcribed by Derek Rowlinson of LibraryIreland, and he owns the copyright.
    • William Henry Beable's Russian Gazetteer and Guide (1919). This is a very limited transcription of a few entries for Estonia.


Notes:

  1. This table is an exact copy of the table held in the HGIS system at the end of 2015, and is intended as a permanent archive.


Indices:

IndexTypeColumn(s) indexed
g_dgaz_2015_pkey Primary key g_text_id


Constraints:

The table has the following associated constraints:

ConstraintTypeDetails
g_dgaz_2015_pkey Primary Key See details above for primary key index



Columns within table:

ColumnTypeContents
g_authority Text string (max.len.=44). Identifier for the gazetteer from which the entry is drawn, as defined in the auo.g_authority table which provides full publication details.
g_text_id Integer number. Numerical identifier uniquely identifying the entry.
g_adl_ft Text string (max.len.=44). Feature type for the entry taken from the Gazetteer Feature Type Thesaurus developed by the Alexandria Digital Library.
g_ft_text Text string (max.len.=124). The part of the actual entry text saying what kind of feature the entry is about.
g_text_len Integer number. Total length of the entry in number of characters.
g_text Long text. Text of the entry, held as HTML. NB the Edinburgh entry in Groome' gazetteer contains over 100,000 words, and many of the longer entries include statistical tables and even poems.
g_place Integer number. Identifier for the "place" the entry is about, as defined in g_place_2015.
idxfti tsvector [Column used by the Postgres free text indexing system]
non_place Text string (max.len.=6). Flag identifying entries which are not about places; for example, generic place names.
g_header Text string (max.len.=260). The first part of the actual entry text, stating the place name or names. This is our original version, extracted based on capitalisation or emboldening, depending on the particular gazetteer.
new_header Text string (max.len.=260). Our second attempt to extract the place name or names, based on extracting whatever text appeared before the text held in g_ft_text.
created_by Text string (max.len.=64). Name of the script which inserted this particular row into the original version of this table.
domain_name Text string (max.len.=14). 'Britain', 'Ireland' or 'Europe', depending on where the entry was for, and controlling which banner, canonical address etc should be used when presenting the entry online.
p_place Integer number. Wikidata Q_ID, as used in our PastPlace global gazetteer. Added lomg after the table was created.