# Distant Reading – 2021-09-09 ## Example Project - **Project name**: DISTANT READING FU - **Project ID**: TGPR-f5544495-e47d-4864-0646-5ec546981760 ## Metadata The basic metadata is covered by the TextGrid Metadata schema in *Edition* and *Work* metadata, all additional project specific metadata is covered by the metadata added to the works. Please see the following two examples: - **Edition metadata**: - **Work metadata**: Technically, there are two parts of metadata. Metadata that can be searched using facets, and metadata that can not. ## Browsing the Project All objects of this project are displayed on this README.md page. Browsing in the TextGrid Repository can be done via setting the project filter, such as: ## Search for Regular Metadata (Facets Possible) ### 1. language --> edition.language:"eng" - [eng](/search?query=edition.language%3A%22eng%22&filter=project.value%3ADISTANT+READING+FU&order=relevance&limit=10) - [de](/search?query=edition.language%3A%22deu%22&filter=project.value%3ADISTANT+READING+FU&order=relevance&limit=10) ### 2. timeSlot --> work.temporal.id.value:"timeSlot" AND work.temporal.value:"T3" (in ONE tag!) - [work.temporal.id.value:"timeSlot" AND work.temporal.value:"T3"](/search?query=work.temporal.id.value%3A"timeSlot"++AND+work.temporal.value%3A"T3"&order=relevance&limit=10&filter=project.value%3ADISTANT+READING+FU!) ### firstEdition --> work.dateOfCreation.value:"2000" - [work.dateOfCreation.value:"2000"](/search?query=work.dateOfCreation.value%3A%222000%22&filter=project.value%3ADISTANT+READING+FU!&order=relevance&limit=34) - [work.dateOfCreation.value:"1840"](/search?query=work.dateOfCreation.value%3A%221840%22&filter=project.value%3ADISTANT+READING+FU!&order=relevance&limit=34) ## Search for Project Specific Metadata (Simple Search Possible) ### authorGender --> work.subject.id.value:"authorGender" AND work.subject.value:"male" (in ONE tag!) - [work.subject.id.value:"authorGender" AND work.subject.value:"male"](/search?query=+work.subject.id.value%3A%22authorGender%22+AND+work.subject.value%3A%22male%22&filter=project.value%3ADISTANT+READING+FU!) ### size --> work.subject.id.value:"size" AND work.subject.value:"medium" (in ONE tag!) - [work.subject.id.value:"size" AND work.subject.value:"medium" ](/search?query=work.subject.id.value:%22size%22%20AND%20work.subject.value:%22medium%22&filter=project.value%3ADISTANT+READING+FU!) ### 6. reprintCount --> work.subject.id.value:"reprintCount" AND work.subject.value:"high" (in ONE tag!) - [work.subject.id.value:"reprintCount" AND work.subject.value:"high"](/search?query=work.subject.id.value%3A%22reprintCount%22+AND+work.subject.value%3A%22high%22&order=relevance&limit=10&filter=project.value%3ADISTANT+READING+FU!) ## Statistics | | id | value | | ------------- | ------------- | ---------- | | work.temporal | [X][stat1] | [X][stat2] | | work.subject | [X][stat3] | [X][stat4] | [stat1]: http://dev.textgridlab.org/1.0/tgsearch-public/facet?facet=work.temporal.id.value [stat2]: http://dev.textgridlab.org/1.0/tgsearch-public/facet?facet=work.temporal.value [stat3]: http://dev.textgridlab.org/1.0/tgsearch-public/facet?facet=work.subject.id.value [stat4]: http://dev.textgridlab.org/1.0/tgsearch-public/facet?facet=work.subject.value ## Issues **1** No editions shown under works: - TODO Ubbo: Maybe we need to run the "work-edition-item-connection" script more often (ewmd), should be automated! **2** Is the subject metadata mapping implemented also for collections? we only would need editons here, should we implement both? - TODO Ubbo/Max: Check if we have implemented both - Max: We have implemented that also in collections **3** How to search in general for 2., 4., 5., and 6.? - TODO Ubbo/Max - Max: What do you mean by "general"? **4** [work.subject.id.value:"authorGender"] is working, but [work.subject.id:"authorGender"] not - TODO Ubbo/Max: check the "value", do we really need it in the tgrep search? if not, mask it everywhere - Max: We need this! "value" is not a general reference to a value. In this cases "value" is the name of the field we want to search **5** [work.subject.value.value:"male"] not working, a subject tag problem? There is an error here anyway! - TODO Ubbo/Max: Check https://dev.textgridrep.org/search?query=%5Bwork.subject.value.value%3A%22male%22%5D&order=relevance&limit=10&filter=project.value%3ADISTANT+READING+FU - Max: This doesn't work because of the second ".value". work.subject.value:"male" should work More and up-to-date issues please find in Gitlab: