Drug Abuse Ontology to Harness Web-Based Data for Substance Use Epidemiology Research: Ontology Development Study.

Lokala U; Lamy F; Daniulaityte R; Gaur M; Gyrard A; Thirunarayan K; Kursuncu U; Sheth A

doi:10.2196/24938

Drug Abuse Ontology to Harness Web-Based Data for Substance Use Epidemiology Research: Ontology Development Study.

Affiliations

1. AI Institute, University of South Carolina, Columbia, SC, United States.
Authors
Lokala U¹
Gaur M¹
Kursuncu U¹
Sheth A¹
(4 authors)
2. Department of Society and Health, Mahidol University, Salaya, Thailand.
Authors
Lamy F²
(1 author)
3. College of Health Solutions, Arizona State Univeristy, Phoenix, AZ, United States.
Authors
Daniulaityte R³
(1 author)
4. Department of IoT and AI, Trialog Information Technology & Services, Ile-de-France, France.
Authors
Gyrard A⁴
(1 author)
5. Department of Computer Science & Engineering, Wright State University, Dayton, OH, United States.
Authors
Thirunarayan K⁵
(1 author)

ORCIDs linked to this article

Show all (8)

JMIR Public Health and Surveillance, 23 Dec 2022, 8(12):e24938
https://doi.org/10.2196/24938 PMID: 36563032 PMCID: PMC9823583

This article is in the Europe PMC Open access subset. Refer to the copyright information in the article for licensing details.

Free full text in Europe PMC

Abstract

Background

Web-based resources and social media platforms play an increasingly important role in health-related knowledge and experience sharing. There is a growing interest in the use of these novel data sources for epidemiological surveillance of substance use behaviors and trends.

Objective

The key aims were to describe the development and application of the drug abuse ontology (DAO) as a framework for analyzing web-based and social media data to inform public health and substance use research in the following areas: determining user knowledge, attitudes, and behaviors related to nonmedical use of buprenorphine and illicitly manufactured opioids through the analysis of web forum data Prescription Drug Abuse Online Surveillance; analyzing patterns and trends of cannabis product use in the context of evolving cannabis legalization policies in the United States through analysis of Twitter and web forum data (eDrugTrends); assessing trends in the availability of novel synthetic opioids through the analysis of cryptomarket data (eDarkTrends); and analyzing COVID-19 pandemic trends in social media data related to 13 states in the United States as per Mental Health America reports.

Methods

The domain and scope of the DAO were defined using competency questions from popular ontology methodology (101 ontology development). The 101 method includes determining the domain and scope of ontology, reusing existing knowledge, enumerating important terms in ontology, defining the classes, their properties and creating instances of the classes. The quality of the ontology was evaluated using a set of tools and best practices recognized by the semantic web community and the artificial intelligence community that engage in natural language processing.

Results

The current version of the DAO comprises 315 classes, 31 relationships, and 814 instances among the classes. The ontology is flexible and can easily accommodate new concepts. The integration of the ontology with machine learning algorithms dramatically decreased the false alarm rate by adding external knowledge to the machine learning process. The ontology is recurrently updated to capture evolving concepts in different contexts and applied to analyze data related to social media and dark web marketplaces.

Conclusions

The DAO provides a powerful framework and a useful resource that can be expanded and adapted to a wide range of substance use and mental health domains to help advance big data analytics of web-based data for substance use epidemiology research.

Free full text

JMIR Public Health Surveill. 2022 Dec; 8(12): e24938.

Published online 2022 Dec 23. https://doi.org/10.2196/24938

PMCID: PMC9823583

PMID: 36563032

Drug Abuse Ontology to Harness Web-Based Data for Substance Use Epidemiology Research: Ontology Development Study

Monitoring Editor: Heather Bradley

Reviewed by Meredith Meacham, Pierre Foulquié, and Nabarun Dasgupta

Usha Lokala, MSci,¹ Francois Lamy, PhD,² Raminta Daniulaityte, PhD,³ Manas Gaur, PhD,¹ Amelie Gyrard, PhD,⁴ Krishnaprasad Thirunarayan, PhD,⁵ Ugur Kursuncu, PhD,¹ and Amit Sheth, PhD¹

¹ AI Institute, University of South Carolina, Columbia, SC, United States,

² Department of Society and Health, Mahidol University, Salaya, Thailand,

³ College of Health Solutions, Arizona State Univeristy, Phoenix, AZ, United States,

⁴ Department of IoT and AI, Trialog Information Technology & Services, Ile-de-France, France,

⁵ Department of Computer Science & Engineering, Wright State University, Dayton, OH, United States,

Usha Lokala, AI Institute, University of South Carolina, 1112 Greene St, Columbia, SC, 29208, United States, Phone: 1 803 777 9707, Email: ude.cs.liame@alakoln.

Author information Article notes Copyright and License information Disclaimer

This article has been cited by other articles in PMC.

Associated Data

Supplementary Materials: Multimedia Appendix 1.
Glossary of terms used in this paper.
publichealth_v8i12e24938_app1.docx (16K)

Abstract

Background

Objective

Methods

Results

Conclusions

Keywords: ontology, knowledge graph, semantic web, illicit drugs, cryptomarket, social media

Introduction

Background

Illicit drug use is a complex social phenomenon generating a variety of public health issues that affect individuals and their communities. In its 2020 report, the United Nations Office on Drugs and Crime estimated that 5.4% of the world population used illicit drugs in 2018 while 0.7% of the whole population is affected by substance use disorder [1]. Individuals affected by substance use disorder are at risk of experiencing a variety of adverse psychiatric and physical health effects such as unintentional overdoses or disease infections (eg, HIV and hepatitis C). Individual drug use also potentially impacts the well-being of others, affecting local communities and neighborhoods [2], which in turn creates the contextual conditions and social determinants linked to individual drug use initiation [3]. Although cannabis remains by far the most consumed illicit drug with more potent forms potentially linked to adverse consequences [4], opioid and amphetamine-type drugs remain more frequently associated with psychiatric and physical harms [5].

Although illicit substance use represents an endemic phenomenon affecting modern societies, recent years have seen radical and rapid changes in terms of the variety of substances available, the growing role played by the internet, and the decriminalization or legalization of several illicit substances in an increasing number of countries. For example, the European Monitoring Centre for Drugs and Drug Addiction has identified and listed approximately 400 novel psychoactive substances since 2015 [6], while cryptomarkets located on the dark net have become increasingly important platforms for the distribution of novel psychoactive substances and other illicit or prescription drugs [7,8]. These changes call for more timely methods of data collection, allowing the monitoring of both demand and supply sides. In this ever-changing environment, user-generated content on illicit drug use shared on social media represents a rich source of unsolicited and unfiltered self-disclosures of attitudes and practices related to substance use [9]. Furthermore, web-based sources of distribution can be harnessed to provide updates on the illicit drug supply trade and new trends [10].

These unfiltered web-based communications and advertisements offer a rich source of data sensitive to changing and emerging drug use trends, and can be used to complement and enhance existing epidemiological surveillance systems.

Semantic web-based approaches play a key role in enhancing and improving big data analytics for such complex domains as substance use. The semantic web is an extension of the web in which a set of design principles and technologies have been created to capture the meaning of information [11]. An ontology is defined as a specification of shared concepts and relationships among them, consisting of a schema and a knowledge base of instances [12].

Ontologies also play key roles in the development of (1) semantic web applications, (2) semantic annotation of data, and (3) tools for querying and reasoning [13]. However, to apply semantic web tools effectively, there is a need for a domain-specific ontology to represent the main entities of value described in the social media posts and their relationships [14].

There has been a broad range of research developing ontologies for social media data. For instance, the work proposed by Kim et al [15] aimed to develop an ontology dedicated to obesity for investigating obesity-related social media posts and detecting sentiments, emotions, and opinions posted on specific social media. Their ontology was evaluated by mapping concepts from ontology with similar terms found in tweets related to obesity, and is only limited to 8 superclasses related to broader perspectives of any biomedical ontology. This study is limited to social media posts for improving upon the ontology, and the keywords are vastly distributed among the top 2 obesity types (abdomen and thigh) and top 3 management types (diet, exercise, and drug therapy) and are only limited to the general population in social media.

There are fewer ontologies related to the domain of mental health. For example, Jung et al. [16] proposed to design an ontology using an entity-attribute-value triplet data model dedicated to adolescent depression in order to analyze related social media. This ontology was developed using clinical guidelines and unstructured social media posts with 777 terms divided into risk factors, signs and symptoms, screening, diagnosis, treatment, and prevention. This work is mainly limited to the extraction of data solely from adolescent depression-related social media posts.

Several prior ontologies were developed for the analysis of the prescription drug domain. For example, the prescription drugs ontology [17] aims at improving the semantics of drug prescriptions and prospectively enabling the interoperability of prescription data by reusing classes and object properties from the information artifact ontology [18], the ontology for biomedical investigations [19], the ontology for general medical science [20], the ontology for medically related social entities [21], and the drug ontology [22]. However, these ontologies focus on medical uses of prescribed drugs and do not include concepts or slang terms related to the use of illicit drugs and addiction.

As the opioid crisis has deepened in recent years, efforts to analyze the opioid research on social media and make policy decisions have intensified. In a recent study, a specific knowledge graph called Opioid Drug Knowledge Graph (ODKG) [23] was developed to capture opioid-related drugs and related entities in eHealth records. As the drug abuse ontology (DAO) also contains information about opioid-related drugs, we compared the ODKG and DAO in terms of their coverage of relevant entities in opioid-related social media corpus (Twitter) and observed that the DAO outperformed the ODKG by order of magnitude. As the DAO was designed to also cover slang terms that are common in social media, it performed well by retrieving 7 million more tweets than the ODKG (2 million) from a resource of 1.2 billion crawled tweets during the COVID-19 pandemic [24].

The key aims of this paper were to describe the process of development, evaluation, and application of the DAO to facilitate and enhance social media and web-based analytics for substance use epidemiology research. This paper describes the process of DAO development in the context of 4 research projects out of which 3 are National Institutes of Health (NIH)–sponsored studies that aimed to harness web-based and social media data for substance use epidemiology research: (1) Prescription Drug Abuse Online Surveillance (PREDOSE) project that aimed to characterize user knowledge, attitudes, and behaviors related to nonmedical use of buprenorphine and other illicitly manufactured opioids through the analysis of web forum data [25-27]; (2) eDrugTrends project that focused on patterns and trends of cannabis product use in the context of evolving cannabis legalization policies in the United States through the analysis of Twitter and web forum data [28-32]; (3) eDarkTrends project that aimed to identify availability trends of novel synthetic opioids through the analysis of crypto market data [33-35]; and (4) COVID-19 pandemic trends in social media data related to 13 states in the United States and its mental health impact.

The terminology related to machine learning (ML), natural language processing (NLP), and ontology design used in this paper is organized alphabetically in Textbox 1.

Descriptions of machine learning (ML), natural language processing (NLP), and ontology terms used in this paper.

Terminology and description

101 ontology [36]: the 101 ontology is a guideline to create an ontology and offers step by step process. It leverages the authors’ experiences developing and maintaining ontologies in several ontology environments like Protégé.
Bootstrap and bagged random Forest with contextual features (BRF-CF): Random Forest is one of the most popular ML algorithms. It is a type of ensemble ML algorithm called bootstrap or bagging.
Class, data property, individual count: these terms are used as the signatures for the imports closure of the active ontology. In other words, the number of distinct classes, object properties, data properties, and individuals are mentioned in the ontology. The numbers here include built-in entities, such as owl: Thing if they are explicitly mentioned in the ontology.
Community Ontology Repository [37]: this is the repository of ontologies hosted by Earth Science Information Partner’s members that would let users try out semantic technologies, understand their benefits, and explore possible applications that used semantic resources.
Depression and drug abuse BERT: BERT is a bidirectional encoder representations from transformers and is a transformer-based ML technique for NLP. We fine-tune BERT models on corpora that are representative of depression and drug abuse.
DBpedia [38]: DBpedia is a crowd-sourced community effort to extract structured content from the information created in various Wikipedia projects.
Diagnostic and Statistical Manual for Mental Disorders (DSM)-5: It is the taxonomic and diagnostic manual developed and published by the American Psychiatric Association. It is an authoritative guide for mental health care professionals in the diagnosis of mental disorders.
Entity, concept: the entity is referred to as an encompassing concept for classes, individuals, and properties. Concept and class are simply synonyms.
F1 score: It is the weighted average of precision and recall. This score takes both false positives and false negatives into account. F1 is usually more useful than accuracy score.
False positive, true positive: a false alarm is also known as a false positive. A false positive is a result that indicates a given condition exists when it does not. For example, the model indicates that cannabis can cause pain when it does not cause pain. A true positive is an outcome where the model correctly predicts the positive class. Similarly, a true negative is an outcome where the model correctly predicts the negative class. A false positive is an outcome where the model incorrectly predicts the positive class.
Horizontal linguistic features, vertical linguistic features, fine-grained features: while training an ML model, we organized our feature set into 3 broad groups: horizontal linguistic features, vertical linguistic features, and fine-grained features. Contextual Features (or embedding of a social media post) with Modulations (CFwM) and without Modulations (CFw/oM) are 2 additional feature set created using Word2Vec.
Ontology metrics [39]: the metrics list the numbers for structures and representation of ontology in Protégé as it is the most widely used tool to create an ontology. Axioms associate class and properties and are a combination of logical and nonlogical attributes. The number of distinct classes, object properties, data properties, and individuals reported is focused on the evaluation of the structure of DAO.
Oops (ontology pitfall scanner), vapor, triple checker [40]: these are Semantic Web (SemWeb) validation or documentation tools that help to improve ontologies. Oops detect common pitfalls in ontology automatically and provide recommendations to fix them.
Owl file: the W3C web Ontology Language is a SemWeb language designed to represent rich and complex knowledge about things, groups of things, and relations between things.
PerfectO methodology [40]: PerfectO references, classifies, and provides tools to encourage SemWeb best practices to achieve semantic interoperability by focusing on ontology improvement.
Precision, recall: precision is the proportion of times that when you predict it is positive and it actually turns out to be positive, whereas recall is like accuracy over just the positives—it is the proportion of times you labeled positive correctly over the number of times it was actually positive.
Protégé: protégé is a free, open-source ontology editor and framework for building intelligent systems.
SEDO [41]: It stands for Semantic Encoding and Decoding Optimization. It is a procedure to modulate the word embedding (vectors) of a word. SEDO modulates the embeddings of each word in the Reddit content of the user based on the proximity of the word to the Diagnostic and Statistical Manual for Mental Disorders-5th edition category.
Vanilla BERT: Vanilla BERT is a variation of the attention-based BERT model and provides a pretrained starting point layer for neural networks.
WebVOWL [42]: It is a web application for the interactive visualization of ontologies which is one of the ontology visual representations.

Evolution of the DAO

As social media and other web resources play an increasingly important role in health-related knowledge and experience sharing [43], there is a need for an ontology explicitly dedicated to the domain of substance use research. The DAO was developed to formalize concepts, entities, and relationships relevant to the domains of addictions and mental health to harness its use on social media data. Our approach, built on the integration of semantic web technologies, enhances traditional ML and NLP techniques for automatic extraction and representation of relevant data and facilitates analysis and interpretation related to the specific goals of each study.

Prescription Drug Abuse Online Surveillance

This study focuses on web forum data related to the nonmedical use of buprenorphine [26,27] approved in late 2002 by the United States Food and Drug Administration for the treatment of opioid addiction. Use of buprenorphine was defined as nonprescribed when used without medical supervision. Although there is always a level of uncertainty in disambiguating prescribed versus nonprescribed use in web-based discussions, some of the questions and practices shared by individuals provided indicators about nonprescribed use (eg, saying that Suboxone was obtained from a friend; that bupe was snorted; or that it was cut up and used in smaller amounts). Buprenorphine (Suboxone, Subutex, etc) is the only controlled substance that may be prescribed for the treatment of opioid addiction by a licensed physician in an office-based setting. The overall purpose of PREDOSE was to study user-generated web forum discussions about the illicit use of Suboxone (buprenorphine or naloxone), Subutex (buprenorphine), and other buprenorphine products by applying novel information processing techniques to facilitate qualitative and quantitative analysis [26]. Along with Twitter and Reddit, we also used 3 web forums that provided venues for people to freely share drug use experiences and post questions, comments, and opinions about different drugs. One of these web forums used in our research was Bluelight [44] (please note that in compliance with Institutional Review Board guidelines at Wright State University, the names of the other 2 forums have not been disclosed in this paper). Our team has developed a research collaboration with the Bluelight team and was able to obtain deidentified data updates directly from Bluelight. Data from these forums were collected using custom-built web crawlers. We chose to study buprenorphine because there was at that time (2011-2012) a growing body of evidence that buprenorphine was used and that there was relatively little knowledge about the patterns and trends of its nonmedical use in the United States. As buprenorphine use is linked to a broader domain of illicit opioid use and addiction, the initial versions of the DAO included detailed representation of the opioid class drugs, including slang and brand name terminology. The DAO developed for the PREDOSE project also included other classes of drugs, such as cannabis and stimulant-type drugs, because polysubstance use is common among illicit opioid users. Figure 1 [26] demonstrates the use of the DAO ontology within our PREDOSE architecture, which comprises three main modules:

An external file that holds a picture, illustration, etc.
Object name is publichealth_v8i12e24938_fig1.jpg

Figure 1

Use of the drug abuse ontology within Prescription Drug Abuse Online Surveillance (PREDOSE). RDF: Resource Description Framework.

Data collection module that collected approximately 1 million posts (1,066,502) from 35,974 users.
Automatic coding module that semantically annotated the posts using the DAO ontology.
Data analysis and interpretation module to visualize the keywords (eg, loperamide and buprenorphine) found within posts and referenced within the DAO ontology.

eDrugTrends

This is our second project that received funding from NIH and National Institute on Drug Abuse (NIDA) in 2014 [45]. This study focused on social media data related to cannabis and synthetic cannabinoid use in the context of evolving cannabis legalization policies in the United States. The aim of this study was to develop eDrugTrends, a comprehensive software platform for semiautomated processing and visualization of thematic, sentiment, spatiotemporal, and social network dimensions of social media data (Twitter and web forums) on cannabis and synthetic cannabinoid use. The study also aimed to (1) identify and compare trends in knowledge, attitudes, and behaviors related to cannabis and synthetic cannabinoid use across United States regions with different cannabis legalization policies using Twitter and web forum data and (2) analyze social network characteristics and identify key influencers in cannabis and synthetic cannabinoid–related discussions on Twitter. For addressing these aims of the eDrugTrends platform, the DAO was expanded further to include a more comprehensive representation of emerging cannabis products, synthetic cannabinoid products, health-related consequences, and mental health conditions.

eDarkTrends

This is the third project using the DAO. This study was funded through the NIH and NIDA time-sensitive mechanism [46], which started in 2017. The eDarkTrends project was orientated toward novel synthetic opioids, such as illicitly manufactured fentanyl that have emerged over the past few years and were and still are significant contributors to the increase in unintentional opioid-related overdose mortality in the United States [35,47,48]. However, epidemiological surveillance on cryptomarket data was limited at the time (2017). The study’s overall goal was to harness cryptomarket data to conduct surveillance of illicit fentanyl, fentanyl analog, and other novel synthetic opioid availability trends over time and identify new substances as they emerge in the Darknet environment. Ultimately, eDarkTrends aimed at providing a powerful tool for epidemiological surveillance, enhancing the capacities of early warning systems to capture changes in the fentanyl and other illicit synthetic opioid supply and availability. For addressing the specific needs of the project, the DAO was further expanded to include a comprehensive and detailed representation of novel illicit synthetic opioid domains (eg, carfentanil, furanyl fentanyl, U-47700, and MT-45).

COVID-19 Pandemic

In addition, we applied the DAO on COVID-19 social media data analysis to analyze the social media data related to the pandemic. The intent is that the COVID-19 pandemic has alleviated community-wide depression and has led to increased drug use [49]. The impact of the COVID-19 pandemic on mental health was investigated in recent studies [50-52]. For this, we proposed a novel framework for assessing the spatiotemporal-thematic progression of depression, drug use, and informativeness of the underlying news content across different states in the United States [53]. The DAO is used along with the Medical Subject Headings terms hierarchy in the Unified Medical Language System, the Diagnostic and Statistical Manual for Mental Disorders-5th edition (DSM-5) lexicon [41], which are collectively referred to as the Mental Health and Drug Abuse Knowledge base (MHDA-Kb) to spot additional entities.

Methods

Overview

The ontology was manually developed by the domain expert coauthors (FL and RD), who used a range of sources, including (1) key epidemiological data sources and reports accessible through the NIDA [54], Drug Enforcement Agency [55], European Monitoring Centre for Drugs Addiction [56], and RxNorm [57]; (2) prior peer-reviewed publications related to illicitly manufactured opioids, cannabis, and other drugs [58-61]; and (3) ongoing manual assessment and examination of web-based social media sources related to selected substances [25,27,62]. Sources of types 1 and 2 provided primary concepts while sources of type 3 were important in identifying alternative concepts, including synonyms and street names. To develop the DAO, we followed the well-known 101 ontology development methodology [63]. The 101 method includes (1) determining the domain and scope of ontology, (2) reusing existing knowledge, (3) enumerating important terms in ontology, and (4) defining the classes and their properties and creating instances of the classes.

Design

Figure 2 provides an overview of the DAO ontology. Protégé [64], a popular ontology editor, was used to build the ontology as a tree of subclasses. The ontology was designed as a catalog of concepts related to substance use. Hence, classes of psychoactive substances (eg, cannabinoids and opioids) were created and populated with subtypes of substances (eg, morphine and fentanyl). Each substance was defined by its name and, when applicable, information regarding its pharmaceutical or brand name (has_brand_name), slang or street name (has_street_name), and chemical designation (has_chemical_formula) were added. This latter information was collected through different sources: pharmaceutical or brand names were based on existing medical or pharmacological dictionaries, slang or street names were based on the domain knowledge of the second and third authors (RD and FL), and chemical designations mostly concerned synthetic cannabinoid receptor agonists and were based on academic literature as well as on seizure data (eg, the National Forensic Laboratory Information System and Europol). The DAO was also enhanced with concepts and slang terms related to those concepts regarding unit (eg, caps, ml, and bottle), purity, and form of preparation (eg, crush and eyeball) to enable the identification and analysis of triple in text content [65]. For example, one instance of the drug Morphine is Poppy_Tea, which has the slang terms Pod and Poppy_Pods used on social media.

An external file that holds a picture, illustration, etc.
Object name is publichealth_v8i12e24938_fig2.jpg

Figure 2

Drug abuse ontology in Protégé (concepts, object properties, data properties, and instances).

Instantiation

This is defined as creating instances of classes in a hierarchy. The instance of a class has its own class and fills a value. The instance has its own properties. For example, Fentanyl belongs to the class Opioid and has its own properties such as has_brand_name, has_synonym, has_slang_term, etc. The DAO ontology reuses instances from the DBpedia data set [66] (eg, buprenorphine). Figure 3 is the WebVOWL (web application for the interactive visualization of ontologies) representation of the DAO focused on the entity Cannabis derived from the visual data web [67]. Figure 2 shows the tree of drug names implemented as a web ontology format (owl) file within the DAO ontology. In Figure 2, entities, object properties, instances, and data properties are represented in yellow, green, and purple tags, respectively, which clearly depict the nature of classes, instances, hierarchies, and relationships for each entity.

An external file that holds a picture, illustration, etc.
Object name is publichealth_v8i12e24938_fig3.jpg

Figure 3

Web-based visualization of OWL ontologies (WebVOWL) representation of the drug abuse ontology, focused on the cannabis concept. RCS-4: 1-pentyl-3-(4-methoxybenzoyl)indole.

Ethics Approval

This research is done in compliance with institutional review board guidelines at Wright State University. The names of the selected websites have not been disclosed in this manuscript. Our project involves analysis of Twitter data that is publicly available and that has been anonymized. It does not involve any direct interaction with any individuals or their personally identifiable data. Furthermore, our data set does not include any interaction with human participants. Our data set does not contain any images as per our data use safety agreement. Thus, this study was reviewed by the Wright State University Institutional Review Board and received an exemption determination.

Results

Evaluation

The DAO ontology was evaluated following the semantic web best practices recognized by the International Semantic Web Conference Resource Track guidelines [68], which provide the following criteria: (1) impact, (2) reusability, (3) design and technical quality, and (4) availability. We have also followed the PerfectO methodology [40], which synthesizes a set of additional best practices and eases their achievements [69]. We have discussed the results of applying the following criteria to our DAO:

Impact and reusability: the DAO has been exploited in 4 scenarios, as mentioned earlier. Automatic documentation can be provided using the Live OWL documentation environment [70], and the DAO documentation is available in Community ontology repository [71].
Design, technical quality, and availability: the design of the ontology is available on the web as a graph visualization using web-based visualization of ontologies (WebVOWL) [72,73]. We improved the ontology using Oops (Ontology Pitfall Scanner) tools that automatically detect common pitfalls and provide recommendations to fix them. Oops loaded with the DAO can be tested on the web [71,74]. The Linked data validator, Vapour tool integrated with the DAO [75] was used to check dereferencing uniform resource identifier and content negotiation. Finally, Resource description framework Triple-Checker checks whether the existing ontologies have been correctly used within our DAO [76].

Ontology metrics: the DAO was also evaluated, as shown in Table 1, with respect to several ontology metrics [77]. The metrics list the numbers for the structures and representation of ontology in Protégé, as it is the most widely used tool to create ontology [78]. Axioms associate class and properties and are a combination of logical and nonlogical axioms [79]. The number of distinct classes, object properties, data properties, and individuals reported in Table 1 are focused on the evaluation of the structure of the DAO.

Table 1

Drug abuse ontology metrics: the ontology metrics view displays entity and axiom count for the axioms in the active ontology [39].

Metric		Count, n	Description
Ontology metrics
	Axioms	4876	Combined logical and nonlogical axiom count
	Logical axiom count	3478	The number of logical axioms
	Declaration axioms count	1185	The number of declaration axioms
	Class count	316	The number of distinct classes, object properties, data properties and individuals that are mentioned in the ontology
	Object property count	12	The number of distinct classes, object properties, data properties and individuals that are mentioned in the ontology
	Data property count	13	The number of distinct classes, object properties, data properties and individuals that are mentioned in the ontology
	Individual count	845	The number of distinct classes, object properties, data properties and individuals that are mentioned in the ontology
Class axiom
	SubClassOf	313	The number of SubClassOf axioms in the ontology. A subclass axiom states that a class is a subclass of another class
Individual axioms
	Data property assertion	2317	A data property assertion states that the individual is connected by the data property expression to the literal.
	ClassAssertion	830	A class assertion states that the individual is an instance of the class expression.
	AnnotationAssertion	213	An annotation assertion states that the annotation subject is an anonymous individual with the annotation property and value.

The subsequent sections demonstrate the results with the DAO in different platforms and the evolution of the DAO with each use case.

The DAO Within PREDOSE

Figure 4 [26,80] describes how the texts are automatically annotated using the DAO. In the text shown in Figure 4, we identify drug entities, dosage, time interval, route of administering the drug, etc. In the DAO, buprenorphine is defined as the subclass of Subutex and Suboxone. It has the slang terms Bupe and Bupey. The term Bupe identified in the text would not have been possible without defining it as a slang term in the DAO. The DAO is capable of mapping units (eg, mg→MILLIGRAM) and slang terms (eg, bupe— buprenorphine) based on a lexical lookup in the ontology. Similarly, other concepts, such as the route of administration injected, are also identified in the text. In NLP-related tasks, such as lexical, semantic, and syntactic analysis of textual data, adding ontology works as an external source of knowledge in identifying triples and entities in data. Conceptualizing the domain in data acts as a prior requirement for processing further information (lexicon and rule-based grammar) about it [81] (Figure 5 [80]). When evaluating 601 web forum posts with the DAO, we achieved 84.9% precision and 72.5% recall in information extraction tasks. In particular, out of 3639 annotations, 2640 were predicted correct (true positives), whereas 683 slang terms are incorrect (false positives). As far as the recall is concerned, only 999 out of 3639 annotations are missed (false negatives) [26]. For triple extraction with the DAO, we achieved 33% precision across 197 evaluated triple patterns (66 were correct and 131 were incorrect). For relation extraction with the DAO, we achieved 36% precision across 183 phrases (66 were correct and 117 were incorrect). Another finding (Figure 6 [25]) is that our analysis of web forums with the DAO revealed that loperamide was widely used as a treatment for withdrawal symptoms related to opioid addiction, where buprenorphine and methadone are commonly prescribed. A total of 3 toxicology studies following this work led to a Food and Drug Administration warning in 2016 [25,82]. A video demo [83] on the PREDOSE platform is available on the web. The PREDOSE platform indicates a need for additional enhancements in information extraction and automated data coding techniques.

An external file that holds a picture, illustration, etc.
Object name is publichealth_v8i12e24938_fig4.jpg

Figure 4

Automatic annotation of texts with the drug abuse ontology (DAO) [80].

An external file that holds a picture, illustration, etc.
Object name is publichealth_v8i12e24938_fig5.jpg

Figure 5

Benefits of ontologies with lexicons and rule-based grammar [80].

An external file that holds a picture, illustration, etc.
Object name is publichealth_v8i12e24938_fig6.jpg

Figure 6

Loperamide discovery and its use in self-medication for opioid withdrawal.

eDrugTrends (Monitoring Drug Trends on Social Media)

The eDrugTrends project aimed to analyze trends in knowledge, attitudes, and behaviors related to the use of cannabis and synthetic cannabinoids on web forums and Twitter [26,28-31]. Figure 7 [79] shows the application of the DAO ontology within the eDrugTrends architecture, which includes 4 stages: (1) data collection, (2) data processing, (3) data access tools for exploration and visualization, and (4) quantitative and qualitative analyses and interpretation. From the social science or substance use epidemiology perspective, the data processing and information extraction stages correspond with the coding task that prepares raw data for further analysis and interpretation. During data processing, the DAO came into the picture by playing an important role in identifying entities in the data that are exact names or synonyms or slang terms or street names of a drug. We generated embedding vectors using the DAO for domain-specific word embedding models and built an ML model to classify users by their types (individual, agency, and retailer) on Twitter by classifying their marijuana-related conversations [28]. We achieved this using multimodal embeddings extracted from people, content, and network views, achieving an 8% improvement over the empirical baseline [28]. We evaluated our approach using the average F1-score for each user type individual (P), informed agency (I), and retailer (R). The F1 scores for the individual classes P, I, and R were 95%, 42%, and 73%, respectively. The descriptive statistics of the training set at the Twitter user account level used for this study, which involved semantic filtering [84] using the DAO, are shown in Table 2.

An external file that holds a picture, illustration, etc.
Object name is publichealth_v8i12e24938_fig7.jpg

Figure 7

Architecture of the eDrugTrends project.

Table 2

Descriptive information of user accounts on Twitter extracted using the drug abuse ontology [28].

Features	Personal accounts	Retail accounts	Informed agency	Total
Number of tweets	9836	1928	338	12,102
Number of profile pictures	4394	476	111	4981
Number of users with description	3884	461	108	4453
Number of retweets	955	24	964	1943
Number of mentions	94	6	307	407

Enhancing the DAO With DSM-5

The motive for enhancing the DAO with DSM-5 is to provide actionable information to clinicians about the mental health of a patient in diagnostic terms for web-based interventions. We chose Reddit data for this study as the concepts, instances, and relations associated with drugs are semantically connected to mental health communications on social media, especially on Reddit. In our Reddit corpus, the drug use–related categories form a substantial portion (48%; corpus size is 2.5 million posts from 15 mental health subreddits by 268,104 users) of the data set in size. However, the DAO still lacked concepts directly related to mental health diagnostic disorders as defined in DSM-5 that are present in the International Classification of Diseases 10th edition [85], Systematized Nomenclature of Medicine-Clinical Terms [86], and DataMed [87]. In a recent study [41] on matching mental conditions of user posts on Reddit to DSM-5 diagnostic disorders, we enhanced the DAO with knowledge derived from DSM-5, which includes 20 chapters (Table 3), consistent with International Classification of Diseases 10th edition and NIH’s research domain criteria [88] for mental health. The enhanced DAO includes representations of mental health disorders and related symptoms that were developed following the DSM-5 classification [89]. For example, references for Cannabis Use Disorder include terms such as addicted to cannabis, addicted to Marijuana, and Jazz_mango addict. References to the feeling of anxiety or anxious include such terms as antsy, worried, and agitated. These lay terms were added to the DAO manually using synonym dictionaries and by manually examining Reddit conversations related to depression, anxiety, and other mental health conditions.

Table 3

Demonstration of improvement in the number of DSM-5^a category–related concepts being captured before and after including the DAO^b [41].

DSM-5 category	DSM-5–related concepts captured without the DAO, n	DSM-5–related concepts captured with the DAO, n
Dissociative disorder	20	20
Anxiety disorder	40	87
Substance use and addictive disorder	39	123
Schizophrenia spectrum	77	77
Sleep-wake disorder	14	19
Paraphilic disorders	14	14
Gender dysphoria	15	15
Neurodevelopmental disorders	25	53
Sexual dysfunctions	23	23
Personality disorders	76	98
Trauma and stressor related disorder	25	28
Disruptive, impulse, control, and conduct disorder	34	34
Psychotic disorders	85	87
Bipolar and related disorders	75	84
Elimination disorders	18	18
Depressive disorders	71	107
Obsessive-compulsive related disorder	43	60
Feeding and eating disorders	32	39
Neurocognitive disorders	80	80
Suicidal behavior or ideation	34	47

^aDSM-5: Diagnostic and Statistical Manual for Mental Disorders-5th edition.

^bDAO: drug abuse ontology.

The DAO, curated and enhanced by DSM-5 concepts, was used in a weakly supervised setting to label Reddit posts with DSM-5 categories. In a comparative analysis with the state-of-the-art research by Park and Conway [90], Saravia et al [91], and Gkotsis et al [92], we observed that expansion of the DAO with DSM-5 helped improve the accuracy of our entity identification tools (reduced false positives by 92%). These results are shown in Figure 8. We further assessed the meaningfulness of the prediction through a reliability assessment with a domain expert, which gave an agreement score of 84%. In addition, the incorporation of slang terms from the DAO to match and process the informal social media data improved both coverage and recall (Table 4). Thus, we demonstrated that semantic weighting of contextual features from the content using the DAO and DSM-5 knowledge could significantly improve the robustness of the artificial intelligence system. As web-based content is mapped to a clinically acceptable vocabulary, the system brings in explainability. Furthermore, Table 3 shows the improvement in the number of concepts extracted from the DAO being captured in our Reddit Corpus that relate to DSM-5, 20 chapters before and after adding slang terms.

An external file that holds a picture, illustration, etc.
Object name is publichealth_v8i12e24938_fig8.jpg

Figure 8

Results illustrating that domain-specific knowledge bases lower false alarm rates in identifying Diagnostic and Statistical Manual for Mental Disorders-5th edition (DSM-5) categories to tag posts in mental health subreddits. DAO: drug abuse ontology.

Table 4

Ablation study on contextual features and their modulation using SEDO^a weights generated from either DSM-5^b or its enrichment using the DAO^c and slang terms^d.

Method (with horizontal linguistic features, vertical linguistic features, and fine-grained features)	Precision	Recall	F1-score
BRF^e with CF^f	0.60	0.54	0.57
BRF-CF (SEDO weights generated from DSM-5 lexicon without the DAO)	0.87	0.77	0.82
BRF-CF (SEDO weights generated from DSM-5 lexicon with the DAO without slang terms)	0.87	0.80	0.83
BRF-CF (SEDO weights generated from DSM-5 lexicon without the DAO with slang terms)	0.85	0.82	0.83
BRF-CF (SEDO weights generated from DSM-5 lexicon with the DAO with slang terms)	0.88	0.83	0.85

^aSEDO: Semantic Encoding and Decoding Optimization.

^bDSM-5: Diagnostic and Statistical Manual for Mental Disorders-5th edition.

^cDAO: drug abuse ontology.

^dThis table demonstrates the improvement of models with the enhanced DAO.

^eBRF: balanced random forest.

^fCF: contextual features.

The base model for the ablation study is a balanced random forest with horizontal linguistic features (number of definite articles, words per post, first-person pronouns, pronouns, and subordinate conjunctions), vertical linguistic features (number of part-of-speech tags, similarity between the posts, intrasubreddit similarity, and intersubreddit similarity), and fine-grained features (sentiment, emotion, and readability scores).

eDarkTrends (Monitoring Drug Trends on Cryptomarkets)

The DAO also plays an essential role in identifying relevant entities and analyzing data from the Darknet cryptomarkets (eg, Agora, Dream Market, and Empire Market) to quantify and assess the availability of fentanyl, fentanyl analogs, and other novel synthetic opioids on the cryptomarkets [25,26]. The snapshot of the Darknet Marketplace is shown in Figure 9 [33]. The terms and slang terms associated with instances populating the DAO opioid subclass, as well as the dosage (eg, gram, mL, and ounce) and form (eg, tablet and powder) classes were compiled as regular expressions and used as expression patterns in the dedicated named entity recognition (NER) algorithm specifically designed for Darknet data [35]. The DAO was inductively augmented with abbreviations and terms specific to the cryptomarket environment (eg, fuff for fluoro-furanyl fentanyl or FE for finalize early) to ensure that only relevant data on novel synthetic opioids were collected. The NER allows capturing the types and quantities of novel synthetic opioids advertised on crypto markets; for example, the NER would provide the following information about the advertisement FENTANYL TRANSDERMAL PATCHES 100 mcg per h as class: fentanyl-type; name: fentanyl; dosage: 0.0001 g per h; form: transdermal. The results regarding the average numbers of fentanyl, fentanyl analogs, and other nonpharmaceutical synthetic opioids advertised on cryptomarkets identified are shown in Table 5. The crawls considered to obtain these results were the dark web posts collected from the Agora and Dream markets in the years 2015 and 2018 [35]. We also classified vendors on Darknet markets (Dream, Tochka, and Wall Street are the marketplaces used for this study) using the DAO. The summary of our findings related to unique vendors, substance, location, vendor descriptions, and the number of withdrawal transactions is shown in Table 6.

An external file that holds a picture, illustration, etc.
Object name is publichealth_v8i12e24938_fig9.jpg

Figure 9

Screenshot of the Darknet marketplace.

Table 5

Average number of fentanyl, fentanyl analogs, and other nonpharmaceutical synthetic opioids advertised on cryptomarkets extracted with the drug abuse ontology [34].

Types of substances			Average number of advertisements per day, by month (number of crawls)
			Agora							Dream Market
			March 2015		April 2015		May 2015		March 2018			April 2018
Fentanyl^a			130		174		139		207			216
Fentanyl analogs
	Acetyl fentanyl	44		39		41		3			1
	Butyr fentanyl	12		10		17		6			7
	Carfentanil	0		0		0		12			5
	Furanyl fentanyl	0		0		1		31			39
	Methoxy Acetyl fentanyl	0		0		0		14			14
	4-fluroIsoButyr fentanyl	0		0		0		19			16
	3-methoxyMethyl fentanyl	0		0		0		2			2
	Total, fentanyl analogs	56		49		59		87			84
Other NP^b synthetic opioids
	U-47,700	5		4		5		0			3
	W-18	5		4		5		0			0
	MT-45	9		8		9		0			0
	AH-7921	0		0		1		0			0
	U-48,800	0		0		0		1			7
	U-49,900	0		0		0		0			1
	U-4TDP	0		0		0		0			4
	U-50,488	0		0		0		8			4
	MPF-47700	0		0		0		0			5
	Total, other NP synth opioids	19		16		20		9			24
Other opioids^c			827		1061		1152		3211			3137
Total (any opioids)			1033		1300		1370		3512			3460

^aIncludes mentions of fentanyl, China white heroine, synthetic heroine, and mentions of pharmaceutical fentanyl such as Duragesic, fentanyl patches, and fentanyl transdermal system.

^bNP: nonpharmaceutical.

^cIncludes mentions of heroin, opium, morphine and other types of pharmaceutical opioids (eg, hydrocodone, oxycodone, and hydromorphone) excluding pharmaceutical fentanyl.

Table 6

Summary of data set extracted from Darknet markets using the drug abuse ontology [33].

Marketplace	Withdrawal number of transactions	Bitcoin	US $ equivalent	Unique number of vendors	Unique number of substances	Unique number of locations	Unique number of descriptions
Dream	261	99.1503695	197,589.12	1448	852	356	16,800
Tochka	2990	0.70483642	5072.33	408	313	44	1829
Wall Street	7755	2.572515	18,729.40	466	290	29	1723

COVID-19 Scenario

We performed a spatiotemporal analysis of the psychological impact of the novel COVID-19 using approximately 1.2 billion tweets from January 1 to April 10, 2020 [93,94]. The concepts related to addiction and mental health in the COVID-19–related data were semiautomatically recognized using the entities and slang terms mentioned in the DAO. Approximately 90 related concepts and 140 slang terms were used to extract tweets mentioning illicit drug use, alcoholism, and pharmacological drug misuse. Furthermore, suicide risk factors such as insomnia and depression were observed in the tweets extracted using the DAO. Similarly, we studied the negative media exposure from approximately 700,000 news articles published during the COVID-19 pandemic by fine-tuning the bidirectional encoder representations from transformers (BERT) model with the DAO [53]. The 3 months (January, February, and March) in the year 2020 were considered for our earlier study, as this period had a huge COVID-19 spread as per the Mental Health America report [95]. We used 10 of the 13 states recognized as high-spread areas in this report. The 3 states that are not included in Table 7 are Washington, Wyoming, and Idaho. These 3 states were not included, as the related data were not present in our data set cohort. In this work, we reported the state-wise labels (ie, depressive, drug abusive, and informative) for each month using deep learning models vanilla BERT, depression BERT, and drug use BERT, as shown in Table 7. The definitions of these deep learning models are described in Textbox 1. This study is followed by analyzing the Social Quality Index, which aggregates mental health components (depression and anxiety), addiction, and substance use disorders, considering tweets in the period March to April 2020. The Social Quality Index and tweets for states Illinois, New York, Maryland, Arizona, New Mexico, and Massachusetts are shown in Figure 10 [94].

Table 7

Evaluation of BERT^a models for Mental Health America states over 3 months (January, February, and March 2020) [53,94].

Mental Health America states with depression and drug use	vanillaBERT (2020; months)	Druguse-BERT (2020; months)	Depression BERT (2020; months)
Tennessee	February and March	February and March	February and March
Alabama	February	February and March	February
Oklahoma	March	February and March	February and March
Kansas	February	January and February	January and February
Montana	March	February	February and March
South Carolina	March	March	February and March
Alaska	February and March	January, February, and March	February and March
Utah	March	March	March
Oregon	None	February	None
Nevada	February	February	February

^aBERT: bidirectional encoder representations from transformers.

An external file that holds a picture, illustration, etc.
Object name is publichealth_v8i12e24938_fig10.jpg

Figure 10

Social quality index (SQI) pattern of improvement in conditions as the decline in the number of tweets on depression, addiction, and anxiety.

Discussion

Strengths and Limitations

The DAO is an ongoing project that can be continuously improved and expanded to handle additional topic areas and emerging substance use issues and trends. DAO development requires intensive, hands-on involvement of experts in the field of substance use research (domain experts). We acknowledge a limitation to our approach in that our DAO development team did not include persons with lived experiences of substance use disorders. In the future, it would be important to also involve individuals who use drugs to help develop and refine DAO sections and terms. The DAO can provide a tool and a framework for interdisciplinary collaborative teams to carry this work forward. The DAO ontology has been proven effective in several scenarios, as demonstrated in Evaluation section (Section 3). Table 8 summarizes the evolution and improvement of the ontology use according to the needs of the projects. The public health findings described in this document of associated projects, with a focus on person, place, and time, are referenced in Table 8.

Table 8

Summary of the drug abuse ontology implemented in projects.

Domain	Related publications	Manuscript section	Data type	Findings reference
Buprenorphine, loperamide, other opioids	Cameron et al [26], Daniulaityte et al [25,82]	PREDOSE^a [26]	Web forum data	Figures 4 and and55
User types in marijuana-related posts on social media	Kursuncu et al [28], Lamy et al [31]	eDrugTrends [28-31,96]	Twitter data, web forums, and Bluelight	Table 2
Depression DSM-5	Gaur et al [41]	eDrugTrends [45]	Web forums, Reddit, and Twitter	Tables 3 and and44
Fentanyl, fentanyl analogs, Clustering of dark web vendors	Usha et al [35], Kumar et al [33], Lamy et al [34]	eDarkTrends [46]	Social media and cryptomarket	Tables 5 and and66
COVID-19	Gaur et al [53,88]	COVID-19: public health study [97]	Social media	Figure 10; Table 7

^aPREDOSE: Prescription Drug Abuse Online Surveillance.

Principal Findings and Conclusions

In this study, we developed and evaluated the DAO as a framework for identifying concepts, entities, and relationships of interest in social media posts. The DAO developed in this study comprises 315 classes, 31 relationships, and 814 instances with 2 to 4 levels deeper. Our ontology was designed to study social media data, dark web data, and web forums. The DAO is primarily used for knowledge extraction and is broadly applicable to these platforms.

The superclasses of our ontology integrate all concepts regarding health conditions, individual-related, network-related, and society (public policies), sources (dealers, internet, medical, self-produced), spatiotemporal, and substance-related classes. The integrated ontology developed in this study is suitable for analyzing social media posts and dark web posts to understand network-related characteristics, location and time issues, identifying new trends, synonyms, slang items, and new drugs.

Our ontology incorporates terminology not only extracted from DSM-5 but also various terms and slang used on social media and other web posts. The terminology with all the medical terms, synonyms, and slang terms representing all the substances enabled a rich collection of terms in social media and dark web data. Our ontology also helps in topic discovery and entity extraction from social media and dark web data. In addition, we used ontology to extract information in the description of each product in dark web marketplaces to identify substances that are being sold that are not known, such as synthetic drugs, research chemicals, synthetic cannabinoids, and synthetic heroin.

Following well-known software development methodologies (eg, agile methodology), the ontology is constantly being updated according to the needs of current addiction-based research. The DAO stands as a machine-processable resource that describes a collection of addiction domain-related objects and classes, and is growing with the needs of the new ongoing projects. For instance, the current ontology is being enriched with knowledge from the dark web. In future work, the ontology will be linked to other ontologies (eg, MEDDRA [98], a Medical Dictionary for Regulatory Activities) to design the drug abuse knowledge graph. Another research contribution would be to automatically update the DAO with new concepts and properties, inspired by the algorithm that allows users to interactively build topic-specific ontologies using suggestions retrieved from a knowledge graph [99]. Glossary of the terms used in this paper is provided in Multimedia Appendix 1.

Acknowledgments

This work was supported in part by the National Institute on Drug Abuse (NIDA) grant 5R01DA039454-02 Trending: Social Media Analysis to Monitor Cannabis and Synthetic Cannabinoid Use; the National Science Foundation award 1761931 Spokes: MEDIUM: MIDWEST: Collaborative: Community-Driven Data Engineering for Substance Abuse Prevention in the Rural Midwest; the NIDA grant 5R21DA044518-02 eDarkTrends: Monitoring Darknet Markets to Track Illicit Synthetic Opioid Trends; and the National Institutes of Health grant R21 DA030571-01A1 A Study of Social Web Data on Buprenorphine Abuse using Semantic Web Technology. Any opinions, conclusions, or recommendations expressed in this material are those of the authors and do not necessarily reflect those of the National Science Foundation, National Institutes of Health, or NIDA.

Abbreviations

BERT	bidirectional encoder representations from transformers
DAO	drug abuse ontology
DSM-5	Diagnostic and Statistical Manual for Mental Disorders-5th edition
ML	machine learning
NER	named entity recognition
NIDA	National Institute on Drug Abuse
NIH	National Institutes of Health
NLP	natural language processing
ODKG	Opioid Drug Knowledge Graph
PREDOSE	Prescription Drug Abuse Online Surveillance

Multimedia Appendix 1

Glossary of terms used in this paper.

Click here to view.^{(16K, docx)}

Footnotes

Conflicts of Interest: None declared.

References

1. UNODC World Drug Report 2020: Global drug use rising; while COVID-19 has far reaching impact on global drug markets. United Nations. 2020. Jul 25, [2022-04-09]. https://www.unodc.org/unodc/press/releases/2020/June/media-advisory---global-launch-of-the-2020-world-drug-report.html .

2. Boardman JD, Finch BK, Ellison CG, Williams DR, Jackson JS. Neighborhood disadvantage, stress, and drug use among adults. J Health Soc Behav. 2001 Jun;42(2):151–65. [Abstract] [Google Scholar]

3. Spooner C, Hetherington K. Social Determinants of Drug Use. Sydney, New South Wales: University of New South Wales; 2004. [Google Scholar]

4. Hall W, Lynskey M. Assessing the public health impacts of legalizing recreational cannabis use: the US experience. World Psychiatry. 2020 Jun 11;19(2):179–86. 10.1002/wps.20735. 10.1002/wps.20735. [Europe PMC free article] [Abstract] [CrossRef] [CrossRef] [Google Scholar]

5. Ross EJ, Graham DL, Money KM, Stanwood GD. Developmental consequences of fetal exposure to drugs: what we know and what we still must learn. Neuropsychopharmacology. 2015 Jan;40(1):61–87. 10.1038/npp.2014.147. /article/MED/24938210 .npp2014147 [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]

6. European Drug Report 2019: trends and developments. European Monitoring Centre for Drugs and Drug Addiction (EMCDDA) 2019. Jun, [2021-02-15]. https://www.emcdda.europa.eu/publications/edr/trends-developments/2019_en .

7. Kruithof K, Aldridge J, Hétu DD, Sim M, Dujso E, Hoorens S. The role of the 'dark web' in the trade of illicit drugs. RAND Corporation. 2016. [2022-07-29]. https://www.rand.org/pubs/research_briefs/RB9925.html .

8. Aldridge J, Décary-Hétu D. Hidden wholesale: the drug diffusing capacity of online drug cryptomarkets. Int J Drug Policy. 2016 Sep;35:7–15. 10.1016/j.drugpo.2016.04.020. https://linkinghub.elsevier.com/retrieve/pii/S0955-3959(16)30133-5 .S0955-3959(16)30133-5 [Abstract] [CrossRef] [Google Scholar]

9. Kursuncu U, Gaur M, Lokala U, Thirunarayan K, Sheth A, Arpinar I. Emerging Research Challenges and Opportunities in Computational Social Network Analysis and Mining. Cham: Springer; 2019. Predictive analysis on Twitter: techniques and applications. [Google Scholar]

10. Soska K, Christin N. Measuring the longitudinal evolution of the online anonymous marketplace ecosystem. Proceedings of the 24th USENIX Conference on Security Symposium; SEC'15: Proceedings of the 24th USENIX Conference on Security Symposium; Aug 12 - 14, 2015; Washington, D.C. 2015. [Google Scholar]

11. Cardoso J, Sheth A. Semantic Web Services, Processes and Applications. Boston, MA: Springer; 2006. The semantic web and its applications. [Google Scholar]

12. Uschold M, Gruninger M. Ontologies and semantics for seamless connectivity. SIGMOD Rec. 2004 Dec;33(4):58–64. 10.1145/1041410.1041420. https://dl.acm.org/doi/abs/10.1145/1041410.1041420?casa_token=G-uVPZXXNIEAAAAA:xT0ERwJkvNnvvpj1WUUTO-C_bkUjN7sTdk7RJYxmeUJ-jmA9fgzOkvczRvVHsmB_PbhmKbLCqEGU . [CrossRef] [Google Scholar]

13. Grimm S, Abecker A, Völker J, Studer R. Handbook of Semantic Web Technologies. Berlin, Heidelberg: Springer; 2011. Ontologies and the semantic web. [Google Scholar]

14. Horrocks I. Ontologies and the semantic web. Commun ACM. 2008 Dec;51(12):58–67. 10.1145/1409360.1409377. [CrossRef] [Google Scholar]

15. Kim AR, Park H, Song T. Development and evaluation of an obesity ontology for social big data analysis. Healthc Inform Res. 2017 Jul;23(3):159–68. 10.4258/hir.2017.23.3.159. https://www.e-hir.org/DOIx.php?id=10.4258/hir.2017.23.3.159 . [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]

16. Jung H, Park H, Song T. Ontology-based approach to social data sentiment analysis: detection of adolescent depression signals. J Med Internet Res. 2017 Jul 24;19(7):e259. 10.2196/jmir.7452. https://www.jmir.org/2017/7/e259/ v19i7e259 [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]

17. Ethier J, Barton A, Taseen R. An ontological analysis of drug prescriptions. Applied Ontol. 2018 Nov 09;13(4):273–94. 10.3233/ao-180202. [CrossRef] [Google Scholar]

18. Ceusters W. An information artifact ontology perspective on data collections and associated representational artifacts. Stud Health Technol Inform. 2012;180:68–72. [Abstract] [Google Scholar]

19. Bandrowski A, Brinkman R, Brochhausen M, Brush MH, Bug B, Chibucos MC, Clancy K, Courtot M, Derom D, Dumontier M, Fan L, Fostel J, Fragoso G, Gibson F, Gonzalez-Beltran A, Haendel MA, He Y, Heiskanen M, Hernandez-Boussard T, Jensen M, Lin Y, Lister AL, Lord P, Malone J, Manduchi E, McGee M, Morrison N, Overton JA, Parkinson H, Peters B, Rocca-Serra P, Ruttenberg A, Sansone S, Scheuermann RH, Schober D, Smith B, Soldatova LN, Stoeckert CJ, Taylor CF, Torniai C, Turner JA, Vita R, Whetzel PL, Zheng J. The ontology for biomedical investigations. PLoS One. 2016 Apr 29;11(4):e0154556. 10.1371/journal.pone.0154556. https://dx.plos.org/10.1371/journal.pone.0154556 .PONE-D-15-55757 [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]

20. Scheuermann R, Ceusters W, Smith B. Toward an ontological treatment of disease and diagnosis. Summit Transl Bioinform. 2009 Mar 01;2009:116–20. http://europepmc.org/article/MED/21347182 . [Europe PMC free article] [Abstract] [Google Scholar]

21. Hicks A, Hanna J, Welch D, Brochhausen M, Hogan WR. The ontology of medically related social entities: recent developments. J Biomed Semantics. 2016 Jul 12;7(1):47. 10.1186/s13326-016-0087-8. https://jbiomedsem.biomedcentral.com/articles/10.1186/s13326-016-0087-8 .10.1186/s13326-016-0087-8 [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]

22. Hanna J, Joseph E, Brochhausen M, Hogan WR. Building a drug ontology based on RxNorm and other sources. J Biomed Semantics. 2013 Dec 18;4(1):44. 10.1186/2041-1480-4-44. https://jbiomedsem.biomedcentral.com/articles/10.1186/2041-1480-4-44 .2041-1480-4-44 [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]

23. Kamdar M, Hamamsy T, Shelton S, Vala A, Eftimov T, Zou J. A knowledge graph-based approach for exploring the U.S. opioid epidemic. arXiv. 2019 http://arxiv.org/abs/1905.11513 . [Google Scholar]

24. Motlagh F, Shekarpour S, Sheth A, Thirunarayan K, Raymer M. Predicting public opinion on drug legalization: social media analysis and consumption trends. Proceedings of the 2019 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining; ASONAM '19: International Conference on Advances in Social Networks Analysis and Mining; Aug 27 - 30, 2019; Vancouver British Columbia Canada. 2019. [CrossRef] [Google Scholar]

25. Daniulaityte R, Carlson R, Falck R, Cameron D, Perera S, Chen L, Sheth A. "I just wanted to tell you that loperamide WILL WORK": a web-based study of extra-medical use of loperamide. Drug Alcohol Depend. 2013 Jun 01;130(1-3):241–4. 10.1016/j.drugalcdep.2012.11.003. http://europepmc.org/article/MED/23201175 .S0376-8716(12)00429-2 [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]

26. Cameron D, Smith GA, Daniulaityte R, Sheth AP, Dave D, Chen L, Anand G, Carlson R, Watkins KZ, Falck R. PREDOSE: a semantic web platform for drug abuse epidemiology using social media. J Biomed Inform. 2013 Dec;46(6):985–97. 10.1016/j.jbi.2013.07.007. https://linkinghub.elsevier.com/retrieve/pii/S1532-0464(13)00108-1 .S1532-0464(13)00108-1 [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]

27. Daniulaityte R, Carlson R, Brigham G, Cameron D, Sheth A. "Sub is a weird drug:" a web-based study of lay attitudes about use of buprenorphine to self-treat opioid withdrawal symptoms. Am J Addict. 2015 Aug 25;24(5):403–9. 10.1111/ajad.12213. http://europepmc.org/article/MED/26009867 . [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]

28. Kursuncu U, Gaur M, Lokala U, Illendula A, Thirunarayan K, Daniulaityte R. What's ur type? Contextualized classification of user types in marijuana-related communications using compositional multiview embedding. Proceedings of the 2018 IEEE/WIC/ACM International Conference on Web Intelligence (WI); 2018 IEEE/WIC/ACM International Conference on Web Intelligence (WI); Dec 03-06, 2018; Santiago, Chile. 2018. [CrossRef] [Google Scholar]

29. Daniulaityte R, Nahhas RW, Wijeratne S, Carlson RG, Lamy FR, Martins SS, Boyer EW, Smith GA, Sheth A. "Time for dabs": analyzing Twitter data on marijuana concentrates across the U.S. Drug Alcohol Depend. 2015 Oct 01;155:307–11. 10.1016/j.drugalcdep.2015.07.1199. http://europepmc.org/article/MED/26338481 .S0376-8716(15)01604-X [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]

30. Daniulaityte R, Chen L, Lamy FR, Carlson RG, Thirunarayan K, Sheth A. "When 'Bad' is 'Good'": identifying personal communication and sentiment in drug-related Tweets. JMIR Public Health Surveill. 2016 Oct 24;2(2):e162. 10.2196/publichealth.6327. https://publichealth.jmir.org/2016/2/e162/ v2i2e162 [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]

31. Lamy FR, Daniulaityte R, Sheth A, Nahhas RW, Martins SS, Boyer EW, Carlson RG. "Those edibles hit hard": exploration of Twitter data on cannabis edibles in the U.S. Drug Alcohol Depend. 2016 Jul 01;164:64–70. 10.1016/j.drugalcdep.2016.04.029. http://europepmc.org/article/MED/27185160 .S0376-8716(16)30056-4 [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]

32. Daniulaityte R, Lamy FR, Smith GA, Nahhas RW, Carlson RG, Thirunarayan K, Martins SS, Boyer EW, Sheth A. "Retweet to pass the blunt": analyzing geographic and content features of cannabis-related Tweeting across the United States. J Stud Alcohol Drugs. 2017 Nov;78(6):910–5. 10.15288/jsad.2017.78.910. http://europepmc.org/article/MED/29087826 . [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]

33. Kumar R, Yadav S, Daniulaityte R, Lamy F, Lokala U. eDarkFind: unsupervised multi-view learning for Sybil Account detection. Proceedings of The Web Conference 2020; WWW '20: The Web Conference 2020; Apr 20 - 24, 2020; Taipei Taiwan. 2020. [CrossRef] [Google Scholar]

34. Lamy F, Daniulaityte R, Barratt M, Lokala U, Sheth A, Carlson R. Listed for sale: analyzing data on fentanyl, fentanyl analogs and other novel synthetic opioids on one cryptomarket. Drug Alcohol Depend. 2020 Jun 12;213:108115. 10.1016/j.drugalcdep.2020.108115. http://europepmc.org/article/MED/32585419 .S0376-8716(20)30280-5 [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]

35. Lokala U, Lamy FR, Daniulaityte R, Sheth A, Nahhas RW, Roden JI, Yadav S, Carlson RG. Global trends, local harms: availability of fentanyl-type drugs on the dark web and accidental overdoses in Ohio. Comput Math Organ Theory. 2019 Mar 25;25(1):48–59. 10.1007/s10588-018-09283-0. http://europepmc.org/article/MED/32577089 . [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]

36. Ontology101. Protege Wiki. [2022-04-10]. https://protegewiki.stanford.edu/wiki/Ontology101,

37. Welcome to ESIP's Community Ontology Repository, or COR. Community Ontology Repository. [2022-04-10]. http://esipfed.github.io/cor/

38. About DBpedia. DBpedia. [2022-04-09]. https://wiki.dbpedia.org/about .

39. Ontology metrics. Protégé 5 Documentation. [2022-04-09]. http://protegeproject.github.io/protege/views/ontology-metrics/

40. Gyrard A. PerfectO: semantic web best practices. PerfectO. 2021. Oct, [2020-02-20]. http://perfectsemanticweb.appspot.com/

41. Gaur M, Kursuncu U, Alambo A, Sheth A, Daniulaityte R, Thirunarayan K. "Let me tell you about your mental health!": contextualized classification of reddit posts to DSM-5 for web-based intervention. Proceedings of the 27th ACM International Conference on Information and Knowledge Management; CIKM '18: The 27th ACM International Conference on Information and Knowledge Management; Oct 22 - 26, 2018; Torino Italy. 2018. [CrossRef] [Google Scholar]

42. WebVOWL - Web-based visualization of ontologies. VOWL. [2022-04-10]. http://vowl.visualdataweb.org/webvowl.html .

43. Hamm MP, Chisholm A, Shulhan J, Milne A, Scott SD, Given LM, Hartling L. Social media use among patients and caregivers: a scoping review. BMJ Open. 2013 May 09;3(5):e002819. 10.1136/bmjopen-2013-002819. https://bmjopen.bmj.com/lookup/pmidlookup?view=long&pmid=23667163 .bmjopen-2013-002819 [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]

44. Bluelight homepage. Bluelight. [2021-02-15]. https://www.bluelight.org/xf/

45. eDrugTrends Center for Interventions, Treatment, and Addictions Research (CITAR) Wright State University. [2020-02-20]. https://medicine.wright.edu/citar/edrugtrends .

46. eDarkTrends | Center for Interventions, Treatment, and Addictions Research (CITAR) Wright State University. [2020-02-20]. https://medicine.wright.edu/citar/edarktrends .

47. Scholl L, Seth P, Kariisa M, Wilson N, Baldwin G. Drug and opioid-involved overdose deaths — United States, 2013–2017. MMWR Morb Mortal Wkly Rep. 2018 Dec 21;67(5152):1419–27. 10.15585/mmwr.mm6751521e1. http://paperpile.com/b/etYjDc/buFi . [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]

48. Wilson N, Kariisa M, Seth P, Smith H, Davis NL. Drug and opioid-involved overdose deaths - United States, 2017-2018. MMWR Morb Mortal Wkly Rep. 2020 Mar 20;69(11):290–7. 10.15585/mmwr.mm6911a4. 10.15585/mmwr.mm6911a4. [Europe PMC free article] [Abstract] [CrossRef] [CrossRef] [Google Scholar]

49. Panchal N, Kamal R, Orgera K, Cox C, Garfield R, Hamel L, Munana C, Chidambaram P. The implications of COVID-19 for mental health and substance use. Kaiser Family Foundation. 2020. Aug 21, [2022-07-29]. https://abtcounseling.com/wp-content/uploads/2020/09/The-Implications-of-COVID-19-for-Mental-Health-and-Substance-Use-_-KFF.pdf .

50. Garfin D, Silver R, Holman E. The novel coronavirus (COVID-2019) outbreak: amplification of public health consequences by media exposure. Health Psychol. 2020 May;39(5):355–7. 10.1037/hea0000875. http://europepmc.org/article/MED/32202824 .2020-20168-001 [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]

51. Holmes EA, O'Connor RC, Perry VH, Tracey I, Wessely S, Arseneault L, Ballard C, Christensen H, Cohen Silver R, Everall I, Ford T, John A, Kabir T, King K, Madan I, Michie S, Przybylski AK, Shafran R, Sweeney A, Worthman CM, Yardley L, Cowan K, Cope C, Hotopf M, Bullmore E. Multidisciplinary research priorities for the COVID-19 pandemic: a call for action for mental health science. Lancet Psychiatry. 2020 Jun;7(6):547–60. 10.1016/s2215-0366(20)30168-1. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]

52. Qiu J, Shen B, Zhao M, Wang Z, Xie B, Xu Y. A nationwide survey of psychological distress among Chinese people in the COVID-19 epidemic: implications and policy recommendations. Gen Psych. 2020 Mar 06;33(2):e100213. 10.1136/gpsych-2020-100213. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]

53. Alambo A, Gaur M, Thirunarayan K. Depressive, drug abusive, or informative: knowledge-aware study of news exposure during COVID-19 outbreak. arXiv. 2020 http://arxiv.org/abs/2007.15209 . [Google Scholar]

54. National Institute on Drug Abuse. NIDA. [2021-02-11]. https://www.drugabuse.gov/

55. United States Drug Enforcement Administration homepage. United States Drug Enforcement Administration. [2021-02-11]. https://www.dea.gov/

56. 2022 European Drug Report. European Monitoring Centre for Drugs and Drug Addiction (EMCDDA) [2021-02-11]. https://www.emcdda.europa.eu/emcdda-home-page_en .

57. RxNorm. NIH U.S. National Library of Medicines. [2021-02-11]. https://www.nlm.nih.gov/research/umls/rxnorm/index.html .

58. Daniulaityte R, Carlson RG, Kenne DR. Initiation to pharmaceutical opioids and patterns of misuse: preliminary qualitative findings obtained by the ohio substance abuse monitoring network. J Drug Issues. 2016 Aug 03;36(4):787–808. 10.1177/002204260603600402. [CrossRef] [Google Scholar]

59. Daniulaityte R, Carlson RG. "To numb out and start to feel nothing": experiences of stress among crack-cocaine using women in a midwestern city. J Drug Issues. 2011 Jan 01;41(1):1–24. 10.1177/002204261104100101. http://europepmc.org/article/MED/21625334 . [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]

60. Daniulaityte R, Falck R, Carlson RG. Illicit use of buprenorphine in a community sample of young adult non-medical users of pharmaceutical opioids. Drug Alcohol Depend. 2012 May 01;122(3):201–7. 10.1016/j.drugalcdep.2011.09.029. http://europepmc.org/article/MED/22036303 .S0376-8716(11)00434-0 [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]

61. Daniulaityte R, Carlson RG, Kenne DR. Methamphetamine use in Dayton, Ohio: preliminary findings from the Ohio Substance Abuse Monitoring Network. J Psychoactive Drugs. 2007 Sep;39(3):211–21. 10.1080/02791072.2007.10400607. [Abstract] [CrossRef] [Google Scholar]

62. Lamy FR, Daniulaityte R, Zatreh M, Nahhas RW, Sheth A, Martins SS, Boyer EW, Carlson RG. "You got to love rosin: solventless dabs, pure, clean, natural medicine." Exploring Twitter data on emerging trends in rosin tech marijuana concentrates. Drug Alcohol Depend. 2018 Feb 01;183:248–52. 10.1016/j.drugalcdep.2017.10.039. http://europepmc.org/article/MED/29306816 .S0376-8716(17)30604-X [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]

63. Noy NF, McGuinness DL. Ontology Development 101: A Guide to Creating Your First Ontology. Stanford, CA: Stanford University; 2001. [Google Scholar]

64. Musen MA, Protégé Team The protégé project: a look back and a look forward. AI Matters. 2015 Jun 16;1(4):4–12. 10.1145/2757001.2757003. http://europepmc.org/article/MED/27239556 . [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]

65. Lin Y, Liu Z, Sun M, Liu Y, Zhu X. Learning entity and relation embeddings for knowledge graph completion. Proceedings of the AAAI Conference on Artificial Intelligence; Twenty-Ninth AAAI Conference on Artificial Intelligence; Jan 25–30, 2015; Austin, Texas, USA. 2015. https://ojs.aaai.org/index.php/AAAI/article/view/9491 . [Google Scholar]

66. Lehmann J, Isele R, Jakob M, Jentzsch A, Kontokostas D, Mendes PN, Hellmann S, Morsey M, van Kleef P, Auer S, Bizer C. DBpedia – A large-scale, multilingual knowledge base extracted from Wikipedia. Semant Pragmat. 2015;6(2):167–95. 10.3233/SW-140134. https://content.iospress.com/articles/semantic-web/sw134 . [CrossRef] [Google Scholar]

67. Lohmann S, Negru S, Haag F, Ertl T. Visualizing ontologies with VOWL. Semantic Web. 2016 May 27;7(4):399–419. 10.3233/SW-150200. http://visualdataweb.de/webvowl/#iri=http://purl.org/dao# . [CrossRef] [Google Scholar]

68. Call for resources track papers. ISWC. [2020-02-20]. http://iswc2018.semanticweb.org/call-for-resources-track-papers/

69. Gyrard A, Atemezing G, Serrano M. Semantic IoT: Theory and Applications. Cham: Springer; 2021. PerfectO: an online toolkit for improving quality, accessibility, and classification of domain-based ontologies. [Google Scholar]

70. LODE - Live OWL Documentation Environment homepage. LODE - Live OWL Documentation Environment. [2020-02-21]. http://www.essepuntato.it/lode .

71. Lokala U. Drug abuse ontology. Earth Science Information Partners Community Ontology Repository. 2020. Jul 27, [2022-02-15]. http://cor.esipfed.org/ont/~ushanri/DAO .

72. Lohmann S, Negru S, Haag F, Ertl T. Visualizing ontologies with VOWL. Semantic Web. 2016 May 27;7(4):399–419. 10.3233/sw-150200. [CrossRef] [Google Scholar]

73. Visual Notation for OWL Ontologies. Visual Dataweb. 2020. Jul 27, [2022-02-15]. http://vowl.visualdataweb.org/webvowl-old/webvowl-old.html#iri=http://cor.esipfed.org/ont/~ushanri/DAO%C2%A0 .

74. Poveda-Villalón M, Gómez-Pérez A, Suárez-Figueroa MC. OOPS! (OntOlogy Pitfall Scanner!): an on-line tool for ontology evaluation. Intl J Semantic Web Inform Syst. 2014;10(2) 10.4018/ijswis.2014040102. https://www.igi-global.com/article/oops-ontology-pitfall-scanner/116450 . [CrossRef] [Google Scholar]

75. Vapour report. Vapour Linked Data Validator. [2022-02-15]. http://linkeddata.uriburner.com:8000/vapour?uri=http://cor.esipfed.org/ont/~ushanri/DAO .

76. Graphite PHP linked data library. RDF Triple-Checker. [2022-02-15]. http://graphite.ecs.soton.ac.uk/checker/?uri=http://cor.esipfed.org/ont/~ushanri/DAO .

77. Ontology metrics. Protégé 5 Documentation. [2021-02-15]. http://protegeproject.github.io/protege/views/ontology-metrics/

78. García J, García-Peñalvo FJ, Therón R. Knowledge Management, Information Systems, E-Learning, and Sustainability Research. Berlin, Heidelberg: Springer; 2010. A survey on ontology metrics. [Google Scholar]

79. Web ontology language (OWL) abstract syntax and semantics section 2. Abstract syntax. W3C Working Draft. [2021-02-15]. https://www.w3.org/TR/2002/WD-owl-semantics-20021108/syntax.html .

80. Sheth A, Perera S, Wijeratne S, Thirunarayan K. Knowledge will propel machine understanding of content: extrapolating from current examples. Proceedings of the International Conference on Web Intelligence; WI '17: International Conference on Web Intelligence 2017; Aug 23 - 26, 2017; Leipzig Germany. 2017. https://datamed.org . [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]

81. Estival D, Nowak C, Zschorn A. Towards ontology-based natural language processing. Proceeedings of the Workshop on NLP and XML (NLPXML-2004): RDF/RDFS and OWL in Language Technology; NLPXML '04: Proceeedings of the Workshop on NLP and XML (NLPXML-2004): RDF/RDFS and OWL in Language Technology; Jun 1, 2004; Barcelona, Spain. 2004. [Google Scholar]

82. Daniulaityte R, Carlson RG, Falck RS, Cameron DH, Udayanaga S, Chen L, Sheth AP. A web-based study of self-treatment of opioid withdrawal symptoms with loperamide. Wright State University. 2012. [2022-04-08]. https://corescholar.libraries.wright.edu/knoesis/624/

83. PREDOSE Demo. Youtube. 2013. Sep 16, [2020-02-16]. https://www.youtube.com/watch?v=gCFPzMgEPQM .

84. Sheth A, Kapanipathi P. Semantic filtering for social data. IEEE Internet Comput. 2016 Jul;20(4):74–8. 10.1109/MIC.2016.86. [CrossRef] [Google Scholar]

85. International Classification of Diseases, Version 10. BioPortal. [2020-02-20]. https://bioportal.bioontology.org/ontologies/ICD10 .

86. SNOMED CT. BioPortal. [2020-02-20]. https://bioportal.bioontology.org/ontologies/SNOMEDCT .

87. bioCaddie Core Development Team. Home - DataMed. [2020-02-20]. https://datamed.org .

88. Research domain criteria (RDoC) Research domain criteria (RDoC). In: National Institute of Mental Health (NIMH) [2022-04-10]. https://www.nimh.nih.gov/research/research-funded-by-nimh/rdoc .

89. Alambo A, Gaur M, Lokala U, Kursuncu U, Thirunarayan K, Gyrard A, Sheth A, Welton RS, Pathak J. Question answering for suicide risk assessment using reddit. Proceedings of the 2019 IEEE 13th International Conference on Semantic Computing (ICSC); 2019 IEEE 13th International Conference on Semantic Computing (ICSC); Jan 30 - Feb 01, 2019; Newport Beach, CA, USA. 2019. [CrossRef] [Google Scholar]

90. Park A, Conway M. Harnessing reddit to understand the written-communication challenges experienced by individuals with mental health disorders: analysis of texts from mental health communities. J Med Internet Res. 2018 Apr 10;20(4):e121. 10.2196/jmir.8219. https://www.jmir.org/2018/4/e121/ v20i4e121 [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]

91. Saravia E, Chang C, De Lorenzo RJ, Chen Y. MIDAS: mental illness detection and analysis via social media. Proceedings of the 2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM); 2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM); Aug 18-21, 2016; San Francisco, CA, USA. 2016. [CrossRef] [Google Scholar]

92. Gkotsis G, Oellrich A, Velupillai S, Liakata M, Hubbard TJ, Dobson RJ, Dutta R. Characterisation of mental health conditions in social media using Informed Deep Learning. Sci Rep. 2017 Mar 22;7:45141. 10.1038/srep45141. 10.1038/srep45141.srep45141 [Europe PMC free article] [Abstract] [CrossRef] [CrossRef] [Google Scholar]

93. Gaur M, Khandelwal V, Kurşuncu U, Pallagani V. Measuring spatio-temporal psychological impact of novel coronavirus through social quality index. YouTube. [2020-06-27]. https://youtu.be/XzYrn0PEzNk .

94. Gaur M, Kursuncu U, Khandelwal V, Pallagani V, Shalin V, Sheth A. Psychidemic: measuring the spatio-temporal psychological impact of novel choronovirus with a social quality index. Proceedings of the Computing Research Association Annual Conference; Computing Research Association Annual Conference; 2020; -. 2020. [CrossRef] [Google Scholar]

95. Ranking the states. Mental Health America. [2021-02-11]. https://www.mhanational.org/issues/ranking-states .

96. Daniulaityte R, Carlson RG, Golroo F, Wijeratne S, Boyer EW, Martins SS, Nahhas RW, Sheth AP. 2015 Abstract Book. Fairborn, Ohio: Wright State University; 2015. "Time for Dabs": analyzing Twitter data on butane hash oil use. [Google Scholar]

97. Covid19 - knoesis wiki. Wiki. [2022-04-08]. http://wiki.aiisc.ai/index.php/Covid19 .

98. Mozzicato P. MedDRA. Pharmaceut Med. 2009;23:65–75. [Google Scholar]

99. Böhm K, Ortiz M. A tool for building topic-specific ontologies using a knowledge graph. Proceedings of the 31st International Workshop on Description Logics co-located with 16th International Conference on Principles of Knowledge Representation and Reasoning (KR 2018); 31st International Workshop on Description Logics co-located with 16th International Conference on Principles of Knowledge Representation and Reasoning (KR 2018); Oct 27-29, 2018; Tempe, Arizona, US. 2018. [Google Scholar]

Articles from JMIR Public Health and Surveillance are provided here courtesy of JMIR Publications Inc.

Full text links

Read article at publisher's site: https://doi.org/10.2196/24938

Read article for free, from open access legal sources, via Unpaywall: https://publichealth.jmir.org/2022/12/e24938/PDF

Citations & impact

Impact metrics

Citations

Jump to Citations

Citations of article over time

Alternative metrics

Altmetric item for https://www.altmetric.com/details/140478557

Altmetric
Discover the attention surrounding your research
https://www.altmetric.com/details/140478557

Smart citations by scite.ai
Explore citation contexts and check if this article has been supported or disputed.
https://scite.ai/reports/10.2196/24938

Supporting

Mentioning

Contrasting

Article citations

The Use of Natural Language Processing Methods in Reddit to Investigate Opioid Use: Scoping Review.
Almeida A, Patton T, Conway M, Gupta A, Strathdee SA, Bórquez A
JMIR Infodemiology, 4:e51156, 13 Sep 2024
Cited by: 0 articles | PMID: 39269743 | PMCID: PMC11437337
Review
This article is in the Europe PMC Open access subset. Refer to the copyright information in the article for licensing details.
Free full text in Europe PMC
Detecting Substance Use Disorder Using Social Media Data and the Dark Web: Time- and Knowledge-Aware Study.
Lokala U, Phukan OC, Dastidar TG, Lamy F, Daniulaityte R, Sheth A
JMIRx Med, 5:e48519, 01 May 2024
Cited by: 2 articles | PMID: 38717384
Analysis of Wastewater Samples to Explore Community Substance Use in the United States: Pilot Correlative and Machine Learning Study.
Severson MA, Onanong S, Dolezal A, Bartelt-Hunt SL, Snow DD, McFadden LM
JMIR Form Res, 7:e45353, 26 Oct 2023
Cited by: 1 article | PMID: 37883150 | PMCID: PMC10636622
This article is in the Europe PMC Open access subset. Refer to the copyright information in the article for licensing details.
Free full text in Europe PMC
Knowledge Representation and Management 2022: Findings in Ontology Development and Applications.
Charlet J, Cui L, Section Editors for the IMIA Yearbook Section on Knowledge Representation and Management
Yearb Med Inform, 32(1):225-229, 01 Aug 2023
Cited by: 1 article | PMID: 38147864 | PMCID: PMC10751114
Review
This article is in the Europe PMC Open access subset. Refer to the copyright information in the article for licensing details.
Free full text in Europe PMC
Characterization of time-variant and time-invariant assessment of suicidality on Reddit using C-SSRS.
Gaur M, Aribandi V, Alambo A, Kursuncu U, Thirunarayan K, Beich J, Pathak J, Sheth A
PLoS One, 16(5):e0250448, 17 May 2021
Cited by: 6 articles | PMID: 33999927 | PMCID: PMC8128252
This article is in the Europe PMC Open access subset. Refer to the copyright information in the article for licensing details.
Free full text in Europe PMC

Data

Data behind the article

This data has been text mined from the article, or deposited into data resources.

BioStudies: supplemental material and supporting data

http://www.ebi.ac.uk/biostudies/studies/S-EPMC9823583?xr=true

Funding

Funders who supported this work.

NIAID NIH HHS (1)

Grant ID: R01 AI039454
59 publications

NIDA NIH HHS (2)

Grant ID: R21 DA044518
6 publications
Grant ID: R21 DA030571
6 publications

Search life-sciences literature (45,104,931 articles, preprints and more)

Drug Abuse Ontology to Harness Web-Based Data for Substance Use Epidemiology Research: Ontology Development Study.

Author information

Affiliations

Authors

Authors

Authors

Authors

Authors

ORCIDs linked to this article

Abstract

Background

Objective

Methods

Results

Conclusions

Free full text

Drug Abuse Ontology to Harness Web-Based Data for Substance Use Epidemiology Research: Ontology Development Study

Usha Lokala

Francois Lamy

Raminta Daniulaityte

Manas Gaur

Amelie Gyrard

Krishnaprasad Thirunarayan

Ugur Kursuncu

Amit Sheth

Associated Data

Abstract

Background

Objective

Methods

Results

Conclusions

Introduction

Background

Descriptions of machine learning (ML), natural language processing (NLP), and ontology terms used in this paper.

Evolution of the DAO

Prescription Drug Abuse Online Surveillance

eDrugTrends

eDarkTrends

COVID-19 Pandemic

Methods

Overview

Design

Instantiation

Ethics Approval

Results

Evaluation

Table 1

The DAO Within PREDOSE

eDrugTrends (Monitoring Drug Trends on Social Media)

Table 2

Enhancing the DAO With DSM-5

Table 3

Table 4

eDarkTrends (Monitoring Drug Trends on Cryptomarkets)

Table 5

Table 6

COVID-19 Scenario

Table 7

Discussion

Strengths and Limitations

Table 8

Principal Findings and Conclusions

Acknowledgments

Abbreviations

Multimedia Appendix 1

Footnotes

References

Full text links

Citations & impact

Impact metrics

Citations of article over time

Alternative metrics

Article citations

Data

Data behind the article

BioStudies: supplemental material and supporting data

Similar Articles

Funding

NIAID NIH HHS (1)

NIDA NIH HHS (2)