Logo
  • Installation
    • Recommended Install
    • Full Install
      • Setup
        • Linux
        • OSX (with brew)
        • Install (pip)
        • Install (Anaconda)
        • Install (GitHub)
      • Install Other Dependencies
        • Load NLTK corpus
        • Install Stanford Parser
        • Install Tweet NLP v0.3 (ark-tweet-nlp-0.3)
        • Python Modules (optional)
        • Install the IBM Wordcloud jar file (optional)
        • Mallet (optional)
    • Full List of Dependencies
      • Python
      • Other
      • Python (optional)
      • Other (optional)
    • Python version support
    • Getting Started
      • Command Line Interface
      • MySQL Configuration
      • Sample Datasets
      • Next Steps
    • Install Issues
  • Github Repo
  • Getting started in Colab
  • Tutorials
    • Getting started
    • Text Cleaning and Transformations
    • Feature Extraction
    • Viewing your data and output
    • Prediction
    • Clustering
    • LDA with Mallet
    • Data Engines
    • Other Topics
    • Video Tutorials
  • Packaged Datasets
    • Language Data
      • Blog Authorship Corpus
    • Lexica
      • Age and Gender Lexica
      • PERMA Lexicon
      • Spanish PERMA Lexicon
      • Other Lexica
    • LDA Topics
      • 2000 Facebook Topics
  • dlatkInterface Flags by type
    • Setup
    • Preprocessing
    • Feature Extraction
    • Feature Refinement
    • Language Insights
    • Clustering
    • Prediction
      • Regression
      • Classification
    • Visualization
  • Papers Utilizing DLATK
    • DLATK Paper
    • Peer Reviewed Publications
      • 2020
      • 2019
      • 2018
      • 2017
      • 2016
      • 2015
      • 2014
      • 2013
DLATK
  • Tutorials
  • View page source

Tutorials

The following tutorials are roughly ordered from beginner to more advanced topics.

Getting started

  • Differential Language Analysis (DLA) Tutorial

  • DLA Tutorial in Colab

Text Cleaning and Transformations

  • Data Cleaning

  • Tokenization, Part of Speech Tagging and Segmentation

Feature Extraction

  • Understanding Feature Table Names

  • Advanced Feature Extraction

  • Transformers in DLATK (Huggingface Interface)

Viewing your data and output

  • Output Formats

  • Using DLATK to view your SQL data

Prediction

  • Intro Prediction / Classification / Predictive Lexica

  • Building A Pickle Model

  • Applying A Pickle Model

Clustering

  • Clustering and Super Topics

LDA with Mallet

  • DLATK LDA Interface

  • Mallet LDA Interface

Data Engines

  • Data Engines

Other Topics

  • Importing and Exporting Data

  • DLA Rules of Thumb

  • Working with DLATK's Classes

  • DLATK's Pandas Interface

  • Using INI Files

Video Tutorials

The following videos were developed by the World Well-Being Project and are not directly related to running DLATK .

  • Open Vocabulary Analysis: The Concept

  • Age / Gender Predictive Lexica

  • Differential Word Clouds

  • Twitter Predicting Heart Disease

  • Penalized Regression

  • Introduction to Word Embeddings

Previous Next

© Copyright 2024, H. Andrew Schwartz and Salvatore Giorgi.

Built with Sphinx using a theme provided by Read the Docs.