Dartmouth Research Computing & Data (RCD) logo
  • Home 
  • HPC 
  • AI 
  • Services 
  • Status 
  • About Us 

  •   Search this site
  •  
  1.   Data Science Support @ Researc...
  1. Home
  2. Data Science Support @ Research Computing
  3. Text Analysis & Natural Language Processing

Text Analysis & Natural Language Processing

On this page
  What We Support     Who We Work With     Ready to Get Started?  

Text is everywhere in research — from interview transcripts and historical documents to clinical notes and scientific publications. Whether you’re cleaning up survey responses or building generative models, Research Computing supports a wide range of text analysis workflows across disciplines.

  What We Support  

We offer guidance at every level of the text analysis pipeline:

  •   Text Cleaning & Preprocessing: Turning unstructured or messy text into structured data for analysis
  •   Named Entity Recognition (NER) and keyword extraction for tagging people, places, concepts, or chemicals in your corpus
  •   Topic Modeling & Clustering: Discovering patterns in large text collections with LDA, NMF, or BERTopic
  •   Text Visualization: Tools like word clouds, concordance plots, topic projections, and term frequency charts
  •   Text Classification: Supervised workflows for coding documents, sentiment analysis, and category prediction
  •   Embeddings & Representation: Word2Vec, BERT, and other vector-based approaches for semantic similarity and clustering
  •   Generative AI & LLMs: Use of models like ChatGPT for summarization, rephrasing, translation, or question answering in domain-specific contexts

  Who We Work With  

Our text analysis clients span the humanities, social sciences, and biomedical research:

  •   History, Literature & Digital Humanities: Thematic analysis, archival text mining, authorship attribution, and document exploration
  •   Sociology & Political Science: Analyzing open-ended survey responses, policy documents, interviews, and speeches
  •   Clinical & Biomedical Sciences: De-identifying and analyzing clinical notes, extracting information from medical records
  •   Education & Psychology: Coding reflective essays, journal entries, and classroom dialogue
  •   Natural & Physical Sciences: Mining the scientific literature for chemical reactions, methods, or concept networks

  Ready to Get Started?  

Whether you’re exploring qualitative coding or deploying a neural model for entity extraction, our team is here to help turn your text into insights.

Explore all our services to learn more.

On this page:
  What We Support     Who We Work With     Ready to Get Started?  
Copyright © 2025 Dartmouth Research Computing & Data | Powered by Hinode.
Dartmouth Research Computing & Data (RCD)
Code copied to clipboard