pith. sign in

Tao Shen

Identifiers

  • name variant Tao Shen 0.60 · backfill

Papers (10)

  1. Efficient Proposal-Test-Release for Minimax Optimal Estimation stat.ME · 2026 · author #1
  2. Mamoda2.5: Enhancing Unified Multimodal Model with DiT-MoE cs.CV · 2026 · author #3
  3. Construction of Knowledge Graph based on Language Model cs.CL · 2026 · author #5
  4. Will LLMs Scaling Hit the Wall? Breaking Barriers via Distributed Resources on Massive Edge Devices cs.DC · 2025 · author #1
  5. A Survey on Knowledge Distillation of Large Language Models cs.CL · 2024 · author #4
  6. Tensorized Self-Attention: Efficiently Modeling Pairwise and Global Dependencies Together cs.CL · 2018 · author #1
  7. Bi-Directional Block Self-Attention for Fast and Memory-Efficient Sequence Modeling cs.CL · 2018 · author #1
  8. Reinforced Self-Attention Network: a Hybrid of Hard and Soft Attention for Sequence Modeling cs.CL · 2018 · author #1
  9. Calculations of point defects in the layered MX2 (M=Mo, W; X=S, Te): Substitution by the groups III, V and VII elements cond-mat.mtrl-sci · 2018 · author #3
  10. DiSAN: Directional Self-Attention Network for RNN/CNN-Free Language Understanding cs.CL · 2017 · author #1

Mentions

  • 2402.13116 #4 · arxiv_oai · confidence 0.70 Tao Shen

Frequent Coauthors