3D Judge

The system should build a judge that measures real 3D structural correctness, not just 2D plausibility.

Research Gap

  1. Janus and view inconsistency
  2. Data scarcity
  3. Benchmark weakness
  4. Compute and representation limits
  5. Open limitations across the literature

Benchmarks

  1. Eval3D is the primary benchmark
  2. T3-Bench is the secondary baseline
  3. Core grading axes
  4. 3D MM-Vet is useful for LLM-based judges