Evaluating Large Language Models: A Comprehensive Survey

Parts
Alignment Evaluation
Introduction and Taxonomy
Knowledge and Capability Evaluation
Organizations, Future Directions, and Conclusion
Safety Evaluation
Specialized LLM Evaluations