Entity-relationship diagram
Entity-relationship diagram (ERD) is a graphical representation of entities and their relationships within a database system. Peter Chen introduced this modeling technique in his 1976 paper "The Entity-Relationship Model--Toward a Unified View of Data" published in ACM Transactions on Database Systems. The paper became one of the most cited works in computer science and established ERD as a fundamental tool for database design.
Historical development
Before Chen's work, database designers lacked standardized visual methods for representing data structures. Charles Bachman developed Data Structure Diagrams in the 1960s, and A.P.G. Brown published research on real-world systems modeling. These approaches provided foundations, but no unified notation existed[1].
Peter Pin-Shan Chen, then a researcher at MIT, synthesized existing ideas into a coherent framework. His 1976 paper proposed that the real world consists of entities and relationships. This natural view captured semantic information that previous models overlooked.
The entity-relationship model played a significant role in the Computer-Aided Software Engineering (CASE) movement during the late 1980s and early 1990s. James Martin refined ERD conventions during this period. The combined work of Chen, Bachman, Brown, and Martin contributed to the development of Unified Modeling Language (UML), which became widely adopted for software design.
Core concepts
The ER model rests on three fundamental concepts:
Entities represent real-world objects or concepts about which data is stored. A person, product, order, or department can serve as an entity. Entities are depicted as rectangles in Chen notation.
Attributes describe properties of entities. A person entity might have attributes for name, birth date, and address. Ovals represent attributes, connected by lines to their parent entities.
Relationships show how entities connect to one another. An employee "works for" a department. A customer "places" an order. Diamonds represent relationships in the original notation.
Chen notation
Chen's original notation uses distinct symbols for each element:
- Rectangles for entity sets
- Ovals for attributes
- Diamonds for relationships
- Lines connecting related elements
Relationship cardinality indicates how many instances of one entity relate to another:
- One-to-one (1:1) - Each entity instance relates to exactly one instance of another entity
- One-to-many (1:N) - One instance relates to multiple instances
- Many-to-many (M:N) - Multiple instances on both sides relate to each other
Alternative notations
Several notation systems have emerged since Chen's original work:
Crow's Foot notation - Uses fork symbols to indicate "many" relationships. Widely used in professional database tools.
Information Engineering notation - Developed by James Martin. Emphasizes cardinality and optionality.
IDEF1X - U.S. government standard integrating ER concepts with data modeling requirements.
UML Class Diagrams - Object-oriented extension that incorporates ER concepts into broader software modeling.
Each notation offers advantages for specific contexts. Chen notation remains common in educational settings because its symbols are intuitive.
Creating an ERD
Database designers follow a systematic process when creating entity-relationship diagrams[2]:
- Identify entities from requirements documents
- Determine attributes for each entity
- Define primary keys to uniquely identify instances
- Establish relationships between entities
- Specify cardinality constraints
- Add any remaining attributes to relationships
- Review and refine the diagram
The resulting diagram guides physical database implementation. Tables correspond to entities, columns to attributes, and foreign keys enforce relationships.
Applications
ERDs serve multiple purposes in organizational contexts:
Database design - The primary use case. ERDs translate business requirements into implementable structures.
Communication - Diagrams provide common ground for discussions between technical staff and business stakeholders.
Documentation - ERDs record design decisions for future reference and maintenance.
Education - Universities use ERDs to teach database fundamentals. The visual approach helps students grasp abstract concepts.
Reverse engineering - Analysts create ERDs from existing databases to understand undocumented systems.
Limitations
ERDs focus on data structure rather than behavior. They do not capture business rules, temporal aspects, or processing logic. Complex domains may require supplementary documentation.
Large systems produce unwieldy diagrams. Designers often decompose models into subject areas or use hierarchical approaches. Tools with zoom and filtering capabilities help manage complexity.
See also
Related articles:
References
- Chen, P.P. (1976). "The Entity-Relationship Model--Toward a Unified View of Data." ACM Transactions on Database Systems, 1(1), 9-36.
- Martin, J. (1989). Information Engineering. Prentice Hall.
- Teorey, T., Lightstone, S., & Nadeau, T. (2011). Database Modeling and Design. Morgan Kaufmann.
Footnotes
[1] Charles Bachman received the ACM Turing Award in 1973 for his contributions to database technology.
[2] Process adapted from standard database design methodologies taught in information systems curricula.