Skip to article frontmatterSkip to article content
Site not loading correctly?

This may be due to an incorrect BASE_URL configuration. See the MyST Documentation for reference.

Discovery Document

Below is the grading rubric for the Discovery Document, structured with three performance levels: “Exceeds Expectations,” “Meets Expectations,” and “Does Not Meet Expectations.” Each main requirement is assigned a maximum point value, totaling 100 points.

RequirementExceeds ExpectationsMeets ExpectationsDoes Not Meet ExpectationsPoints
Title and Author(s)Title clearly reflects specific research questions; author(s) properly credited.Title reflects general research topic; author(s) credited.Title is missing or does not reflect research topic; author(s) not credited.5
Research QuestionsProvides a numbered list of at least 3 well-defined, answerable research questions with 1–3 sentence descriptions.Provides a numbered list of 3 research questions with brief descriptions; questions are somewhat clear.Provides fewer than 3 research questions; questions are vague or lack descriptions.15
MotivationClearly articulates the significance of the research questions in 1–2 well-developed paragraphs; explains relevance and potential impact.Provides a general explanation of the significance of the research questions; relevance is somewhat clear.Motivation is unclear or missing; does not explain the significance of the research questions.10
Dataset Description
For webscraping challenge; document the site for scraping URLs and expected format of future scraped data.
Thoroughly describes dataset(s) including exact URLs; dataset is real, contains over 500 lines, and is appropriate for research; explains data source, collection methods, and potential limitations. Deep understanding and documentation of data columns, formats, accuracy and caveatsDescribes dataset(s) with URLs; dataset meets basic requirements; provides some information on data source and collection methods.Inadequate description of dataset(s); missing URLs; dataset is inappropriate or insufficient for research; lacks information on data source and collection methods.30
Challenge GoalsSelects at least 2 appropriate challenge goals; provides strong justification for their relevance and suitability to the project.Selects 2 challenge goals; provides some justification for their selection.Fails to select appropriate challenge goals; justification is weak or missing.10
OverallDocumentation is well-organized, free of grammatical errors, and formatted professionally; tables and figures (if any) are clear and enhance understanding. All sections are complete with sufficient details (such as: Task List, Challenges working with the data...)Document is organized and mostly free of grammatical errors; formatting is adequate; tables and figures (if any) are understandable. Some details lackingDocument is poorly organized, contains grammatical errors, and lacks professional formatting; tables and figures (if any) are unclear or absent.30
Total Points100

Data Organization

RequirementExceeds ExpectationsMeets ExpectationsDoes Not Meet Expectations
Data Preparation and Code Submission (40 points)- Data is meticulously pre-processed, well-organized, and thoroughly documented.
- Comprehensive, well-annotated code provided for data collection and organization.
- Data is adequately pre-processed and organized.
- Code for data collection and organization is complete but may lack detailed annotations.
- Data is poorly organized or not pre-processed.
- Code is missing, incomplete, or contains significant errors.
Plot Sketches and Visualization Planning (40 points)- All research questions are addressed through diverse, creative, and feasible plot sketches.
- Sketches are detailed, with clear titles, labeled axes, appropriate use of color, legends, annotations, and statistical indicators.
- Effectively highlights challenge goals, such as machine learning predictions or interactive controls.
- Most research questions are addressed with a variety of plot types.
- Sketches include titles and labeled axes but may lack some details like color or annotations.
- Some challenge goals are incorporated into the sketches.
- Few or none of the research questions are addressed.
- Sketches are incomplete or missing essential elements like titles and labeled axes.
- Challenge goals are not considered or addressed.
Overall Documentation and Clarity (20 points)- All sketches and documentation is clear, well-organized, and easy to interpret.
- Documentation is professional, adheres to all guidelines, and effectively communicates the intended message.
- All documentation is submitted per the requirements.
- Sketches and documentation is generally clear with minor areas of confusion.
- Documentation mostly adheres to technical writing standards and communicates the intended message.
- Sketches and documentation is difficult to interpret due to poor organization or lack of clarity.
- Presentation does not adhere to guidelines and fails to communicate the intended message.
Total Points:100

Final Delivery (250 pts)

Your final delivery will be due according to the schedule posted and will be comprised of the following elements for grading.

Part 1: Peer Evaluation Rubric

To be completed by classmates via the Google Quiz. These focus on the presentation “experience.”

CategoryExcellentGoodAdequateNeeds Work
Speaking & EngagementClear, confident, and polished delivery. Engages the audience effectively throughout.Mostly clear delivery; occasional hesitations. Some effort to engage the audience.Somewhat unclear or monotone delivery. Limited connection with the audience.Unclear, overly quiet, or distracting delivery; little effort to engage.
Clarity & StructureStrong, logical flow. Clearly communicates purpose, methodology, and outcomes.Mostly clear with logical structure; transitions between topics are mostly smooth.Somewhat hard to follow; lacks organization or clarity in specific sections.Difficult to follow; unclear organization or missing key ideas.
Visual Accessibility & PacingAudience is given ample time to process each slide. Explanations are synchronized with visuals; data is visible to everyone.Pacing is generally good. Most slides are explained well and visible; transitions feel slightly rushed.Inconsistent pacing. Audience may struggle to see specific data points or keep up with the spoken explanation.Slides are flashed too quickly or not explained; data is too small to see or obscured.
Data Viz Standards & TechniquesExceptional adherence to unit guidelines. Uses correct chart types, avoids “chart junk,” and highlights key insights.Good application of unit techniques. Visuals follow guidelines with only minor aesthetic or technical deviations.Inconsistent use of principles. Charts may be misleading, cluttered, or fail to use techniques covered in class.Fails to apply unit guidelines. Charts are difficult to interpret or use inappropriate/prohibited techniques.

Part 2: Instructor Evaluation Rubric

This will be completed by your teacher. These focus on technical rigor and group dynamics.

CategoryExcellentGoodAdequateNeeds Work
Analytical Rigor & Claim AccuracyAll claims fully substantiated by data. No “leaps of faith”; conclusions address necessary disclosures and segments.Most claims are supported. Minor instances where conclusions slightly outpace evidence, but reporting is sound.Several claims lack data; makes logical leaps or omits critical segments that should have been disclosed.Significant “leaps of faith” without data. Fails to provide evidence or ignores major missing segments.
Team Collaboration & TransitionsAll members contribute meaningfully. Transitions are seamless, demonstrating rehearsal and co-operation.All members contribute clearly. Transitions are logical, though one or two may feel slightly unpolished.Visible imbalance in contribution; transitions are awkward or members seem disconnected.One or more members are significantly under-participating or absent; no cohesive team flow.
CategoryExcellentGoodAdequateNeeds Work
Technical Depth & Analytical RigorPurpose and technical features are justified with high-quality logic. Claims are fully substantiated by data with no “leaps of faith.” Findings connect seamlessly to research goals and future implications.Purpose and logic are clear. Most claims are supported by evidence. Summarizes outcomes well and relates them to the project purpose with minor gaps in depth or detail.Superficial overview; features are listed but not contextualized. Relies on unsubstantiated claims or omits critical context. Conclusion is brief or fails to explain the “so what.”Unclear purpose or minimal technical explanation. Significant “leaps of faith” without data. Missing a conclusion or fails to relate findings to goals.
Methodology & Technical ExecutionSophisticated technical approach. The approach taken is clearly documented and justified. Technical hurdles are handled with elegant solutions.Sound technical approach. Tools and methods are used correctly and described clearly, though some minor implementation details or justifications may be missing.Basic implementation. Methods are described but lack detail on “why” specific tools/logic were chosen. Implementation is functional but unoptimized or simplistic.Methodology is missing, fundamentally flawed, or a “black box.” No explanation of how the results were technically produced or how tools were applied.
Professionalism & Data VizPolished formatting and error-free writing. Data visualizations strictly adhere to unit guidelines (no chart junk) and enhance the narrative.Well-formatted with few errors. Visuals are effective and follow most class guidelines, with minor aesthetic deviations.Inconsistent formatting or grammar. Data visualizations may be cluttered, difficult to interpret, or ignore class techniques.Disorganized or hard to read. Visuals are missing, misleading, or use prohibited techniques.
Challenge GoalsSuccessfully implements two or more listed challenge goals beyond core requirements. Documented with full technical transparency.Challenge goals (two or more) do not meet the technical depth and effort expected for a semester-long research project.One challenge goal met.No evidence of challenge goals attempted.

Code/GitHub (50 pts)

CategoryExcellentGoodAdequateNeeds Work
Documentation & Administrative SetupMain README provides a professional overview, folder map, and “How to Run” guide. All sub-folders have READMEs. The Research Report is correctly placed in the final_doc folder. Data schemas/columns are fully defined.Main README and folder layout are clear. Research Report is included but perhaps misplaced. Data columns are described, but sub-folder documentation is sparse.README is present but missing key components (e.g., folder map). Documentation of data columns is incomplete.README is minimal or missing. Research Report is missing from the submission. Folder structure is undocumented.
Code Quality & PortabilityCode is readable and organized in a maintainable, coherent way. All AI-generated logic and plots are clearly attributed with sources cited in comments. Program runs seamlessly on Windows and Linux/Codespaces.Code is mostly organized. AI attribution is present for major blocks but may lack specific citations for plot generation. Runs on both platforms with minor manual tweaks.Code is disorganized. AI-generated code is included but poorly documented or missing proper attribution. Platform-locked (e.g., only works on Windows).Disorganized and impossible to follow. No attribution for AI use. Code fails to run on one or both required platforms.
Reproducibility & Plot GenerationAll plots can be regenerated via a single script or clear, reproducible steps. Instructions for data processing are robust and error-free.Most plots can be regenerated with minor manual intervention. Instructions for plot generation are mostly clear.Unable to regenerate multiple plots. Code for data processing is missing or requires significant troubleshooting.Plot generation code is missing, non-functional, or no instructions are provided.

Ongoing

Professionalism

This section will be ongoing through the entire project lifecycle and count towards the final grade (Professionalism category)

CategoryExcellentGoodAdequateNeeds Work
ProfessionalismNo instances of being off task. Asks for hints occasionally, independently researches, demonstrates problem-solving abilities.1 or 2 reminders needed to be on task. Needs help with difficult issues but demonstrates ability to debug and grow independently.3+ reminders to get back on task. Consistently unable to resolve (minor) issues without assistance.Frequently off task. Cannot make progress without assistance.