# Regression Analysis in R: Adapting to Varied Data Types

## Course Description 

Regression analysis is a fundamental statistical technique used to model the relationship between multiple variables. It is a very flexible tool that can handle a range of different data types (continuous, binary or categorical) and address both simple and complex research questions. In this hands-on workshop you will build on your understanding of simple linear regression and learn how to fit a broader range of regression models with R, including logistic regression.

While it is delivered as a standalone session, this workshop is designed as part of a series of Regression with R workshops, in which the content develops these ideas further to give you a comprehensive understanding of how regression can be used to address a broad range of questions.

The complete series includes:

1. Introduction to Regression with R
2. Regression Analysis in R: Adapting to Varied Data Types
3. Mixed Effects Regression with R

## Course Objectives
- Understand what a generalised linear model is
- Fit logistic regression models with R
- Select the appropriate regression model for either a continuous or binary outcome
- Include a range of different types of predictor variables in regression models
- Interpret the coefficients of a regression model
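
As a rough illustration of these objectives, the sketch below fits a linear model to a continuous outcome and a logistic model to a binary outcome, then extracts the coefficients. The data are simulated and the variable names (`age`, `score`, `passed`) are hypothetical; they are not taken from the workshop materials.

```r
# Simulated data: all variable names and values are hypothetical
set.seed(42)
n <- 200
age <- rnorm(n, mean = 40, sd = 10)
score <- 50 + 0.5 * age + rnorm(n, sd = 5)                     # continuous outcome
passed <- rbinom(n, size = 1, prob = plogis(-4 + 0.1 * age))   # binary outcome

# Continuous outcome: linear regression
fit_linear <- lm(score ~ age)

# Binary outcome: logistic regression, a generalised linear model
# with a binomial family (the logit link is the default)
fit_logistic <- glm(passed ~ age, family = binomial)

# Linear coefficients are changes in the outcome per unit of the predictor
coef(fit_linear)

# Logistic coefficients are on the log-odds scale; exponentiate for odds ratios
exp(coef(fit_logistic))
```

The only change between the two fits is the function and the `family` argument, which is what makes the generalised linear model framework convenient when the outcome type changes.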


## Pre-requisite Knowledge

Learners are expected to already be familiar with the basics of R, such as how to load a dataset from a local file and manipulate variables.

We also assume that you are comfortable fitting simple linear regression models in R and interpreting their output. If not, we recommend that you consult our pre-requisite course **Introduction to Regression with R**.


## Signup 

To check for upcoming course dates and to register, please visit the Workshop Schedule and Signup page available [here](../cfrr_program_details/courses_overview.ipynb).

In [1]:
import pandas as pd
from datetime import datetime
from IPython.display import display, HTML

# Define the course that is being looked at
course_name = "Regression Analysis in R: Adapting to Varied Data Types"

# Load the CSV file
file_path = '../data/workshop_info.csv'  # Adjust the path to your file location
courses_df = pd.read_csv(file_path)

# Strip any extra spaces in the column names
courses_df.columns = courses_df.columns.str.strip()

# Convert date columns to datetime
courses_df['Start Date'] = pd.to_datetime(courses_df['Start Date'], dayfirst=True, errors='coerce')
courses_df['End Date'] = pd.to_datetime(courses_df['End Date'], dayfirst=True, errors='coerce')

# Get today's date
today = datetime.now()

# Function to generate HTML text based on the course dates
def generate_html(row):
    if pd.notna(row['Start Date']) and pd.notna(row['End Date']):
        if row['Start Date'] <= today <= row['End Date']:
            return "<div style='font-weight: bold;'>This course is currently accepting applications.</div>"
    return ""

# Apply the function and create a new column for the HTML text
courses_df['HTML'] = courses_df.apply(generate_html, axis=1)

# Filter the DataFrame for the given course name and display the HTML text,
# falling back to an empty string if the course is not listed in the CSV
matches = courses_df.loc[courses_df['Course Name'] == course_name, 'HTML']
html_output = matches.iloc[0] if not matches.empty else ""
display(HTML(html_output))


## Installation Guide 

All CfRR courses require attendees to use their own computer/laptop to follow workshop activities and take effective notes. 

As this course extends upon *Introduction to R*, the installation instructions for the base R elements are the same, available [here](intro_to_r.ipynb).

Alongside R itself, three packages are needed for this workshop. The first two (devtools and learnr) are available from CRAN. The third (cfrrRtutorials) is a package we have developed containing the course materials, and is available from GitHub.

This code will install these three packages.
```
install.packages("devtools") 
install.packages("learnr") 
library(devtools)
devtools::install_github("coding-for-reproducible-research/cfrrRtutorials")
```

If you encounter any difficulties with installation, please reach out.

## Self Study Material Link 

The self-study material for this course is available as a tutorial through the cfrrRtutorials R package, installed as part of the setup for this workshop. To launch the "Regression Analysis in R: Adapting to Varied Data Types" tutorial in RStudio you can run:

```
learnr::run_tutorial("Regression Analysis in R: Adapting to Varied Data Types", "cfrrRtutorials")
```

When ready, the tutorial may launch in a new window, or (more likely) there will be some output in red in the **Jobs** console that says something like:

```
Output created: Regression-Analysis-in-R:-Adapting-to-Varied-Data-Types.html

Listening on http://127.0.0.1:38555

+------------------------------------------------------------------------+
<U+2713> Open the tutorial in your browser: http://127.0.0.1:38555
! Stop or cancel this job to stop running the tutorial
+------------------------------------------------------------------------+
```

If the tutorial does not launch automatically in a new window, copy the web address (in this case http://127.0.0.1:38555) into a web browser of your choice, such as Chrome.

You can then start the workshop in this document, navigating through the sections using the menu on the side. Please note that the data required for the examples and exercises is preloaded within each interactive document, so the commands and exercises only work within it. They won't work in the RStudio console. When you come to apply what you have learned to your own datasets, you will need to ensure your data is loaded and edit the syntax to model the relevant variables.
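
As a minimal sketch of that last step, the example below loads a dataset from a file, converts a categorical predictor to a factor, and fits a logistic regression. The file, the column names (`outcome`, `dose`, `group`) and the values are all made up for illustration; with your own data you would skip the file-writing step and point `read.csv()` at your own file.

```r
# Write a small example file so the sketch is self-contained;
# the columns and values here are hypothetical
example_file <- tempfile(fileext = ".csv")
write.csv(
  data.frame(
    outcome = c(0, 1, 0, 1, 1, 0, 1, 0, 1, 1, 0, 0),
    dose = c(1.2, 3.4, 2.1, 1.8, 4.0, 2.5, 3.9, 1.1, 2.2, 4.4, 3.0, 0.9),
    group = rep(c("control", "treated"), times = 6)
  ),
  example_file,
  row.names = FALSE
)

# Load the data and make sure the categorical predictor is a factor
my_data <- read.csv(example_file)
my_data$group <- factor(my_data$group)

# Edit the formula so that it names the variables in your own dataset
fit <- glm(outcome ~ dose + group, data = my_data, family = binomial)
summary(fit)
```

Here `dose` enters the model as a continuous predictor and `group` as a categorical one, with `control` as the reference level because factor levels are ordered alphabetically by default.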

## Developers

This workshop was developed by Eilis Hannon.


## License Info 

In [2]:
from IPython.display import Markdown, display

def display_markdown(file_path):
    try:
        with open(file_path, 'r', encoding='utf-8') as file:
            content = file.read()
            display(Markdown(content))
    except FileNotFoundError:
        print(f"The file {file_path} was not found.")
    except Exception as e:
        print(f"An error occurred: {e}")

# Example usage:
file_path = '../individual_modules/regression_analysis_with_R/LICENSE.txt'  # Replace with your file path
display_markdown(file_path)

### Instructional Material

The instructional material in this course is copyright © 2024 University of Exeter
and is made available under the [Creative Commons Attribution 4.0 International
licence](https://creativecommons.org/licenses/by/4.0/). Instructional material
consists of material that is contained within the "individual_modules/regression_analysis_with_R" folders in
this repository, with the exception of code snippets and example programs found
in files within these folders. Such code snippets and example programs are
considered software for the purposes of this licence. 


### Software

Except where otherwise noted, software provided in this repository is made
available under the MIT licence (https://opensource.org/licenses/MIT).

Copyright © 2024 University of Exeter

Permission is hereby granted, free of charge, to any person obtaining a copy
of this software and associated documentation files (the "Software"), to deal
in the Software without restriction, including without limitation the rights
to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
copies of the Software, and to permit persons to whom the Software is
furnished to do so, subject to the following conditions:

The above copyright notice and this permission notice shall be included in all
copies or substantial portions of the Software.

THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
SOFTWARE.

The software in this repository is adapted from software that is covered by the
following copyright and permission notice:

    Copyright © 2024 Software Carpentry

    Permission is hereby granted, free of charge, to any person obtaining
    a copy of this software and associated documentation files (the
    "Software"), to deal in the Software without restriction, including
    without limitation the rights to use, copy, modify, merge, publish,
    distribute, sublicense, and/or sell copies of the Software, and to
    permit persons to whom the Software is furnished to do so, subject to
    the following conditions:

    The above copyright notice and this permission notice shall be
    included in all copies or substantial portions of the Software.

    THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND,
    EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF
    MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND
    NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE
    LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION
    OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION
    WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.