Project

Complete the map_reduce function in mr.c

void map_reduce(mapper_t mapper, size_t num_mapper, reducer_t reducer,
                size_t num_reducer, kvlist_t* input, kvlist_t* output);

mapper is a function that performs the map operation.
num_mapper is the number of threads used to execute mapper.
- For example, if num_mapper == 8, map_reduce will spawn eight threads, each processing a subset of the data
reducer is a function that performs the reduce operation.
num_reducer is the number of threads used to execute reducer.
input is a list of key-value pairs that represent the data to process
output is a list of key-value pairs that map_reduce writes the results to.

Project 2: MapReduce

Project 2 is out!

MapReduce

Programming Model

Example: Word Counting Problem (1/3)

Example: Word Counting Problem (2/3)

Input: (document name, document contents)

Map(Input)

Example: Word Counting Problem (3/3)

Group by Key

Reduce

Project

`kvlist.h`

`kvpair_t`

`kvlist_t`

`kvlist_iterator_t`

`kvlist.h` in action

`hash.h`

`map_reduce` Structure

Split Phase

Map Phase

Shuffle Phase

Reduce Phase

Output

Additional Functionality

Testing with `word-count`

`pthread` API

Creating Threads

Joining Threads

Example: Create Thread

Example: Pass value to threads

Example: Data Race

Example: `mutex`

Project 2: MapReduce

Project 2 is out!

MapReduce

Programming Model

Example: Word Counting Problem (1/3)

Example: Word Counting Problem (2/3)

Input: (document name, document contents)

Map(Input)

Example: Word Counting Problem (3/3)

Group by Key

Reduce

Project

kvlist.h

kvpair_t

kvlist_t

kvlist_iterator_t

kvlist.h in action

hash.h

map_reduce Structure

Split Phase

Map Phase

Shuffle Phase

Reduce Phase

Output

Additional Functionality

Testing with word-count

pthread API

Creating Threads

Joining Threads

Example: Create Thread

Example: Pass value to threads

Example: Data Race

Example: mutex

`kvlist.h`

`kvpair_t`

`kvlist_t`

`kvlist_iterator_t`

`kvlist.h` in action

`hash.h`

`map_reduce` Structure

Testing with `word-count`

`pthread` API

Example: `mutex`