Week 5 – Introduction to generic functions in C part 2

Posted by Unknown Posted on - - 0 comments

Week 5 – Introduction to generic functions in C part 2

When we left you last time, we were probably about 60 percent of the way through my lsearch implementation. We are trying to go from type specific to generic, but I'm trying to do that in the C language.

Void *lsearch (void *key ,void base,int n ,int elemsize, int (cmpfn)(void *,void *))

this is the header for our linear search

♠key refers to the element we need to find

♠base refers to the base address

♠n refers to the no of elements

♠and elemsize refers to the size of each element

but the last argument looks a bit confusing ... lets figure out whats the last argument all about

now lets look at the body of the function

for(i=0;i

void * elemaddr = (char*)base +i*elemsize ;

/* elemaddr is the address of the key ... here a for loop to increase the valueof the base pointer each time by elemsize *\

now lets get to the mystery argument we had left out on closer evaluation we see that the argument is actually a function with the aruments as two void * this arguments is used to compare two elements of any type and it returns 0 if they match or some other integer otherwise

if (cmpfn(key,elemaddr)==0)

return elemaddr;

now lets consider an example

int array[] ={4,2,3,7,11,6};

int size =6 ;

now if i have to search for a number say 7 i have to do the following

int number =7 ;

now i have to set aside space for the key element i am interested in as i have to pass that as the very first element to the function

array is represented in the following way

<----elemsize--->

↑ ↑

base address key address

now lets see how to call lsearch for this example

int *found = lsearch(&number,array,6,sizeof(int),int cmpfn);

if (found==NULL)

dint get number

else

number was found

we have to define the int cmpfn function before we can use it so lets write the cmpfn function

now the cmp function has to take two void * and return an integer (prototype) , this is the fifth parameter in our lsearch function

you may think cant i just write a fucntion taking two int * and returning an int ... the answer is you could since all pointers are of same size .. you would pass them as int * and they would be absorbed as void * ... but its better to write the comparison function to match that prototype exatly

my function take two pointers ... one is the key which always remains constant and the other is the element address which keeps changing unless a match is found

i also have to return a 0 or any other integer depending whether there is a match or not

now since im writing the code i know that the two void * are really int * .. therefore i can reinterpret them to be the int * they really are

int intcmp(void * elem1,void *elem2)
{
int *ip1 =elem1;
int *ip2=elem 2;

return (*ip1 -*ip2) /* returns the difference between element if zero ... its a match */
}

this may be difficult to understand the first time you see it .. its not the most elegant way to support generics .. its just the best C has to offer ... all the other languages are much younger and they have learnt forms C's mistakes and they have better solutions to such problems

here is the full program :

#include
#include
#include

void *lsearch(void *key,void*base,int n,int elemsize,int(cmpfn)(void*,void*))
{
int i;
for(i=0;i

{
void *elemaddr=(char*)base+i*elemsize;
if(cmpfn(key,elemaddr)==0)
return elemaddr;
}}

int cmpfn(void *elem1,void *elem2)
{
int *ip1=elem1;
int *ip2=elem2;
return(*ip1-*ip2);
}

main()
{
int array[]={4,2,3,7,11,6};
int size=6;
int number =7;
int *found=lsearch(&number,array,6,sizeof(int),cmpfn);
if(found==NULL)
printf("we dint get the number\n");
else
printf("we found the number\n");
}

But there are some plus points like this method is very fast, you use one copy of lsearch to do all your searching .. the template approach you get one instance of that lsearch algorithm for every datatype you search for

this method isnt that difficult to understand while searching for integers but its starts to get a lot more complicated when we start lsearching an array of c strings so you will have an array of char * and well have to search for a specific char * to see whether you have a match or not

consider the following code

char *letters [] ={“a”,”f”,”b”,”g”,”d”};
char *favletter =”e”;
lsearch(&favletter,letters,5,sizeof(char*),StrCmp);

this is array is represents in the memory as follows

↓ ↓ ↓ ↓ ↓ ↓

base pointin to a pointing to f pointing to b pointing to g pointing to d

address

in the function we pass the address of the favletter , we pass the letters array which is the base address of the first element ,no of elements ,size of elements which is char * (why char *) .. so that we know the space between the elements,and StrCmp .. the compare function which we have to write

the reason we are passing the address of favletter rather than the value its self .. is that we want the key to be of the same type as of as all the pointers that are computed manually in the lsearch

this is beacause though elemenst are passed as char * we know that they are acutally char** that is they are two hops away from the element .. and it would make our life easier if we passed the key as char** so that there will be some symmetry in the code

now lets write the StrCmp function

int StrCmp(void*vp1,void *vp2)
{
char * s1= * (char **)vp1;
char * s2= * (char **)vp2;
return strcmp(s1,s2);
}

the first argument is &favletter and the other argument is the function to which it has to be compared with ...now the function body look really very confusing the first time you see it..but if you examine it carefully youll find that we are casting vp1 and vp2 to be the type we know they are ..

they are two hops away from going to charecters (that is why we use char** instead of char*) after that we dereference them once so that the address of the key and the present element are sitting in s1 and s2 ... we do this because there a function in c library which compares character arrays by passing the address of the pointer to the first character this function returns a zero for match otherwise the difference between the ascii key values

lets consider the same function and if we forget to pass &favletter and instead pass favletter

our compare function still interprets its as a char** not char* it would set theit address of 'e' to be char** instead of char*

it would still derefrence it but would now represent 'e' to be a char * which actually is a char ,and it would pass that to strcmp which will not be good as it will jump to the 'e' mystery address and assume there are characters there

now suppose u dont want to deal with a double dereference when u dont need to .you can wanna pass just favletter as your argument

the body of the function gets changed

char * s1= (char *)vp1;
char * s2= * (char **)vp2;
return strcmp(s1,s2);

we can recognise that the first argument is 1 hop away from the characters and the second one is two hops away

the way the function was written first is the way its usually written due to the symmetry of dealing with the same incoming datatype

we can write it anyway .. both methods work

this is the full program :

#include
#include
#include

int StrCmp(void *vp1,void *vp2)
{
char *s1=*(char**)vp1;
char *s2=*(char**)vp2;
return (strcmp(s1,s2));
}

void lsearch(void *key,void*base,int n,int elemsize,int(StrCmp)(void*,void*))
{
int i;
for(i=0;i

{
void *elemaddr=(char*)base+i*elemsize;
if(StrCmp(key,elemaddr)==0)
{
printf("element has a match\n");
break ;
}
}
printf("element has no match\n");
}

main()
{
char *letters[]={"a","f","b","g","d"};
char *favletter="e";
lsearch(&favletter,letters,5,sizeof(char*),StrCmp);

}

now that we have a pretty good idea about genric code ... lets turn our attention to implementing a stack datastructure

well start by making it int specify just so that we have a clear idea how the generic should be implemented

well be implementing everything in int so there is no void* business yet

well try to imitate the way we use stacks in c++ as we are more use to it

as there is no class keyword in C well be using struct

and we have to public and no private

well try to come as close as we can to implementing a class using C code

typedef struct{
int *elems;
int logicallen;
int alloclength;
}stacks;

since we are trying to implement it as a class ... lets not manipulate the element inside stacks diresctly

well try using function to manipulate them

we want to write a constructor function, destructor function, and then we want to write an is empty function, a pop function, a push function, things like that.

So here’s the prototype of the first thing I’m interested in:

void stacknew (stack *s);
void stackdispose(stack *s);
void stackpush(stack *s,int value);
int stackpop(stack *s);

♠in stack new all we are gonna do is pass in or expect the address of some stack that’s already been allocated.

♠we also have this function stack dispose. we want to identify the address of the stack structure that should be disposed. This is gonna be a dynamically allocated array that’s not perfectly sized. So we want to keep track of how much space I have (logicallen) and how much of it I’m using (alloclengh)

♠stackpush and stackpop is used to push and pop elements

lets see a picture of what happens when we declare a element of stack

stack s ;

That means that this, as a picture, is gonna be set aside. And you know, based on what we’ve talked about in the past, that it’s 12 bytes

if the elems field is at the bottom, and that the two integers are stacked on top of it. But as far as the declaration is concerned, it doesn’t actually clear these out, or zero them out like Java does, it just inherits whatever bits happen to reside in the 12 bytes that are

overlaid by this new variable.

stacknew(&s) It’s when we call stack new that we pass in the address of this. Why does that work, and why

do we know it can work?

Because we identify the location of my question-mark-holding

stack pass into a block of code that we’re gonna allow to actually manipulate these three

fields. And I’m going to logically do the following:

I’m gonna take the raw space, that’s set up this way. I’m gonna set it’s length to be zero.

I’m gonna make space for four elements. And I’m gonna store the address of a

dynamically allocated array of integers, where these question marks are right here, and

initialize the things. That means that that number can be used not only to store

the effective depth of the stack, but it can also identify the index where I’d like to place

the next integer to be pushed.

So because I’m pre-allocating space for four elements, that means that this is a function.

It’s gonna be able to run very, very quickly for the very first calls. And it’s only when I

actually push a fifth element, that I detect that allocated space has been saturated, that I

have to recover and panic a little bit and say, oh, I have to put these things somewhere

else. That it’ll actually go and allocate another array that’s twice as big, and move

everything over, and then carry on as if the array were of length 8 instead of 4 all along.

You can imagine that when we go generic this is still gonna be an array, just like the

arrays passed to lsearch are, but we’re gonna have to manually compute the

insertion index to house the next push call, or to accommodate the next push call, and do

the same thing for pop. we do this

stacknew(&s)
{
for(int i=o;i<5;i++)
stackpush(&s,i);
}

So what has to happen is that on the very last iteration here I have

to do that little panic thing, where I say, I don’t have space for the 4. So I have to go and

allocate space for everything.

So I’m gonna use this doubling strategy, where I’ve gotta set aside space for eight

elements. I copy over all the data that was already here. I free this. Get this to point to this

space as if it were the original figure I’ve allocated. Forget about that smaller house, I’ve

moved into a bigger house and I hated the older house. And then I can finally put down

the 4 and make this a 5 and make this an 8. So that’s the generic algorithm we’re gonna

be following for this code

I want to implement stack new. So take a stack address, just like that, recognize that s is a

local variable. It has to make the assumption that it’s pointing to this 12-byte figure

Void stacknew(stack *s)
{
s->logicallen=0;
s->allocelem=4;
s->elems=malloc(4*size0f(int));
assert(s->elems !=num);

So what I want to do is I want to go in. I want to s arrow logical n = zero. I want to do s

arrow alloc len = 4, and allocate dynamic memeory using malloc for elems

There is this function called assert. It’s actually

not a function it’s actually a macro. There’s this thing you can invoke called assert in the

code, which takes a boolean value, takes something that functions as a test. It effectively

becomes a no op if this test is true, but if this is false assert actually ends the program and

tells you what line the program ended at. It tells you the file number containing and the

line number of the assert that broke.

The idea here is that you want to assert the truth of some condition, or assert that some

condition is being met, before you carry forward. Because if I don’t put this here and

malloc failed, it couldn’t find 16 bytes (That wouldn’t happen but just assume it could)

and it returned null, you don’t want to allow the program to run for 44 more seconds for

it to crash because you de-referenced a null pointer somewhere.

So when we come back next sunday we'll finish the rest of the three functions and then we will go

generic

Categories: class, linux, linux class, system programming

linuxcandyclone

Popular Posts

Week 5 – Introduction to generic functions in C part 2

Week 5 – Introduction to generic functions in C part 2

Leave a Reply

About Me

Category List

Blog Archive

Followers