[LITMUS^RT] litmus-dev Digest, Vol 60, Issue 4

Shuai Zhao zs673 at york.ac.uk
Tue Feb 21 15:39:06 CET 2017


Figures for the data are attached in case they do not show up inline.

Thanks.

On 21 February 2017 at 14:20, Shuai Zhao <zs673 at york.ac.uk> wrote:

> Hi Björn
>
> Thank you for your response.
>
> Yes, the kmalloc function is now out of the critical section.
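>
> For reference, here is a minimal sketch of the pre-allocation approach Meng
> suggested (the queue_node field and the helper name below are illustrative
> assumptions, not the actual implementation):
>
> /* done once per task, outside the traced critical section */
> int mrsp_prealloc_node(struct task_struct *t)
> {
>         struct task_list *n = kmalloc(sizeof(*n), GFP_KERNEL);
>         if (!n)
>                 return -ENOMEM;
>         n->task = t;
>         INIT_LIST_HEAD(&n->next);
>         t->rt_param.task_params.queue_node = n;  /* hypothetical field */
>         return 0;
> }
>
> /* inside the critical section: O(1), no allocation */
> static void add_task(struct task_struct *t, struct list_head *head)
> {
>         list_add_tail(&t->rt_param.task_params.queue_node->next, head);
> }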
>
> The code I posted in the email is protected by a spin lock to avoid race
> conditions. The tickets are maintained and obtained by atomic_t variables.
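>
> Concretely, the traced region sits inside the semaphore's spin lock,
> roughly like this (a sketch only; the lock field name and locking flavour
> are simplified, the atomic_t ticket is as described):
>
> unsigned long flags;
> unsigned int ticket;
>
> spin_lock_irqsave(&sem->lock, flags);      /* assumed field name */
>
> TS_LOCK_START;
> ticket = atomic_read(&sem->next_ticket);   /* take the next FIFO ticket */
> t->rt_param.task_params.ticket = ticket;
> atomic_inc(&sem->next_ticket);
> add_task(t, &(sem->tasks_queue->next));    /* enqueue on the semaphore */
> TS_LOCK_END;
>
> spin_unlock_irqrestore(&sem->lock, flags);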
>
> Using the feather-trace tool we get a CSV file in which all the overheads
> are recorded. I noticed that you processed the data as max, 99.9th
> percentile, 99th percentile and 95th percentile. I wonder what the
> rationale behind this is?
>
> Is it that the 99.9th or 99th percentile results filter out some of the
> outlier data that is influenced by system overheads or interrupts?
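>
> (For what it is worth, this is how I read such a percentile column: the
> 99.9th percentile is the value below which 99.9% of the samples fall, so
> the largest 0.1% of samples, which are the most likely to be disturbed by
> interrupts and other system activity, do not influence it. A throwaway
> sketch of the computation, assuming one overhead sample per line rather
> than the raw feather-trace CSV format:)
>
> /* percentiles.c: sort the samples and report max / 99.9th / 99th / 95th */
> #include <stdio.h>
> #include <stdlib.h>
>
> static int cmp(const void *a, const void *b)
> {
>         double x = *(const double *)a, y = *(const double *)b;
>         return (x > y) - (x < y);
> }
>
> static double perc(const double *v, size_t n, double p)
> {
>         return v[(size_t)(p * (n - 1))];  /* nearest-rank, no interpolation */
> }
>
> int main(void)
> {
>         static double v[2000000];
>         size_t n = 0;
>
>         while (n < 2000000 && scanf("%lf", &v[n]) == 1)
>                 n++;
>         if (n == 0)
>                 return 1;
>         qsort(v, n, sizeof(v[0]), cmp);
>         printf("max %.0f  99.9th %.1f  99th %.1f  95th %.1f\n",
>                v[n - 1], perc(v, n, 0.999), perc(v, n, 0.99), perc(v, n, 0.95));
>         return 0;
> }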
>
> For example: we tried to gather the overheads of the original
> pfp-scheduler. We ran this experiment with an increasing number of
> processors and expected a roughly constant overhead. However, the data we
> have is confusing. The number of samples for each test is above one
> million.
>
>
> We gather this data inside the pfp-scheduler (we put timestamps inside
> the run queue locks) to get the exact overheads for executing the
> scheduling code. The result above gives the max overhead we observed in
> each test.
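>
> For clarity, our probes sit roughly like this inside pfp_schedule()
> (a simplified sketch: the scheduling logic itself is elided, and the
> structure is abbreviated rather than a verbatim excerpt of sched_pfp.c):
>
> static struct task_struct* pfp_schedule(struct task_struct *prev)
> {
>         pfp_domain_t *pfp = local_pfp;
>         struct task_struct *next = NULL;
>
>         raw_spin_lock(&pfp->slock);        /* per-cpu run queue lock */
>
>         TS_SCHED_START;                    /* our probe: start */
>         /* ... P-FP scheduling decision (elided): requeue prev and pick
>          *     the highest-priority ready task of this partition ... */
>         TS_SCHED_END(next);                /* our probe: end */
>
>         raw_spin_unlock(&pfp->slock);
>         return next;
> }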
>
> As shown in the result, the overheads of the pfp-scheduler are extremely
> high when using 1, 2 and 4 cpus. By repeating the same tests, we can often
> observe such an extreme value, but with a different number of cpus.
>
>
> A simpler example: the lock function that I posted in the previous email,
> with the kmalloc removed. This piece of code has only one path and is O(1),
> which is supposed to have a stable overhead.
> However, as shown above, the overheads of the code are extremely high with
> 2, 14 and 15 cpus.
>
> I wonder whether you have met any situations like this before, and how you
> would explain such a result or how you solved the problem (if there is a
> solution).
>
> Do you have any suggestions when facing such a confusing result?
>
> Thank you in advance.
>
> Best wishes
> Shuai
>
> On 20 February 2017 at 11:14, Shuai Zhao <zs673 at york.ac.uk> wrote:
>
>> Hi Meng
>>
>> Thank you so much for this information.
>>
>> I will have a try :).
>>
>> Best wishes
>> Shuai
>>
>> On 20 February 2017 at 11:00, <litmus-dev-request at lists.litmus-rt.org>
>> wrote:
>>
>>> Send litmus-dev mailing list submissions to
>>>         litmus-dev at lists.litmus-rt.org
>>>
>>> To subscribe or unsubscribe via the World Wide Web, visit
>>>         https://lists.litmus-rt.org/listinfo/litmus-dev
>>> or, via email, send a message with subject or body 'help' to
>>>         litmus-dev-request at lists.litmus-rt.org
>>>
>>> You can reach the person managing the list at
>>>         litmus-dev-owner at lists.litmus-rt.org
>>>
>>> When replying, please edit your Subject line so it is more specific
>>> than "Re: Contents of litmus-dev digest..."
>>>
>>>
>>> Today's Topics:
>>>
>>>    1. Re: litmus-dev Digest, Vol 60, Issue 2 (Meng Xu)
>>>
>>>
>>> ----------------------------------------------------------------------
>>>
>>> Message: 1
>>> Date: Sun, 19 Feb 2017 17:34:52 -0500
>>> From: Meng Xu <xumengpanda at gmail.com>
>>> To: litmus-dev at lists.litmus-rt.org
>>> Subject: Re: [LITMUS^RT] litmus-dev Digest, Vol 60, Issue 2
>>> Message-ID:
>>>         <CAENZ-+kzpYR3kHNBZ=kDVaJMgvRvbViyQ-j256xCNWwQh9Eo0A at mail.gm
>>> ail.com>
>>> Content-Type: text/plain; charset=UTF-8
>>>
>>> On Sun, Feb 19, 2017 at 5:31 PM, Shuai Zhao <zs673 at york.ac.uk> wrote:
>>> > Hi Meng
>>> >
>>> > Thank you for your fast response.
>>> >
>>> > Yes, the CPU prefetch is already disabled, but there aren't any other
>>> > options for hardware prefetching. I guess it should be OK.
>>> >
>>> > The kmalloc() function can be one of the reasons. I will adjust the
>>> > code.
>>> >
>>> > BTW, are there any other features or facilities that could be disabled
>>> > to minimise system interference?
>>>
>>> How about cache prefetching? Did you disable it?
>>> You also need to keep the memory bus frequency constant.
>>>
>>> But I think such large variance may mainly come from the kmalloc().
>>>
>>> Best,
>>>
>>> Meng
>>>
>>>
>>> >
>>> >
>>> > On 19 February 2017 at 21:27, <litmus-dev-request at lists.litmus-rt.org>
>>> > wrote:
>>> >>
>>> >> Send litmus-dev mailing list submissions to
>>> >>         litmus-dev at lists.litmus-rt.org
>>> >>
>>> >> To subscribe or unsubscribe via the World Wide Web, visit
>>> >>         https://lists.litmus-rt.org/listinfo/litmus-dev
>>> >> or, via email, send a message with subject or body 'help' to
>>> >>         litmus-dev-request at lists.litmus-rt.org
>>> >>
>>> >> You can reach the person managing the list at
>>> >>         litmus-dev-owner at lists.litmus-rt.org
>>> >>
>>> >> When replying, please edit your Subject line so it is more specific
>>> >> than "Re: Contents of litmus-dev digest..."
>>> >>
>>> >>
>>> >> Today's Topics:
>>> >>
>>> >>    1. A question about feather-trace tool (Shuai Zhao)
>>> >>    2. Re: A question about feather-trace tool (Shuai Zhao)
>>> >>    3. Re: A question about feather-trace tool (Meng Xu)
>>> >>
>>> >>
>>> >> ------------------------------------------------------------
>>> ----------
>>> >>
>>> >> Message: 1
>>> >> Date: Sun, 19 Feb 2017 20:32:17 +0000
>>> >> From: Shuai Zhao <zs673 at york.ac.uk>
>>> >> To: litmus-dev at lists.litmus-rt.org
>>> >> Subject: [LITMUS^RT] A question about feather-trace tool
>>> >> Message-ID:
>>> >>
>>> >> <CAA133hO+_4Svyx6dJUyUn_rsnY50AfUVxLyOHKcsUN6+vG8UiQ at mail.gmail.com>
>>> >> Content-Type: text/plain; charset="utf-8"
>>> >>
>>> >> Hi Björn
>>> >>
>>> >> I am a student from the University of York, working with Alan and Andy
>>> >> to study MrsP nested behaviours.
>>> >>
>>> >> We now have a full implementation of nested MrsP under Litmus P-FP
>>> >> scheduler and we are now trying to do some evaluations of the
>>> >> implementation overheads.
>>> >>
>>> >> We use the feather-trace tool to trace the overheads of the scheduler
>>> >> (which includes the P-FP schedule function), context switch (includes
>>> >> finish_switch function), mrsp_lock and mrsp_unlock function.
>>> >>
>>> >> During evaluation, we fixed the CPU clock speed, bound interrupts to
>>> >> cpu 0 and isolated the other cpus for testing to minimise the
>>> >> interference from the system.
>>> >>
>>> >>
>>> >> However, the result seems weird. Take mrsp_lock as an example: we
>>> >> evaluated the overhead of the following code using the timestamps
>>> >> "TS_LOCK_START" and "TS_LOCK_END".
>>> >>
>>> >> TS_LOCK_START;
>>> >>
>>> >> /* if t is not currently being helped, boost it to the higher of its
>>> >>  * own priority and the ceiling priority of its partition */
>>> >> if (t->rt_param.task_params.helper == NULL) {
>>> >>         t->rt_param.task_params.priority =
>>> >>                 sem->prio_per_cpu[get_partition(t)] < get_priority(t) ?
>>> >>                 sem->prio_per_cpu[get_partition(t)] : get_priority(t);
>>> >>         t->rt_param.task_params.migrated_time = -1;
>>> >> }
>>> >>
>>> >> /* take the next FIFO ticket */
>>> >> ticket = atomic_read(&sem->next_ticket);
>>> >> t->rt_param.task_params.ticket = ticket;
>>> >> atomic_inc(&sem->next_ticket);
>>> >>
>>> >> /* append t to the semaphore's waiting queue */
>>> >> add_task(t, &(sem->tasks_queue->next));
>>> >> t->rt_param.task_params.requesting_lock = sem;
>>> >>
>>> >> TS_LOCK_END;
>>> >>
>>> >> Where function add_task() is as follows:
>>> >>
>>> >> void add_task(struct task_struct *task, struct list_head *head)
>>> >> {
>>> >>         struct task_list *taskPtr = (struct task_list *)
>>> >>                 kmalloc(sizeof(struct task_list), GFP_KERNEL);
>>> >>         BUG_ON(taskPtr == NULL);
>>> >>
>>> >>         taskPtr->task = task;
>>> >>         INIT_LIST_HEAD(&taskPtr->next);
>>> >>         list_add_tail(&taskPtr->next, head);
>>> >> }
>>> >>
>>> >> We expect the overheads of the code above to be stable as the time
>>> >> complexity is O(1). However, the testing result gives us a different
>>> >> story, as shown below:
>>> >>
>>> >> Overhead:       MRSP LOCK
>>> >> Unit:           cycles
>>> >> Samples:        149985
>>> >> Max:            20089
>>> >> 99.9th perc.:   2220.016
>>> >> 99th perc.:     1584
>>> >> 95th perc.:     1287
>>> >> Avg:            853.9508
>>> >> Med:            939
>>> >> Min:            319
>>> >> Std:            367.314
>>> >> Var:            134918.7
>>> >>
>>> >> As we can see, the max overhead we have is 20089 cycles, which is far
>>> >> bigger than the med/avg value and even the 99.9th percentile value.
>>> >>
>>> >> I am confused about this result. I wonder whether you have met this
>>> >> situation before? Is there any explanation for a result like this, or
>>> >> is there any way to avoid it?
>>> >>
>>> >> Thank you in advance.
>>> >>
>>> >> Best wishes
>>> >> Shuai
>>> >> -------------- next part --------------
>>> >> An HTML attachment was scrubbed...
>>> >> URL:
>>> >> <http://lists.litmus-rt.org/pipermail/litmus-dev/attachments
>>> /20170219/d6859058/attachment-0001.html>
>>> >>
>>> >> ------------------------------
>>> >>
>>> >> Message: 2
>>> >> Date: Sun, 19 Feb 2017 20:37:52 +0000
>>> >> From: Shuai Zhao <zs673 at york.ac.uk>
>>> >> To: litmus-dev at lists.litmus-rt.org
>>> >> Subject: Re: [LITMUS^RT] A question about feather-trace tool
>>> >> Message-ID:
>>> >>
>>> >> <CAA133hM4boX5fLX1+PERwbxtvymOFOonn0wwV2r3q9s3hehBog at mail.gmail.com>
>>> >> Content-Type: text/plain; charset="utf-8"
>>> >>
>>> >> PS: I commented out the original "TS_LOCK_START" and "TS_LOCK_END"
>>> stamps
>>> >> in sys_lock function. thanks.
>>> >>
>>> >> On 19 February 2017 at 20:32, Shuai Zhao <zs673 at york.ac.uk> wrote:
>>> >>
>>> >> > Hi Björn
>>> >> >
>>> >> > I am a student from the University of York working with Alan and
>>> Andy to
>>> >> > study MrsP nested behaviours now.
>>> >> >
>>> >> > We now have a full implementation of nested MrsP under Litmus P-FP
>>> >> > scheduler and we are now trying to do some evaluations of the
>>> >> > implementation overheads.
>>> >> >
>>> >> > We use the feather-trace tool to trace the overheads of the
>>> scheduler
>>> >> > (which includes the P-FP schedule function), context switch
>>> (includes
>>> >> > finish_switch function), mrsp_lock and mrsp_unlock function.
>>> >> >
>>> >> > During evaluation, we fixed the CPU clock speed, bound interrupts to
>>> >> > cpu 0 and isolated the other cpus for testing to minimise the
>>> >> > interference from the system.
>>> >> >
>>> >> >
>>> >> > However, the result seems weird. Take mrsp_lock as an example: we
>>> >> > evaluated the overhead of the following code using the timestamps
>>> >> > "TS_LOCK_START" and "TS_LOCK_END".
>>> >> >
>>> >> > TS_LOCK_START;
>>> >> >
>>> >> > if (t->rt_param.task_params.helper == NULL) {
>>> >> > t->rt_param.task_params.priority = sem->prio_per_cpu[get_partition(t)]
>>> <
>>> >> > get_priority(t) ? sem->prio_per_cpu[get_partition(t)] :
>>> get_priority(t);
>>> >> > t->rt_param.task_params.migrated_time = -1;
>>> >> > }
>>> >> >
>>> >> > ticket = atomic_read(&sem->next_ticket);
>>> >> > t->rt_param.task_params.ticket = ticket;
>>> >> > atomic_inc(&sem->next_ticket);
>>> >> >
>>> >> > add_task(t, &(sem->tasks_queue->next));
>>> >> > t->rt_param.task_params.requesting_lock = sem;
>>> >> >
>>> >> > TS_LOCK_END;
>>> >> >
>>> >> > Where function add_task() is as follows:
>>> >> >
>>> >> > void add_task(struct task_struct* task, struct list_head *head) {
>>> >> > struct task_list *taskPtr = (struct task_list *)
>>> kmalloc(sizeof(struct
>>> >> > task_list), GFP_KERNEL);
>>> >> > BUG_ON(taskPtr == NULL);
>>> >> >
>>> >> > taskPtr->task = task;
>>> >> > INIT_LIST_HEAD(&taskPtr->next);
>>> >> > list_add_tail(&taskPtr->next, head);
>>> >> > }
>>> >> >
>>> >> > We expect the overheads of the code above to be stable as the time
>>> >> > complexity is order 1. However, the testing result gives us a
>>> different
>>> >> > story, as shown below:
>>> >> >
>>> >> > Overhead:       MRSP LOCK
>>> >> > Unit:           cycles
>>> >> > Samples:        149985
>>> >> > Max:            20089
>>> >> > 99.9th perc.:   2220.016
>>> >> > 99th perc.:     1584
>>> >> > 95th perc.:     1287
>>> >> > Avg:            853.9508
>>> >> > Med:            939
>>> >> > Min:            319
>>> >> > Std:            367.314
>>> >> > Var:            134918.7
>>> >> >
>>> >> > As we can see, the max overhead we have is 20089 cycles, which is
>>> >> > far bigger than the med/avg value and even the 99.9th percentile
>>> >> > value.
>>> >> >
>>> >> > I am confused about this result. I wonder whether you have met this
>>> >> > situation before? Is there any explanation for a result like this,
>>> >> > or is there any way to avoid it?
>>> >> >
>>> >> > Thank you in advance.
>>> >> >
>>> >> > Best wishes
>>> >> > Shuai
>>> >> >
>>> >> -------------- next part --------------
>>> >> An HTML attachment was scrubbed...
>>> >> URL:
>>> >> <http://lists.litmus-rt.org/pipermail/litmus-dev/attachments
>>> /20170219/824d81ab/attachment-0001.html>
>>> >>
>>> >> ------------------------------
>>> >>
>>> >> Message: 3
>>> >> Date: Sun, 19 Feb 2017 16:27:26 -0500
>>> >> From: Meng Xu <xumengpanda at gmail.com>
>>> >> To: litmus-dev at lists.litmus-rt.org
>>> >> Subject: Re: [LITMUS^RT] A question about feather-trace tool
>>> >> Message-ID:
>>> >>
>>> >> <CAENZ-+mfQQ+-5qxbJOyh2UEPFrhvPp2OzsB4dYXUG_r=tNj95A at mail.gmail.com>
>>> >> Content-Type: text/plain; charset=UTF-8
>>> >>
>>> >> On Sun, Feb 19, 2017 at 3:32 PM, Shuai Zhao <zs673 at york.ac.uk> wrote:
>>> >> >
>>> >> > Hi Björn
>>> >>
>>> >>
>>> >> Hi,
>>> >>
>>> >> Can I hijack the question? ;-)
>>> >>
>>> >> >
>>> >> >
>>> >> > I am a student from the University of York working with Alan and
>>> Andy to
>>> >> > study MrsP nested behaviours now.
>>> >> >
>>> >> > We now have a full implementation of nested MrsP under Litmus P-FP
>>> >> > scheduler and we are now trying to do some evaluations of the
>>> implementation
>>> >> > overheads.
>>> >> >
>>> >> > We use the feather-trace tool to trace the overheads of the
>>> scheduler
>>> >> > (which includes the P-FP schedule function), context switch
>>> (includes
>>> >> > finish_switch function), mrsp_lock and mrsp_unlock function.
>>> >> >
>>> >> > During evaluation, we fixed the CPU clock speed, bound interrupts to
>>> >> > cpu 0 and isolated the other cpus for testing to minimise the
>>> >> > interference from the system.
>>> >>
>>> >>
>>> >> Did you disable the hardware prefetching mechanisms in BIOS?
>>> >> Maybe you want to disable them as well.
>>> >>
>>> >>
>>> >> >
>>> >> >
>>> >> >
>>> >> >
>>> >> >
>>> >> > However, the result seems weird. Take mrsp_lock as an example: we
>>> >> > evaluated the overhead of the following code using the timestamps
>>> >> > "TS_LOCK_START" and "TS_LOCK_END".
>>> >> >
>>> >> > TS_LOCK_START;
>>> >> >
>>> >> > if (t->rt_param.task_params.helper == NULL) {
>>> >> > t->rt_param.task_params.priority = sem->prio_per_cpu[get_partition(t)]
>>> <
>>> >> > get_priority(t) ? sem->prio_per_cpu[get_partition(t)] :
>>> get_priority(t);
>>> >> > t->rt_param.task_params.migrated_time = -1;
>>> >> > }
>>> >> >
>>> >> > ticket = atomic_read(&sem->next_ticket);
>>> >> > t->rt_param.task_params.ticket = ticket;
>>> >> > atomic_inc(&sem->next_ticket);
>>> >> >
>>> >> > add_task(t, &(sem->tasks_queue->next));
>>> >> > t->rt_param.task_params.requesting_lock = sem;
>>> >> >
>>> >> > TS_LOCK_END;
>>> >> >
>>> >> > Where function add_task() is as follows:
>>> >> >
>>> >> > void add_task(struct task_struct* task, struct list_head *head) {
>>> >> > struct task_list *taskPtr = (struct task_list *)
>>> kmalloc(sizeof(struct
>>> >> > task_list),
>>> >> >
>>> >> > GFP_KERNEL);
>>> >>
>>> >>
>>> >> I guess the variance comes from the kmalloc().
>>> >> kmalloc() latency varies a lot.
>>> >>
>>> >> You should avoid kmalloc() in the critical section.
>>> >> One approach is pre-allocating the space and re-initializing the value
>>> >> each time you want to use it.
>>> >>
>>> >> You can also measure the overhead value spent in this function to
>>> >> validate my speculation.
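>>> >>
>>> >> For instance, a quick-and-dirty check (a sketch; it reads the TSC via
>>> >> get_cycles() around the call instead of adding a new feather-trace
>>> >> event):
>>> >>
>>> >> cycles_t before, after;   /* cycles_t comes from <asm/timex.h> */
>>> >>
>>> >> before = get_cycles();
>>> >> add_task(t, &(sem->tasks_queue->next));
>>> >> after = get_cycles();
>>> >> printk(KERN_DEBUG "add_task: %llu cycles\n",
>>> >>        (unsigned long long)(after - before));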
>>> >>
>>> >>
>>> >> >
>>> >> > BUG_ON(taskPtr == NULL);
>>> >> >
>>> >> > taskPtr->task = task;
>>> >> > INIT_LIST_HEAD(&taskPtr->next);
>>> >> > list_add_tail(&taskPtr->next, head);
>>> >> > }
>>> >> >
>>> >> > We expect the overheads of the code above to be stable as the time
>>> >> > complexity is order 1. However, the testing result gives us a
>>> different
>>> >> > story, as shown below:
>>> >> >
>>> >> > Overhead:       MRSP LOCK
>>> >> > Unit:           cycles
>>> >> > Samples:        149985
>>> >> > Max:            20089
>>> >> > 99.9th perc.:   2220.016
>>> >> > 99th perc.:     1584
>>> >> > 95th perc.:     1287
>>> >> > Avg:            853.9508
>>> >> > Med:            939
>>> >> > Min:            319
>>> >> > Std:            367.314
>>> >> > Var:            134918.7
>>> >>
>>> >>
>>> >> This is a bit difficult to read. Next time when you send new results,
>>> >> could you please send them in columns? ;-)
>>> >>
>>> >>
>>> >> >
>>> >> >
>>> >> > As we can see, the max overhead we have is 20089 cycles, which is
>>> >> > far bigger than the med/avg value and even the 99.9th percentile
>>> >> > value.
>>> >> >
>>> >> > I am confused about this result. I wonder whether you have met this
>>> >> > situation before? Is there any explanation for a result like this,
>>> >> > or is there any way to avoid it?
>>> >>
>>> >>
>>> >> You should be able to avoid this, as I mentioned above.
>>> >>
>>> >> Best,
>>> >>
>>> >> Meng
>>> >>
>>> >>
>>> >> --
>>> >> -----------
>>> >> Meng Xu
>>> >> PhD Student in Computer and Information Science
>>> >> University of Pennsylvania
>>> >> http://www.cis.upenn.edu/~mengxu/
>>> >>
>>> >>
>>> >>
>>> >> ------------------------------
>>> >>
>>> >> Subject: Digest Footer
>>> >>
>>> >> _______________________________________________
>>> >> litmus-dev mailing list
>>> >> litmus-dev at lists.litmus-rt.org
>>> >> https://lists.litmus-rt.org/listinfo/litmus-dev
>>> >>
>>> >>
>>> >> ------------------------------
>>> >>
>>> >> End of litmus-dev Digest, Vol 60, Issue 2
>>> >> *****************************************
>>> >
>>> >
>>> >
>>> > _______________________________________________
>>> > litmus-dev mailing list
>>> > litmus-dev at lists.litmus-rt.org
>>> > https://lists.litmus-rt.org/listinfo/litmus-dev
>>> >
>>>
>>>
>>>
>>> --
>>> -----------
>>> Meng Xu
>>> PhD Student in Computer and Information Science
>>> University of Pennsylvania
>>> http://www.cis.upenn.edu/~mengxu/
>>>
>>>
>>>
>>> ------------------------------
>>>
>>> Subject: Digest Footer
>>>
>>> _______________________________________________
>>> litmus-dev mailing list
>>> litmus-dev at lists.litmus-rt.org
>>> https://lists.litmus-rt.org/listinfo/litmus-dev
>>>
>>>
>>> ------------------------------
>>>
>>> End of litmus-dev Digest, Vol 60, Issue 4
>>> *****************************************
>>>
>>
>>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.litmus-rt.org/pipermail/litmus-dev/attachments/20170221/fad4aee7/attachment-0001.html>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: Screen Shot 2017-02-21 at 13.31.46.png
Type: image/png
Size: 11783 bytes
Desc: not available
URL: <http://lists.litmus-rt.org/pipermail/litmus-dev/attachments/20170221/fad4aee7/attachment-0004.png>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: Screen Shot 2017-02-21 at 13.23.45.png
Type: image/png
Size: 13041 bytes
Desc: not available
URL: <http://lists.litmus-rt.org/pipermail/litmus-dev/attachments/20170221/fad4aee7/attachment-0005.png>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: lock.PNG
Type: image/png
Size: 6323 bytes
Desc: not available
URL: <http://lists.litmus-rt.org/pipermail/litmus-dev/attachments/20170221/fad4aee7/attachment-0006.png>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: pfp scheduler.PNG
Type: image/png
Size: 7481 bytes
Desc: not available
URL: <http://lists.litmus-rt.org/pipermail/litmus-dev/attachments/20170221/fad4aee7/attachment-0007.png>


More information about the litmus-dev mailing list