License cleanup: add SPDX GPL-2.0 license identifier to files with no license
Many source files in the tree are missing licensing information, which
makes it harder for compliance tools to determine the correct license.
By default all files without license information are under the default
license of the kernel, which is GPL version 2.
Update the files which contain no license information with the 'GPL-2.0'
SPDX license identifier. The SPDX identifier is a legally binding
shorthand, which can be used instead of the full boiler plate text.
This patch is based on work done by Thomas Gleixner and Kate Stewart and
Philippe Ombredanne.
How this work was done:
Patches were generated and checked against linux-4.14-rc6 for a subset of
the use cases:
- file had no licensing information it it.
- file was a */uapi/* one with no licensing information in it,
- file was a */uapi/* one with existing licensing information,
Further patches will be generated in subsequent months to fix up cases
where non-standard license headers were used, and references to license
had to be inferred by heuristics based on keywords.
The analysis to determine which SPDX License Identifier to be applied to
a file was done in a spreadsheet of side by side results from of the
output of two independent scanners (ScanCode & Windriver) producing SPDX
tag:value files created by Philippe Ombredanne. Philippe prepared the
base worksheet, and did an initial spot review of a few 1000 files.
The 4.13 kernel was the starting point of the analysis with 60,537 files
assessed. Kate Stewart did a file by file comparison of the scanner
results in the spreadsheet to determine which SPDX license identifier(s)
to be applied to the file. She confirmed any determination that was not
immediately clear with lawyers working with the Linux Foundation.
Criteria used to select files for SPDX license identifier tagging was:
- Files considered eligible had to be source code files.
- Make and config files were included as candidates if they contained >5
lines of source
- File already had some variant of a license header in it (even if <5
lines).
All documentation files were explicitly excluded.
The following heuristics were used to determine which SPDX license
identifiers to apply.
- when both scanners couldn't find any license traces, file was
considered to have no license information in it, and the top level
COPYING file license applied.
For non */uapi/* files that summary was:
SPDX license identifier # files
---------------------------------------------------|-------
GPL-2.0 11139
and resulted in the first patch in this series.
If that file was a */uapi/* path one, it was "GPL-2.0 WITH
Linux-syscall-note" otherwise it was "GPL-2.0". Results of that was:
SPDX license identifier # files
---------------------------------------------------|-------
GPL-2.0 WITH Linux-syscall-note 930
and resulted in the second patch in this series.
- if a file had some form of licensing information in it, and was one
of the */uapi/* ones, it was denoted with the Linux-syscall-note if
any GPL family license was found in the file or had no licensing in
it (per prior point). Results summary:
SPDX license identifier # files
---------------------------------------------------|------
GPL-2.0 WITH Linux-syscall-note 270
GPL-2.0+ WITH Linux-syscall-note 169
((GPL-2.0 WITH Linux-syscall-note) OR BSD-2-Clause) 21
((GPL-2.0 WITH Linux-syscall-note) OR BSD-3-Clause) 17
LGPL-2.1+ WITH Linux-syscall-note 15
GPL-1.0+ WITH Linux-syscall-note 14
((GPL-2.0+ WITH Linux-syscall-note) OR BSD-3-Clause) 5
LGPL-2.0+ WITH Linux-syscall-note 4
LGPL-2.1 WITH Linux-syscall-note 3
((GPL-2.0 WITH Linux-syscall-note) OR MIT) 3
((GPL-2.0 WITH Linux-syscall-note) AND MIT) 1
and that resulted in the third patch in this series.
- when the two scanners agreed on the detected license(s), that became
the concluded license(s).
- when there was disagreement between the two scanners (one detected a
license but the other didn't, or they both detected different
licenses) a manual inspection of the file occurred.
- In most cases a manual inspection of the information in the file
resulted in a clear resolution of the license that should apply (and
which scanner probably needed to revisit its heuristics).
- When it was not immediately clear, the license identifier was
confirmed with lawyers working with the Linux Foundation.
- If there was any question as to the appropriate license identifier,
the file was flagged for further research and to be revisited later
in time.
In total, over 70 hours of logged manual review was done on the
spreadsheet to determine the SPDX license identifiers to apply to the
source files by Kate, Philippe, Thomas and, in some cases, confirmation
by lawyers working with the Linux Foundation.
Kate also obtained a third independent scan of the 4.13 code base from
FOSSology, and compared selected files where the other two scanners
disagreed against that SPDX file, to see if there was new insights. The
Windriver scanner is based on an older version of FOSSology in part, so
they are related.
Thomas did random spot checks in about 500 files from the spreadsheets
for the uapi headers and agreed with SPDX license identifier in the
files he inspected. For the non-uapi files Thomas did random spot checks
in about 15000 files.
In initial set of patches against 4.14-rc6, 3 files were found to have
copy/paste license identifier errors, and have been fixed to reflect the
correct identifier.
Additionally Philippe spent 10 hours this week doing a detailed manual
inspection and review of the 12,461 patched files from the initial patch
version early this week with:
- a full scancode scan run, collecting the matched texts, detected
license ids and scores
- reviewing anything where there was a license detected (about 500+
files) to ensure that the applied SPDX license was correct
- reviewing anything where there was no detection but the patch license
was not GPL-2.0 WITH Linux-syscall-note to ensure that the applied
SPDX license was correct
This produced a worksheet with 20 files needing minor correction. This
worksheet was then exported into 3 different .csv files for the
different types of files to be modified.
These .csv files were then reviewed by Greg. Thomas wrote a script to
parse the csv files and add the proper SPDX tag to the file, in the
format that the file expected. This script was further refined by Greg
based on the output to detect more types of files automatically and to
distinguish between header and source .c files (which need different
comment types.) Finally Greg ran the script using the .csv files to
generate the patches.
Reviewed-by: Kate Stewart <kstewart@linuxfoundation.org>
Reviewed-by: Philippe Ombredanne <pombredanne@nexb.com>
Reviewed-by: Thomas Gleixner <tglx@linutronix.de>
Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
2017-11-01 14:07:57 +00:00
|
|
|
/* SPDX-License-Identifier: GPL-2.0 */
|
2010-05-07 23:57:28 +00:00
|
|
|
#ifndef _ASM_X86_MSHYPER_H
|
|
|
|
#define _ASM_X86_MSHYPER_H
|
2010-05-06 19:08:41 +00:00
|
|
|
|
2010-05-07 23:57:28 +00:00
|
|
|
#include <linux/types.h>
|
2017-06-23 08:50:38 +00:00
|
|
|
#include <linux/atomic.h>
|
2017-08-02 16:09:17 +00:00
|
|
|
#include <linux/nmi.h>
|
2017-08-02 16:09:14 +00:00
|
|
|
#include <asm/io.h>
|
2018-03-20 14:02:05 +00:00
|
|
|
#include <asm/hyperv-tlfs.h>
|
2018-01-11 21:46:30 +00:00
|
|
|
#include <asm/nospec-branch.h>
|
2010-05-07 23:57:28 +00:00
|
|
|
|
|
|
|
struct ms_hyperv_info {
|
|
|
|
u32 features;
|
2015-08-01 23:08:20 +00:00
|
|
|
u32 misc_features;
|
2010-05-07 23:57:28 +00:00
|
|
|
u32 hints;
|
2018-03-20 14:02:10 +00:00
|
|
|
u32 nested_features;
|
2017-06-25 17:06:41 +00:00
|
|
|
u32 max_vp_index;
|
|
|
|
u32 max_lp_index;
|
2010-05-07 23:57:28 +00:00
|
|
|
};
|
|
|
|
|
|
|
|
extern struct ms_hyperv_info ms_hyperv;
|
2010-05-06 19:08:41 +00:00
|
|
|
|
2017-01-18 23:45:01 +00:00
|
|
|
|
|
|
|
/*
|
2018-03-20 14:02:06 +00:00
|
|
|
* Generate the guest ID.
|
2017-01-18 23:45:01 +00:00
|
|
|
*/
|
|
|
|
|
|
|
|
static inline __u64 generate_guest_id(__u64 d_info1, __u64 kernel_version,
|
|
|
|
__u64 d_info2)
|
|
|
|
{
|
|
|
|
__u64 guest_id = 0;
|
|
|
|
|
2017-02-04 15:46:23 +00:00
|
|
|
guest_id = (((__u64)HV_LINUX_VENDOR_ID) << 48);
|
2017-01-18 23:45:01 +00:00
|
|
|
guest_id |= (d_info1 << 48);
|
|
|
|
guest_id |= (kernel_version << 16);
|
|
|
|
guest_id |= d_info2;
|
|
|
|
|
|
|
|
return guest_id;
|
|
|
|
}
|
|
|
|
|
2017-01-19 18:51:50 +00:00
|
|
|
|
|
|
|
/* Free the message slot and signal end-of-message if required */
|
|
|
|
static inline void vmbus_signal_eom(struct hv_message *msg, u32 old_msg_type)
|
|
|
|
{
|
|
|
|
/*
|
|
|
|
* On crash we're reading some other CPU's message page and we need
|
|
|
|
* to be careful: this other CPU may already had cleared the header
|
|
|
|
* and the host may already had delivered some other message there.
|
|
|
|
* In case we blindly write msg->header.message_type we're going
|
|
|
|
* to lose it. We can still lose a message of the same type but
|
|
|
|
* we count on the fact that there can only be one
|
|
|
|
* CHANNELMSG_UNLOAD_RESPONSE and we don't care about other messages
|
|
|
|
* on crash.
|
|
|
|
*/
|
|
|
|
if (cmpxchg(&msg->header.message_type, old_msg_type,
|
|
|
|
HVMSG_NONE) != old_msg_type)
|
|
|
|
return;
|
|
|
|
|
|
|
|
/*
|
|
|
|
* Make sure the write to MessageType (ie set to
|
|
|
|
* HVMSG_NONE) happens before we read the
|
|
|
|
* MessagePending and EOMing. Otherwise, the EOMing
|
|
|
|
* will not deliver any more messages since there is
|
|
|
|
* no empty slot
|
|
|
|
*/
|
|
|
|
mb();
|
|
|
|
|
|
|
|
if (msg->header.message_flags.msg_pending) {
|
|
|
|
/*
|
|
|
|
* This will cause message queue rescan to
|
|
|
|
* possibly deliver another msg from the
|
|
|
|
* hypervisor
|
|
|
|
*/
|
|
|
|
wrmsrl(HV_X64_MSR_EOM, 0);
|
|
|
|
}
|
|
|
|
}
|
|
|
|
|
2017-01-19 18:51:51 +00:00
|
|
|
#define hv_init_timer(timer, tick) wrmsrl(timer, tick)
|
|
|
|
#define hv_init_timer_config(config, val) wrmsrl(config, val)
|
|
|
|
|
2017-01-19 18:51:54 +00:00
|
|
|
#define hv_get_simp(val) rdmsrl(HV_X64_MSR_SIMP, val)
|
|
|
|
#define hv_set_simp(val) wrmsrl(HV_X64_MSR_SIMP, val)
|
|
|
|
|
2017-01-19 18:51:55 +00:00
|
|
|
#define hv_get_siefp(val) rdmsrl(HV_X64_MSR_SIEFP, val)
|
|
|
|
#define hv_set_siefp(val) wrmsrl(HV_X64_MSR_SIEFP, val)
|
|
|
|
|
2017-01-19 18:51:56 +00:00
|
|
|
#define hv_get_synic_state(val) rdmsrl(HV_X64_MSR_SCONTROL, val)
|
|
|
|
#define hv_set_synic_state(val) wrmsrl(HV_X64_MSR_SCONTROL, val)
|
|
|
|
|
2017-01-19 18:51:57 +00:00
|
|
|
#define hv_get_vp_index(index) rdmsrl(HV_X64_MSR_VP_INDEX, index)
|
|
|
|
|
2017-01-19 18:51:58 +00:00
|
|
|
#define hv_get_synint_state(int_num, val) rdmsrl(int_num, val)
|
|
|
|
#define hv_set_synint_state(int_num, val) wrmsrl(int_num, val)
|
|
|
|
|
2013-02-04 01:22:39 +00:00
|
|
|
void hyperv_callback_vector(void);
|
2018-01-24 13:23:33 +00:00
|
|
|
void hyperv_reenlightenment_vector(void);
|
x86, trace: Add irq vector tracepoints
[Purpose of this patch]
As Vaibhav explained in the thread below, tracepoints for irq vectors
are useful.
http://www.spinics.net/lists/mm-commits/msg85707.html
<snip>
The current interrupt traces from irq_handler_entry and irq_handler_exit
provide when an interrupt is handled. They provide good data about when
the system has switched to kernel space and how it affects the currently
running processes.
There are some IRQ vectors which trigger the system into kernel space,
which are not handled in generic IRQ handlers. Tracing such events gives
us the information about IRQ interaction with other system events.
The trace also tells where the system is spending its time. We want to
know which cores are handling interrupts and how they are affecting other
processes in the system. Also, the trace provides information about when
the cores are idle and which interrupts are changing that state.
<snip>
On the other hand, my usecase is tracing just local timer event and
getting a value of instruction pointer.
I suggested to add an argument local timer event to get instruction pointer before.
But there is another way to get it with external module like systemtap.
So, I don't need to add any argument to irq vector tracepoints now.
[Patch Description]
Vaibhav's patch shared a trace point ,irq_vector_entry/irq_vector_exit, in all events.
But there is an above use case to trace specific irq_vector rather than tracing all events.
In this case, we are concerned about overhead due to unwanted events.
So, add following tracepoints instead of introducing irq_vector_entry/exit.
so that we can enable them independently.
- local_timer_vector
- reschedule_vector
- call_function_vector
- call_function_single_vector
- irq_work_entry_vector
- error_apic_vector
- thermal_apic_vector
- threshold_apic_vector
- spurious_apic_vector
- x86_platform_ipi_vector
Also, introduce a logic switching IDT at enabling/disabling time so that a time penalty
makes a zero when tracepoints are disabled. Detailed explanations are as follows.
- Create trace irq handlers with entering_irq()/exiting_irq().
- Create a new IDT, trace_idt_table, at boot time by adding a logic to
_set_gate(). It is just a copy of original idt table.
- Register the new handlers for tracpoints to the new IDT by introducing
macros to alloc_intr_gate() called at registering time of irq_vector handlers.
- Add checking, whether irq vector tracing is on/off, into load_current_idt().
This has to be done below debug checking for these reasons.
- Switching to debug IDT may be kicked while tracing is enabled.
- On the other hands, switching to trace IDT is kicked only when debugging
is disabled.
In addition, the new IDT is created only when CONFIG_TRACING is enabled to avoid being
used for other purposes.
Signed-off-by: Seiji Aguchi <seiji.aguchi@hds.com>
Link: http://lkml.kernel.org/r/51C323ED.5050708@hds.com
Signed-off-by: H. Peter Anvin <hpa@linux.intel.com>
Cc: Steven Rostedt <rostedt@goodmis.org>
2013-06-20 15:46:53 +00:00
|
|
|
#ifdef CONFIG_TRACING
|
|
|
|
#define trace_hyperv_callback_vector hyperv_callback_vector
|
|
|
|
#endif
|
2013-02-04 01:22:39 +00:00
|
|
|
void hyperv_vector_handler(struct pt_regs *regs);
|
2014-03-05 12:42:14 +00:00
|
|
|
void hv_setup_vmbus_irq(void (*handler)(void));
|
|
|
|
void hv_remove_vmbus_irq(void);
|
2013-02-04 01:22:39 +00:00
|
|
|
|
2015-08-01 23:08:07 +00:00
|
|
|
void hv_setup_kexec_handler(void (*handler)(void));
|
|
|
|
void hv_remove_kexec_handler(void);
|
2015-08-01 23:08:09 +00:00
|
|
|
void hv_setup_crash_handler(void (*handler)(struct pt_regs *regs));
|
|
|
|
void hv_remove_crash_handler(void);
|
2017-01-18 23:45:02 +00:00
|
|
|
|
2018-03-05 05:17:18 +00:00
|
|
|
/*
|
|
|
|
* Routines for stimer0 Direct Mode handling.
|
|
|
|
* On x86/x64, there are no percpu actions to take.
|
|
|
|
*/
|
|
|
|
void hv_stimer0_vector_handler(struct pt_regs *regs);
|
|
|
|
void hv_stimer0_callback_vector(void);
|
|
|
|
int hv_setup_stimer0_irq(int *irq, int *vector, void (*handler)(void));
|
|
|
|
void hv_remove_stimer0_irq(int irq);
|
|
|
|
|
|
|
|
static inline void hv_enable_stimer0_percpu_irq(int irq) {}
|
|
|
|
static inline void hv_disable_stimer0_percpu_irq(int irq) {}
|
|
|
|
|
|
|
|
|
2017-01-18 23:45:02 +00:00
|
|
|
#if IS_ENABLED(CONFIG_HYPERV)
|
2017-02-04 16:57:13 +00:00
|
|
|
extern struct clocksource *hyperv_cs;
|
2017-08-02 16:09:14 +00:00
|
|
|
extern void *hv_hypercall_pg;
|
|
|
|
|
|
|
|
static inline u64 hv_do_hypercall(u64 control, void *input, void *output)
|
|
|
|
{
|
|
|
|
u64 input_address = input ? virt_to_phys(input) : 0;
|
|
|
|
u64 output_address = output ? virt_to_phys(output) : 0;
|
|
|
|
u64 hv_status;
|
|
|
|
|
|
|
|
#ifdef CONFIG_X86_64
|
|
|
|
if (!hv_hypercall_pg)
|
|
|
|
return U64_MAX;
|
|
|
|
|
|
|
|
__asm__ __volatile__("mov %4, %%r8\n"
|
2018-01-11 21:46:30 +00:00
|
|
|
CALL_NOSPEC
|
x86/asm: Fix inline asm call constraints for Clang
For inline asm statements which have a CALL instruction, we list the
stack pointer as a constraint to convince GCC to ensure the frame
pointer is set up first:
static inline void foo()
{
register void *__sp asm(_ASM_SP);
asm("call bar" : "+r" (__sp))
}
Unfortunately, that pattern causes Clang to corrupt the stack pointer.
The fix is easy: convert the stack pointer register variable to a global
variable.
It should be noted that the end result is different based on the GCC
version. With GCC 6.4, this patch has exactly the same result as
before:
defconfig defconfig-nofp distro distro-nofp
before 9820389 9491555 8816046 8516940
after 9820389 9491555 8816046 8516940
With GCC 7.2, however, GCC's behavior has changed. It now changes its
behavior based on the conversion of the register variable to a global.
That somehow convinces it to *always* set up the frame pointer before
inserting *any* inline asm. (Therefore, listing the variable as an
output constraint is a no-op and is no longer necessary.) It's a bit
overkill, but the performance impact should be negligible. And in fact,
there's a nice improvement with frame pointers disabled:
defconfig defconfig-nofp distro distro-nofp
before 9796316 9468236 9076191 8790305
after 9796957 9464267 9076381 8785949
So in summary, while listing the stack pointer as an output constraint
is no longer necessary for newer versions of GCC, it's still needed for
older versions.
Suggested-by: Andrey Ryabinin <aryabinin@virtuozzo.com>
Reported-by: Matthias Kaehlcke <mka@chromium.org>
Signed-off-by: Josh Poimboeuf <jpoimboe@redhat.com>
Cc: Alexander Potapenko <glider@google.com>
Cc: Andy Lutomirski <luto@kernel.org>
Cc: Arnd Bergmann <arnd@arndb.de>
Cc: Dmitriy Vyukov <dvyukov@google.com>
Cc: Linus Torvalds <torvalds@linux-foundation.org>
Cc: Miguel Bernal Marin <miguel.bernal.marin@linux.intel.com>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: Thomas Gleixner <tglx@linutronix.de>
Link: http://lkml.kernel.org/r/3db862e970c432ae823cf515c52b54fec8270e0e.1505942196.git.jpoimboe@redhat.com
Signed-off-by: Ingo Molnar <mingo@kernel.org>
2017-09-20 21:24:33 +00:00
|
|
|
: "=a" (hv_status), ASM_CALL_CONSTRAINT,
|
2017-08-02 16:09:14 +00:00
|
|
|
"+c" (control), "+d" (input_address)
|
2018-01-11 21:46:30 +00:00
|
|
|
: "r" (output_address),
|
|
|
|
THUNK_TARGET(hv_hypercall_pg)
|
2017-08-02 16:09:14 +00:00
|
|
|
: "cc", "memory", "r8", "r9", "r10", "r11");
|
|
|
|
#else
|
|
|
|
u32 input_address_hi = upper_32_bits(input_address);
|
|
|
|
u32 input_address_lo = lower_32_bits(input_address);
|
|
|
|
u32 output_address_hi = upper_32_bits(output_address);
|
|
|
|
u32 output_address_lo = lower_32_bits(output_address);
|
|
|
|
|
|
|
|
if (!hv_hypercall_pg)
|
|
|
|
return U64_MAX;
|
|
|
|
|
2018-01-11 21:46:30 +00:00
|
|
|
__asm__ __volatile__(CALL_NOSPEC
|
2017-08-02 16:09:14 +00:00
|
|
|
: "=A" (hv_status),
|
x86/asm: Fix inline asm call constraints for Clang
For inline asm statements which have a CALL instruction, we list the
stack pointer as a constraint to convince GCC to ensure the frame
pointer is set up first:
static inline void foo()
{
register void *__sp asm(_ASM_SP);
asm("call bar" : "+r" (__sp))
}
Unfortunately, that pattern causes Clang to corrupt the stack pointer.
The fix is easy: convert the stack pointer register variable to a global
variable.
It should be noted that the end result is different based on the GCC
version. With GCC 6.4, this patch has exactly the same result as
before:
defconfig defconfig-nofp distro distro-nofp
before 9820389 9491555 8816046 8516940
after 9820389 9491555 8816046 8516940
With GCC 7.2, however, GCC's behavior has changed. It now changes its
behavior based on the conversion of the register variable to a global.
That somehow convinces it to *always* set up the frame pointer before
inserting *any* inline asm. (Therefore, listing the variable as an
output constraint is a no-op and is no longer necessary.) It's a bit
overkill, but the performance impact should be negligible. And in fact,
there's a nice improvement with frame pointers disabled:
defconfig defconfig-nofp distro distro-nofp
before 9796316 9468236 9076191 8790305
after 9796957 9464267 9076381 8785949
So in summary, while listing the stack pointer as an output constraint
is no longer necessary for newer versions of GCC, it's still needed for
older versions.
Suggested-by: Andrey Ryabinin <aryabinin@virtuozzo.com>
Reported-by: Matthias Kaehlcke <mka@chromium.org>
Signed-off-by: Josh Poimboeuf <jpoimboe@redhat.com>
Cc: Alexander Potapenko <glider@google.com>
Cc: Andy Lutomirski <luto@kernel.org>
Cc: Arnd Bergmann <arnd@arndb.de>
Cc: Dmitriy Vyukov <dvyukov@google.com>
Cc: Linus Torvalds <torvalds@linux-foundation.org>
Cc: Miguel Bernal Marin <miguel.bernal.marin@linux.intel.com>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: Thomas Gleixner <tglx@linutronix.de>
Link: http://lkml.kernel.org/r/3db862e970c432ae823cf515c52b54fec8270e0e.1505942196.git.jpoimboe@redhat.com
Signed-off-by: Ingo Molnar <mingo@kernel.org>
2017-09-20 21:24:33 +00:00
|
|
|
"+c" (input_address_lo), ASM_CALL_CONSTRAINT
|
2017-08-02 16:09:14 +00:00
|
|
|
: "A" (control),
|
|
|
|
"b" (input_address_hi),
|
|
|
|
"D"(output_address_hi), "S"(output_address_lo),
|
2018-01-11 21:46:30 +00:00
|
|
|
THUNK_TARGET(hv_hypercall_pg)
|
2017-08-02 16:09:14 +00:00
|
|
|
: "cc", "memory");
|
|
|
|
#endif /* !x86_64 */
|
|
|
|
return hv_status;
|
|
|
|
}
|
2017-02-04 16:57:13 +00:00
|
|
|
|
2017-08-02 16:09:15 +00:00
|
|
|
/* Fast hypercall with 8 bytes of input and no output */
|
|
|
|
static inline u64 hv_do_fast_hypercall8(u16 code, u64 input1)
|
|
|
|
{
|
|
|
|
u64 hv_status, control = (u64)code | HV_HYPERCALL_FAST_BIT;
|
|
|
|
|
|
|
|
#ifdef CONFIG_X86_64
|
|
|
|
{
|
2018-01-11 21:46:30 +00:00
|
|
|
__asm__ __volatile__(CALL_NOSPEC
|
x86/asm: Fix inline asm call constraints for Clang
For inline asm statements which have a CALL instruction, we list the
stack pointer as a constraint to convince GCC to ensure the frame
pointer is set up first:
static inline void foo()
{
register void *__sp asm(_ASM_SP);
asm("call bar" : "+r" (__sp))
}
Unfortunately, that pattern causes Clang to corrupt the stack pointer.
The fix is easy: convert the stack pointer register variable to a global
variable.
It should be noted that the end result is different based on the GCC
version. With GCC 6.4, this patch has exactly the same result as
before:
defconfig defconfig-nofp distro distro-nofp
before 9820389 9491555 8816046 8516940
after 9820389 9491555 8816046 8516940
With GCC 7.2, however, GCC's behavior has changed. It now changes its
behavior based on the conversion of the register variable to a global.
That somehow convinces it to *always* set up the frame pointer before
inserting *any* inline asm. (Therefore, listing the variable as an
output constraint is a no-op and is no longer necessary.) It's a bit
overkill, but the performance impact should be negligible. And in fact,
there's a nice improvement with frame pointers disabled:
defconfig defconfig-nofp distro distro-nofp
before 9796316 9468236 9076191 8790305
after 9796957 9464267 9076381 8785949
So in summary, while listing the stack pointer as an output constraint
is no longer necessary for newer versions of GCC, it's still needed for
older versions.
Suggested-by: Andrey Ryabinin <aryabinin@virtuozzo.com>
Reported-by: Matthias Kaehlcke <mka@chromium.org>
Signed-off-by: Josh Poimboeuf <jpoimboe@redhat.com>
Cc: Alexander Potapenko <glider@google.com>
Cc: Andy Lutomirski <luto@kernel.org>
Cc: Arnd Bergmann <arnd@arndb.de>
Cc: Dmitriy Vyukov <dvyukov@google.com>
Cc: Linus Torvalds <torvalds@linux-foundation.org>
Cc: Miguel Bernal Marin <miguel.bernal.marin@linux.intel.com>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: Thomas Gleixner <tglx@linutronix.de>
Link: http://lkml.kernel.org/r/3db862e970c432ae823cf515c52b54fec8270e0e.1505942196.git.jpoimboe@redhat.com
Signed-off-by: Ingo Molnar <mingo@kernel.org>
2017-09-20 21:24:33 +00:00
|
|
|
: "=a" (hv_status), ASM_CALL_CONSTRAINT,
|
2017-08-02 16:09:15 +00:00
|
|
|
"+c" (control), "+d" (input1)
|
2018-01-11 21:46:30 +00:00
|
|
|
: THUNK_TARGET(hv_hypercall_pg)
|
2017-08-02 16:09:15 +00:00
|
|
|
: "cc", "r8", "r9", "r10", "r11");
|
|
|
|
}
|
|
|
|
#else
|
|
|
|
{
|
|
|
|
u32 input1_hi = upper_32_bits(input1);
|
|
|
|
u32 input1_lo = lower_32_bits(input1);
|
|
|
|
|
2018-01-11 21:46:30 +00:00
|
|
|
__asm__ __volatile__ (CALL_NOSPEC
|
2017-08-02 16:09:15 +00:00
|
|
|
: "=A"(hv_status),
|
|
|
|
"+c"(input1_lo),
|
x86/asm: Fix inline asm call constraints for Clang
For inline asm statements which have a CALL instruction, we list the
stack pointer as a constraint to convince GCC to ensure the frame
pointer is set up first:
static inline void foo()
{
register void *__sp asm(_ASM_SP);
asm("call bar" : "+r" (__sp))
}
Unfortunately, that pattern causes Clang to corrupt the stack pointer.
The fix is easy: convert the stack pointer register variable to a global
variable.
It should be noted that the end result is different based on the GCC
version. With GCC 6.4, this patch has exactly the same result as
before:
defconfig defconfig-nofp distro distro-nofp
before 9820389 9491555 8816046 8516940
after 9820389 9491555 8816046 8516940
With GCC 7.2, however, GCC's behavior has changed. It now changes its
behavior based on the conversion of the register variable to a global.
That somehow convinces it to *always* set up the frame pointer before
inserting *any* inline asm. (Therefore, listing the variable as an
output constraint is a no-op and is no longer necessary.) It's a bit
overkill, but the performance impact should be negligible. And in fact,
there's a nice improvement with frame pointers disabled:
defconfig defconfig-nofp distro distro-nofp
before 9796316 9468236 9076191 8790305
after 9796957 9464267 9076381 8785949
So in summary, while listing the stack pointer as an output constraint
is no longer necessary for newer versions of GCC, it's still needed for
older versions.
Suggested-by: Andrey Ryabinin <aryabinin@virtuozzo.com>
Reported-by: Matthias Kaehlcke <mka@chromium.org>
Signed-off-by: Josh Poimboeuf <jpoimboe@redhat.com>
Cc: Alexander Potapenko <glider@google.com>
Cc: Andy Lutomirski <luto@kernel.org>
Cc: Arnd Bergmann <arnd@arndb.de>
Cc: Dmitriy Vyukov <dvyukov@google.com>
Cc: Linus Torvalds <torvalds@linux-foundation.org>
Cc: Miguel Bernal Marin <miguel.bernal.marin@linux.intel.com>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: Thomas Gleixner <tglx@linutronix.de>
Link: http://lkml.kernel.org/r/3db862e970c432ae823cf515c52b54fec8270e0e.1505942196.git.jpoimboe@redhat.com
Signed-off-by: Ingo Molnar <mingo@kernel.org>
2017-09-20 21:24:33 +00:00
|
|
|
ASM_CALL_CONSTRAINT
|
2017-08-02 16:09:15 +00:00
|
|
|
: "A" (control),
|
|
|
|
"b" (input1_hi),
|
2018-01-11 21:46:30 +00:00
|
|
|
THUNK_TARGET(hv_hypercall_pg)
|
2017-08-02 16:09:15 +00:00
|
|
|
: "cc", "edi", "esi");
|
|
|
|
}
|
|
|
|
#endif
|
|
|
|
return hv_status;
|
|
|
|
}
|
|
|
|
|
2017-08-02 16:09:17 +00:00
|
|
|
/*
|
|
|
|
* Rep hypercalls. Callers of this functions are supposed to ensure that
|
|
|
|
* rep_count and varhead_size comply with Hyper-V hypercall definition.
|
|
|
|
*/
|
|
|
|
static inline u64 hv_do_rep_hypercall(u16 code, u16 rep_count, u16 varhead_size,
|
|
|
|
void *input, void *output)
|
|
|
|
{
|
|
|
|
u64 control = code;
|
|
|
|
u64 status;
|
|
|
|
u16 rep_comp;
|
|
|
|
|
|
|
|
control |= (u64)varhead_size << HV_HYPERCALL_VARHEAD_OFFSET;
|
|
|
|
control |= (u64)rep_count << HV_HYPERCALL_REP_COMP_OFFSET;
|
|
|
|
|
|
|
|
do {
|
|
|
|
status = hv_do_hypercall(control, input, output);
|
|
|
|
if ((status & HV_HYPERCALL_RESULT_MASK) != HV_STATUS_SUCCESS)
|
|
|
|
return status;
|
|
|
|
|
|
|
|
/* Bits 32-43 of status have 'Reps completed' data. */
|
|
|
|
rep_comp = (status & HV_HYPERCALL_REP_COMP_MASK) >>
|
|
|
|
HV_HYPERCALL_REP_COMP_OFFSET;
|
|
|
|
|
|
|
|
control &= ~HV_HYPERCALL_REP_START_MASK;
|
|
|
|
control |= (u64)rep_comp << HV_HYPERCALL_REP_START_OFFSET;
|
|
|
|
|
|
|
|
touch_nmi_watchdog();
|
|
|
|
} while (rep_comp < rep_count);
|
|
|
|
|
|
|
|
return status;
|
|
|
|
}
|
|
|
|
|
2017-08-02 16:09:18 +00:00
|
|
|
/*
|
|
|
|
* Hypervisor's notion of virtual processor ID is different from
|
|
|
|
* Linux' notion of CPU ID. This information can only be retrieved
|
|
|
|
* in the context of the calling CPU. Setup a map for easy access
|
|
|
|
* to this information.
|
|
|
|
*/
|
|
|
|
extern u32 *hv_vp_index;
|
2017-10-06 15:48:54 +00:00
|
|
|
extern u32 hv_max_vp_index;
|
2018-03-20 14:02:08 +00:00
|
|
|
extern struct hv_vp_assist_page **hv_vp_assist_page;
|
|
|
|
|
|
|
|
static inline struct hv_vp_assist_page *hv_get_vp_assist_page(unsigned int cpu)
|
|
|
|
{
|
|
|
|
if (!hv_vp_assist_page)
|
|
|
|
return NULL;
|
|
|
|
|
|
|
|
return hv_vp_assist_page[cpu];
|
|
|
|
}
|
2017-08-02 16:09:18 +00:00
|
|
|
|
|
|
|
/**
|
|
|
|
* hv_cpu_number_to_vp_number() - Map CPU to VP.
|
|
|
|
* @cpu_number: CPU number in Linux terms
|
|
|
|
*
|
|
|
|
* This function returns the mapping between the Linux processor
|
|
|
|
* number and the hypervisor's virtual processor number, useful
|
|
|
|
* in making hypercalls and such that talk about specific
|
|
|
|
* processors.
|
|
|
|
*
|
|
|
|
* Return: Virtual processor number in Hyper-V terms
|
|
|
|
*/
|
|
|
|
static inline int hv_cpu_number_to_vp_number(int cpu_number)
|
|
|
|
{
|
|
|
|
return hv_vp_index[cpu_number];
|
|
|
|
}
|
2017-02-04 16:57:13 +00:00
|
|
|
|
2017-01-18 23:45:02 +00:00
|
|
|
void hyperv_init(void);
|
2017-08-02 16:09:19 +00:00
|
|
|
void hyperv_setup_mmu_ops(void);
|
|
|
|
void hyper_alloc_mmu(void);
|
2017-10-29 18:33:41 +00:00
|
|
|
void hyperv_report_panic(struct pt_regs *regs, long err);
|
2017-12-22 18:19:02 +00:00
|
|
|
bool hv_is_hyperv_initialized(void);
|
2017-01-28 19:37:14 +00:00
|
|
|
void hyperv_cleanup(void);
|
2018-01-24 13:23:33 +00:00
|
|
|
|
|
|
|
void hyperv_reenlightenment_intr(struct pt_regs *regs);
|
|
|
|
void set_hv_tscchange_cb(void (*cb)(void));
|
|
|
|
void clear_hv_tscchange_cb(void);
|
|
|
|
void hyperv_stop_tsc_emulation(void);
|
2017-08-02 16:09:13 +00:00
|
|
|
#else /* CONFIG_HYPERV */
|
|
|
|
static inline void hyperv_init(void) {}
|
2017-12-22 18:19:02 +00:00
|
|
|
static inline bool hv_is_hyperv_initialized(void) { return false; }
|
2017-08-02 16:09:13 +00:00
|
|
|
static inline void hyperv_cleanup(void) {}
|
2017-08-02 16:09:19 +00:00
|
|
|
static inline void hyperv_setup_mmu_ops(void) {}
|
2018-01-24 13:23:33 +00:00
|
|
|
static inline void set_hv_tscchange_cb(void (*cb)(void)) {}
|
|
|
|
static inline void clear_hv_tscchange_cb(void) {}
|
|
|
|
static inline void hyperv_stop_tsc_emulation(void) {};
|
2018-03-20 14:02:08 +00:00
|
|
|
static inline struct hv_vp_assist_page *hv_get_vp_assist_page(unsigned int cpu)
|
|
|
|
{
|
|
|
|
return NULL;
|
|
|
|
}
|
2017-08-02 16:09:13 +00:00
|
|
|
#endif /* CONFIG_HYPERV */
|
|
|
|
|
2017-03-03 13:21:40 +00:00
|
|
|
#ifdef CONFIG_HYPERV_TSCPAGE
|
|
|
|
struct ms_hyperv_tsc_page *hv_get_tsc_page(void);
|
2018-01-24 13:23:32 +00:00
|
|
|
static inline u64 hv_read_tsc_page_tsc(const struct ms_hyperv_tsc_page *tsc_pg,
|
|
|
|
u64 *cur_tsc)
|
2017-03-03 13:21:41 +00:00
|
|
|
{
|
2018-01-24 13:23:32 +00:00
|
|
|
u64 scale, offset;
|
2017-03-03 13:21:41 +00:00
|
|
|
u32 sequence;
|
|
|
|
|
|
|
|
/*
|
|
|
|
* The protocol for reading Hyper-V TSC page is specified in Hypervisor
|
|
|
|
* Top-Level Functional Specification ver. 3.0 and above. To get the
|
|
|
|
* reference time we must do the following:
|
|
|
|
* - READ ReferenceTscSequence
|
|
|
|
* A special '0' value indicates the time source is unreliable and we
|
|
|
|
* need to use something else. The currently published specification
|
|
|
|
* versions (up to 4.0b) contain a mistake and wrongly claim '-1'
|
|
|
|
* instead of '0' as the special value, see commit c35b82ef0294.
|
|
|
|
* - ReferenceTime =
|
|
|
|
* ((RDTSC() * ReferenceTscScale) >> 64) + ReferenceTscOffset
|
|
|
|
* - READ ReferenceTscSequence again. In case its value has changed
|
|
|
|
* since our first reading we need to discard ReferenceTime and repeat
|
|
|
|
* the whole sequence as the hypervisor was updating the page in
|
|
|
|
* between.
|
|
|
|
*/
|
|
|
|
do {
|
|
|
|
sequence = READ_ONCE(tsc_pg->tsc_sequence);
|
|
|
|
if (!sequence)
|
|
|
|
return U64_MAX;
|
|
|
|
/*
|
|
|
|
* Make sure we read sequence before we read other values from
|
|
|
|
* TSC page.
|
|
|
|
*/
|
|
|
|
smp_rmb();
|
|
|
|
|
|
|
|
scale = READ_ONCE(tsc_pg->tsc_scale);
|
|
|
|
offset = READ_ONCE(tsc_pg->tsc_offset);
|
2018-01-24 13:23:32 +00:00
|
|
|
*cur_tsc = rdtsc_ordered();
|
2017-03-03 13:21:41 +00:00
|
|
|
|
|
|
|
/*
|
|
|
|
* Make sure we read sequence after we read all other values
|
|
|
|
* from TSC page.
|
|
|
|
*/
|
|
|
|
smp_rmb();
|
|
|
|
|
|
|
|
} while (READ_ONCE(tsc_pg->tsc_sequence) != sequence);
|
|
|
|
|
2018-01-24 13:23:32 +00:00
|
|
|
return mul_u64_u64_shr(*cur_tsc, scale, 64) + offset;
|
|
|
|
}
|
|
|
|
|
|
|
|
static inline u64 hv_read_tsc_page(const struct ms_hyperv_tsc_page *tsc_pg)
|
|
|
|
{
|
|
|
|
u64 cur_tsc;
|
|
|
|
|
|
|
|
return hv_read_tsc_page_tsc(tsc_pg, &cur_tsc);
|
2017-03-03 13:21:41 +00:00
|
|
|
}
|
|
|
|
|
2017-03-03 13:21:40 +00:00
|
|
|
#else
|
|
|
|
static inline struct ms_hyperv_tsc_page *hv_get_tsc_page(void)
|
|
|
|
{
|
|
|
|
return NULL;
|
|
|
|
}
|
2018-01-24 13:23:32 +00:00
|
|
|
|
|
|
|
static inline u64 hv_read_tsc_page_tsc(const struct ms_hyperv_tsc_page *tsc_pg,
|
|
|
|
u64 *cur_tsc)
|
|
|
|
{
|
|
|
|
BUG();
|
|
|
|
return U64_MAX;
|
|
|
|
}
|
2017-03-03 13:21:40 +00:00
|
|
|
#endif
|
2010-05-06 19:08:41 +00:00
|
|
|
#endif
|