On Wed, Jun 24, 2015 at 07:55:44AM +0530, Bharata B Rao wrote: > On Tue, Jun 23, 2015 at 11:54:29AM +1000, David Gibson wrote: > > On Fri, Jun 19, 2015 at 03:47:55PM +0530, Bharata B Rao wrote: > > > Parse ibm,architecture.vec table obtained from the guest and enable > > > memory node configuration via ibm,dynamic-reconfiguration-memory if guest > > > supports it. This is in preparation to support memory hotplug for > > > sPAPR guests. > > > > > > This changes the way memory node configuration is done. Currently all > > > memory nodes are built upfront. But after this patch, only memory@0 node > > > for RMA is built upfront. Guest kernel boots with just that and rest of > > > the memory nodes (via memory@XXX or ibm,dynamic-reconfiguration-memory) > > > are built when guest does ibm,client-architecture-support call. > > > > > > Note: This patch needs a SLOF enhancement which is already part of > > > SLOF binary in QEMU. > > > > > > Signed-off-by: Bharata B Rao <[email protected]> > > > > [snip] > > > +int spapr_h_cas_compose_response(sPAPRMachineState *spapr, > > > + target_ulong addr, target_ulong size, > > > + bool cpu_update, bool memory_update) > > > +{ > > > + void *fdt, *fdt_skel; > > > + sPAPRDeviceTreeUpdateHeader hdr = { .version_id = 1 }; > > > + > > > + size -= sizeof(hdr); > > > + > > > + /* Create sceleton */ > > > + fdt_skel = g_malloc0(size); > > > + _FDT((fdt_create(fdt_skel, size))); > > > + _FDT((fdt_begin_node(fdt_skel, ""))); > > > + _FDT((fdt_end_node(fdt_skel))); > > > + _FDT((fdt_finish(fdt_skel))); > > > + fdt = g_malloc0(size); > > > + _FDT((fdt_open_into(fdt_skel, fdt, size))); > > > + g_free(fdt_skel); > > > + > > > + /* Fixup cpu nodes */ > > > + if (cpu_update) { > > > + _FDT((spapr_fixup_cpu_dt(fdt, spapr))); > > > + } > > > > The cpu_update parameter seems like its not related to memory hotplug > > at all. I'm guessing it relates to CPU hotplug, in which case please > > defer it until those patches are ready to go. > > This change isn't related to cpu hotplug. Earlier this compose response > routine only did CPU device tree fixup based on some conditions. I have > enabled it to check for availability DRCONF_MEMORY feature and accordingly > fixup memory DT. So this change just checks if cpu fixup is necessary > or not. Essentially we aren't changing any behaviour wrt cpu dt > fixup here.
Hm, ok. Would there be any problem with just unconditionally doing
both fixups? This is about as far from a hot path as its possible to
get.
>
> >
> > > +
> > > + /* Generate memory nodes or ibm,dynamic-reconfiguration-memory node
> > > */
> > > + if (memory_update && spapr->dr_lmb_enabled) {
> > > + _FDT((spapr_populate_drconf_memory(spapr, fdt)));
> > > + } else {
> > > + _FDT((spapr_populate_memory(spapr, fdt)));
> > > + }
> > > +
> > > + /* Pack resulting tree */
> > > + _FDT((fdt_pack(fdt)));
> > > +
> > > + if (fdt_totalsize(fdt) + sizeof(hdr) > size) {
> > > + trace_spapr_cas_failed(size);
> > > + return -1;
> > > + }
> > > +
> > > + cpu_physical_memory_write(addr, &hdr, sizeof(hdr));
> > > + cpu_physical_memory_write(addr + sizeof(hdr), fdt,
> > > fdt_totalsize(fdt));
> > > + trace_spapr_cas_continue(fdt_totalsize(fdt) + sizeof(hdr));
> > > + g_free(fdt);
> > > +
> > > + return 0;
> > > +}
> > > +
> > > static void spapr_finalize_fdt(sPAPRMachineState *spapr,
> > > hwaddr fdt_addr,
> > > hwaddr rtas_addr,
> > > @@ -756,10 +866,16 @@ static void spapr_finalize_fdt(sPAPRMachineState
> > > *spapr,
> > > /* open out the base tree into a temp buffer for the final tweaks */
> > > _FDT((fdt_open_into(spapr->fdt_skel, fdt, FDT_MAX_SIZE)));
> > >
> > > - ret = spapr_populate_memory(spapr, fdt);
> > > - if (ret < 0) {
> > > - fprintf(stderr, "couldn't setup memory nodes in fdt\n");
> > > - exit(1);
> > > + /*
> > > + * Add memory@0 node to represent RMA. Rest of the memory is either
> > > + * represented by memory nodes or ibm,dynamic-reconfiguration-memory
> > > + * node later during ibm,client-architecture-support call.
> > > + */
> > > + for (i = 0; i < nb_numa_nodes; ++i) {
> > > + if (numa_info[i].node_mem) {
> > > + spapr_populate_memory_node(fdt, i, 0, spapr->rma_size);
> > > + break;
> > > + }
> >
> > ?? The code doesn't seem to match the comment - you appear to be
> > creating a memory@0 node for every NUMA node, not just for the RMA,
> > which doesn't make much sense.
>
> I have a break there to ensure memory@0 is created only once from the 1st
> memory-less node. I am slightly changing this in next version to ensure
> that this works correctly even when -numa isn't specified.
Ah, sorry, I missed the break;. That should be ok then.
--
David Gibson | I'll have my music baroque, and my code
david AT gibson.dropbear.id.au | minimalist, thank you. NOT _the_ _other_
| _way_ _around_!
http://www.ozlabs.org/~dgibson
pgpTx6Ha6apuY.pgp
Description: PGP signature
